MILE lab, IISc's Projects

androidscannerdemo

ScanLibrary is an Android document-scanning library built on top of OpenCV. Using the app, you can select the exact edges of a document, crop it along the four selected edges, and apply a perspective transformation to the cropped image.
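To illustrate the perspective transformation mentioned above, here is a minimal sketch in plain Python. It is not ScanLibrary's code (the library relies on OpenCV on Android); it assumes the four selected edges meet at four corner points and solves for the 3x3 homography that maps those corners onto an upright rectangle. The names `solve_linear`, `homography`, and `warp_point`, and the sample coordinates, are hypothetical.

```python
def solve_linear(a, b):
    """Solve a*x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("degenerate corner configuration")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                f = m[r][col] / m[col][col]
                for c in range(col, n + 1):
                    m[r][c] -= f * m[col][c]
    return [m[i][n] / m[i][i] for i in range(n)]


def homography(src, dst):
    """Return the 3x3 matrix mapping each src corner to its dst corner."""
    a, b = [], []
    for (x, y), (u, v) in zip(src, dst):
        # u = (h0*x + h1*y + h2) / (h6*x + h7*y + 1), same pattern for v
        a.append([x, y, 1, 0, 0, 0, -u * x, -u * y])
        b.append(u)
        a.append([0, 0, 0, x, y, 1, -v * x, -v * y])
        b.append(v)
    h = solve_linear(a, b)
    return [h[0:3], h[3:6], [h[6], h[7], 1.0]]


def warp_point(h, x, y):
    """Apply the homography to one point (with the projective divide)."""
    w = h[2][0] * x + h[2][1] * y + h[2][2]
    u = (h[0][0] * x + h[0][1] * y + h[0][2]) / w
    v = (h[1][0] * x + h[1][1] * y + h[1][2]) / w
    return u, v


# Hypothetical corners picked by the user (clockwise from top-left)
# and the upright 100x150 rectangle they should map to.
corners = [(10, 20), (200, 30), (220, 300), (5, 280)]
target = [(0, 0), (100, 0), (100, 150), (0, 150)]
H = homography(corners, target)
```

Warping a whole image would apply the same mapping to every pixel (usually the inverse mapping with interpolation), which is the part an image library such as OpenCV handles.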

crnn

Convolutional recurrent neural network for scene text recognition or OCR in Keras

degradedwordskannada

Benchmarking dataset of degraded word images (with character splits) in Kannada along with their associated ground truth Unicode text

kannada-ocr-test-images-with-ground-truth

This Kannada OCR benchmarking dataset contains 250 images, carefully chosen to have various kinds of recognition challenges. Some of the pages have italics and bold characters. Some of them have Halegannada poems and text; others are letterpress-printed pages, where the vowel modifiers appear as separate symbols and do not touch the consonants they go with. Some pages have interspersed English words; still others have tables with a lot of numeric data. In addition, there are old pages containing either a lot of broken characters or many words with two or more characters merged into a single connected component.

mergedsymbolskannada

Benchmarking dataset of merged symbols in Kannada along with their associated ground truth Unicode text

mile-transliterator

A browser plugin for Google Chrome that instantly transliterates a web page written in any Indic script to Kannada. The plugin exploits the parallel layout of the Indic Unicode blocks and also uses a rule-based approach to transliterate web pages to Kannada. This enables a polyglot user to read online documents in other Indic scripts through the Kannada script. Currently, it supports transliteration from Tamil, Telugu, Malayalam, Bangla, Gujarati, Odiya, Punjabi, Sanskrit and Hindi pages. Transliteration quality was rated by 45 users on a scale of 1 to 5, with a mean opinion score of 4.6.
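The Unicode block parallelism mentioned above can be sketched as follows. This is a minimal illustration in Python, not the plugin's actual code, and it leaves out the rule-based part entirely. It assumes that a character at a given offset inside one Indic block corresponds to the character at the same offset inside the Kannada block (U+0C80 to U+0CFF); the function name `to_kannada` and the script keys are hypothetical.

```python
import unicodedata

KANNADA_BASE = 0x0C80
BLOCK_SIZE = 0x80

# Start of each source script's Unicode block.
SOURCE_BASES = {
    "devanagari": 0x0900,  # Hindi, Sanskrit
    "bengali": 0x0980,
    "gurmukhi": 0x0A00,    # Punjabi
    "gujarati": 0x0A80,
    "oriya": 0x0B00,
    "tamil": 0x0B80,
    "telugu": 0x0C00,
    "malayalam": 0x0D00,
}


def to_kannada(text, source_script):
    """Shift characters of the source block to the same offset in Kannada."""
    base = SOURCE_BASES[source_script]
    out = []
    for ch in text:
        offset = ord(ch) - base
        if 0 <= offset < BLOCK_SIZE:
            candidate = chr(KANNADA_BASE + offset)
            # Keep the original character if Kannada has nothing assigned
            # at this offset (a real tool would need a rule here).
            if unicodedata.name(candidate, ""):
                out.append(candidate)
                continue
        out.append(ch)
    return "".join(out)


print(to_kannada("कमल", "devanagari"))  # ಕಮಲ
```

Characters outside the source block (Latin text, digits, punctuation) pass through unchanged. Letters with no Kannada counterpart at the same offset are where a rule-based step, as the description mentions, would have to take over.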

tuludocuments

OCR dataset of scanned pages of Tulu books along with their ground truth text

wikiclean

A Java Wikipedia markup to plain text converter
