Git Product home page Git Product logo

Comments (4)

Breta01 avatar Breta01 commented on July 17, 2024

Hi,
First of all, this project focus on recognition of cursive handwriting and evaluation of different ML models.
In other to use these models you first have to get the images of words (names) from your papers. Then you have to standardize these images and feed them into the pre-trained ML model. To understand this process I would recommend to look on OCR-Evaluator.ipynb. I would recommend you to use the CTC model. For this model images are split into smaller slices which are than feed into the model. The model then outputs the sequence of letters.

One of the problems is that the current models are trained on my handwriting and you will probably need to find some better dataset for training and train the model on your own.

from handwriting-ocr.

yasersakkaf avatar yasersakkaf commented on July 17, 2024

Hey LogX7, what happened?
Did the code work?
I think some models are missing from the directory.
If you have successfully executed the code then please share how you did it

from handwriting-ocr.

shantanuo avatar shantanuo commented on July 17, 2024

Let's assume there are 100 employees in a company who will submit handwritten documents. Does it mean that I need to collect sample handwriting of all 100 individuals to train the model?

from handwriting-ocr.

Breta01 avatar Breta01 commented on July 17, 2024

The problem right now is that my training dataset is quite small (around 5k words). Therefore, the program won't work on handwriting of others. It would be the best solution to have samples from all 100 individuals to produce the best results, but I believe that if create a training dataset large enough, it will generalize for others' handwriting.

One of the main goals right now is to find an effective way of creating dataset and to create a dataset large as possible.

from handwriting-ocr.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.