Comments (4)
Hi,
First of all, this project focuses on recognizing cursive handwriting and evaluating different ML models.
In order to use these models, you first have to extract the images of words (names) from your papers. Then you have to standardize these images and feed them into the pre-trained ML model. To understand this process, I would recommend looking at OCR-Evaluator.ipynb. I would recommend using the CTC model. For this model, images are split into smaller slices, which are then fed into the model. The model then outputs the sequence of letters.
One of the problems is that the current models are trained on my own handwriting, so you will probably need to find a better dataset and train the model yourself.
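To make the slicing and decoding steps more concrete, here is a minimal sketch in plain NumPy. It is not the code from this repository: the function names `slice_word_image` and `ctc_greedy_decode`, the slice width and step, and the convention that class 0 is the CTC blank are all assumptions chosen for illustration. The first function cuts a normalized word image into overlapping vertical slices, and the second turns per-slice class scores into letters by taking the best class per slice, collapsing repeats, and dropping blanks.

```python
import numpy as np


def slice_word_image(img, slice_width=8, step=4):
    """Cut a (height, width) grayscale image into overlapping vertical slices.

    slice_width and step are illustrative values, not the project's settings.
    Returns an array of shape (num_slices, height, slice_width).
    """
    height, width = img.shape
    if width < slice_width:
        # Pad narrow images on the right with zeros so one slice fits.
        img = np.pad(img, ((0, 0), (0, slice_width - width)))
        width = slice_width
    starts = range(0, width - slice_width + 1, step)
    return np.stack([img[:, s:s + slice_width] for s in starts])


def ctc_greedy_decode(frame_scores, alphabet, blank=0):
    """Greedy CTC decoding of a (num_slices, num_classes) score matrix.

    Assumes class `blank` is the CTC blank and class k maps to alphabet[k - 1].
    """
    best = np.argmax(frame_scores, axis=1)
    chars = []
    prev = blank
    for idx in best:
        # Emit a letter only when it is not blank and not a repeat.
        if idx != blank and idx != prev:
            chars.append(alphabet[idx - 1])
        prev = idx
    return "".join(chars)


if __name__ == "__main__":
    slices = slice_word_image(np.zeros((64, 20)))
    print(slices.shape)
    # Hypothetical per-slice scores standing in for real model output.
    scores = np.eye(3)[[1, 1, 0, 1, 2, 2, 0]]
    print(ctc_greedy_decode(scores, "ab"))
```

In a real pipeline the scores would come from the trained network rather than a hand-built matrix, and a beam search decoder may give better results than the greedy one shown here.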
from handwriting-ocr.
Hey LogX7, what happened?
Did the code work?
I think some models are missing from the directory.
If you have successfully executed the code, please share how you did it.
from handwriting-ocr.
Let's assume there are 100 employees in a company who will submit handwritten documents. Does that mean I need to collect handwriting samples from all 100 individuals to train the model?
from handwriting-ocr.
The problem right now is that my training dataset is quite small (around 5k words), so the program won't work well on other people's handwriting. Having samples from all 100 individuals would give the best results, but I believe that if I create a large enough training dataset, the model will generalize to others' handwriting.
One of the main goals right now is to find an effective way of building a dataset and to make it as large as possible.
from handwriting-ocr.