Git Product home page Git Product logo

Comments (11)

Breta01 avatar Breta01 commented on July 17, 2024

It can't right now, because I don't have the dataset for it. But there is a lot of projects using MNIST dataset for identifying handwritten numbers.

from handwriting-ocr.

juparrrr avatar juparrrr commented on July 17, 2024

Thank you for your reply. I'm doing a test paper writing recognition system. Before that, I used CNN neural network to train data, but the problem of text segmentation encountered difficulties. So this project needs text segmentation?

from handwriting-ocr.

Breta01 avatar Breta01 commented on July 17, 2024

It depends on recognition method. In one approach, I test segmentation of text by bidirectional RNN and then classifying individual characters using CNN. I also test classification using CTC which process images of whole words (could be transformed to process whole lines of words).

from handwriting-ocr.

juparrrr avatar juparrrr commented on July 17, 2024

What are the requirements for Bi-RNN and CNN for data training set? Need a single-letter data set?

from handwriting-ocr.

Breta01 avatar Breta01 commented on July 17, 2024

Yes, for the CNN you need single-letter dataset. For Bi-RNN you need dataset containing images of whole words along with text files containing positions of lines separating individual letters. If you have the words already, you can use WordClassDM.py for manual creation of letter separating lines.

from handwriting-ocr.

juparrrr avatar juparrrr commented on July 17, 2024

Before I mentioned the project that I wanted to do, if I use OCR.ipynb to do the recognition, then I need to use two models, the models trained by CharClassifier.ipynb to identify, and the models trained by GapClassifier-BiRNN.ipynb to cut it? ?
But both of these are trained from data/words2/ reading data? Now that I have some handwritten letter training sets and word training sets in my hands, can I handle these with WordClassDM.py and do training sets?
Can you explain the work done by GapClassifier-BiRNN-Attention.ipynb, GapClassifier-Attention-RNN.ipynb, GapClassification.ipynb, GapClassification-CharClass.ipynb?
I'm sorry for delaying your time, but for the first time I did a project on identification, I didn't understand that it was too much. @Breta01

from handwriting-ocr.

juparrrr avatar juparrrr commented on July 17, 2024

Is the Gap-Classifier used in OCR.ipynb the Classifier-BiRNN.ipynb model?

from handwriting-ocr.

Breta01 avatar Breta01 commented on July 17, 2024

Ok, yes you need two models and in OCR.ipynb is used the Classifier-BiRNN.ipynb.

I train both models from data/words2/ because it contains images of words along with files which contains the positions of gaplines. For CharClassifier I just cut out those separated letters... If you have letter and word training sets, you can process word set with WordClassDM.py (need manual work) and than train the two models. (If you have individual letters, you can possibly create artificial words for training with already know positions of gaplines, but I don't have code for that.)

GapClassifier-BiRNN-Attention.ipynb, GapClassifier-Attention-RNN.ipynb, GapClassification-CharClass.ipynb, and GapClassifier.ipynb are only experimental models which don't perform so good. Just skip those files.

GapClassification.ipynb deomstrates the process of separation of characters, but final code for separation is in ocr/charSeg.py.

from handwriting-ocr.

juparrrr avatar juparrrr commented on July 17, 2024

This problem is encountered in running WordClassDM.py:
Traceback (most recent call last):
File "E:/Jupar/handwriting-ocr-master/WordClassDM.py", line 218, in
Cycler(args.index, args.data, args.save)
File "E:/Jupar/handwriting-ocr-master/WordClassDM.py", line 76, in init
self.blockLoad()
File "E:/Jupar/handwriting-ocr-master/WordClassDM.py", line 118, in blockLoad
self.data_loc, self.org_idx + self.idx, 100)
File "E:/Jupar/handwriting-ocr-master/WordClassDM.py", line 50, in loadImages
printProgressBar(i - idx, upper - idx - 1)
File "E:\Jupar\handwriting-ocr-master\ocr\viz.py", line 20, in printProgressBar
percent = ("{0:." + str(decimals) + "f}").format(100 * (iteration / float(total)))
ZeroDivisionError: float division by zero

Process finished with exit code 1
No changes to the words_raw data.

from handwriting-ocr.

Breta01 avatar Breta01 commented on July 17, 2024

The printProgressBar() is just for visualisation of loading, you can remove it.
It looks like the -1 shoudn't be there.

from handwriting-ocr.

juparrrr avatar juparrrr commented on July 17, 2024

Thanks for your help, your program is very good

from handwriting-ocr.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.