Git Product home page Git Product logo

Comments (3)

Belval avatar Belval commented on May 12, 2024

Added some tests for the most used functions. I'll leave the issue open until I satisfied with the coverage.

Current tests are:

  • String from wikipedia random
  • String from test file
  • String from dictionnary
  • Generate image with specified format (64)
  • Generate image with specified extension (.png)
  • Generate image with specified skewing (15)
  • Generate image with specified blur (3)
  • Generate gaussian background
  • Generate plain white backgroung

from textrecognitiondatagenerator.

William-Stocks avatar William-Stocks commented on May 12, 2024

Hi,thanks a lot for your wonderful work, there are many excellent ideas in your codes.
Something confused me, may I have a question?
1,The results are only pictures, I wonder why a file with pictures` name and label are not given. I think they are important in a OCR Deep Learning project.
2, How can I generating Samples with Text in non-Latin (English) Scripts, like Chinese or Japanese.
I have read the programs(JarveeLee/SynthText_Chinese_version, gachiemchiep/SynthText),they are not so concise and easy to understand .

from textrecognitiondatagenerator.

Belval avatar Belval commented on May 12, 2024

I am glad that you like it!

  1. The label of the image is in the filename. If the image contains "TEST", the image filename will be TEST_#.jpg. If you would rather have the labels in a file I suppose that could be arranged but I see the filename as a more user-friendly (and simple to understand) solution.

  2. Excellent question. I never really spent time on it because it was not my usecase. If you believe this would be an interesting feature for you I can add it.

Also, this is the issue for tests, please see #4

from textrecognitiondatagenerator.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.