Git Product home page Git Product logo

tesseract_opencv-ocr-and-text-recognition's Introduction

Hi,

In my manual-procedure : https://github.com/stefanache/using-tesseract-ocr-python

seen that can create one pipeline with scope to detect the text from images with good accurance.

So in this procedure will discuss about even of "Using OpenCV's EAST for text-recognition(OCR) with tesseract"

Remember that :

    This EAST method(https://www.pyimagesearch.com/2018/08/20/opencv-text-detection-east-text-detector/)
    will be part of this pipeline:

    Input image -> OpenCV's EAST(Text Detector) -> Extract Text ROIs -> Tesseract v.4 OCR with LSTM(text recognizer) -> OpenCV+Tesseract OCR Results

Ref from

    https://www.pyimagesearch.com/2018/09/17/opencv-ocr-and-text-recognition-with-tesseract/
    https://github.com/efviodo/opencv-text-recognition
    
    exist and web opencv OCR: 
    https://github.com/tamaUdon/Web_OCR_app/tree/master/app
    
    and can use google search for others with this phrase: 
          "github adrian rosebrock ocr Tesseract"
    or with phrase wich include specific file which contain the pretrained model:
          "frozen_east_text_detection.pb"

Remember:

       1. install the Open CV :   https://github.com/stefanache/install_OpenCV4_on_w10Pro
       2. install tesseract:      https://github.com/stefanache/install_Tesseract4_for_Python3.7
          and using it            https://github.com/stefanache/using-tesseract-ocr-python

But not discuss about the versions wich need to have to resolve the compatibility and finally

to can have one robust solution.

For that if you consult the https://github.com/efviodo/opencv-text-recognition then see this file

      requirement.txt with this content:
      
            pillow==7.0.0
            pytesseract==0.3.2
            imutils==0.5.3
            opencv-python==4.2.0.32

this file can be used to ensure that have all these versions with command :

      pip install -r requirements.txt

From the my github repository root directory missed the pretrained model(frozen_east_text_detection.pb)

because is large/big file but you can download from https://github.com/oyyd/frozen_east_text_detection.pb

Also I created the batch file(text_recognition_RUN.bat) for my OS(windows 10 pro) to can have idea about

how using the text_recognition.py with python and how install/update the requirement.txt

To run this batch call from windows cmd.exe and then simply type the command:

    C:\Users\{user}> text_recognition_RUN.bat

In fact you can find and download the entire project at https://github.com/efviodo/opencv-text-recognition

My repository contain in plus the above batch file (text_recognition_RUN.bat) which contain some calls for

this efviodo's tutorial

See also:

    https://medium.com/@tomhoag/opencv-text-detection-548950e3494c
    https://books.google.ro/books?id=hbSeDwAAQBAJ&pg=PA203&lpg=PA203&dq=frozen_east_text_detection.pb&source=bl&ots=WUw_6b5vdi&sig=ACfU3U1ycRwdcrbLBTiyrOD-r-1gmdCK0w&hl=ro&sa=X&ved=2ahUKEwjikeGT46boAhU2aRUIHe8ZA84Q6AEwB3oECAoQAQ#v=onepage&q=frozen_east_text_detection.pb&f=false

tesseract_opencv-ocr-and-text-recognition's People

Contributors

stefanache avatar

Stargazers

 avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.