Git Product home page Git Product logo

invoice-automatic-scanning-and-text-predictions's Introduction

Invoice-automatic-scanning-and-text-predictions

BUSINESS OVERVIEW:

People have been relying on paper invoices for a very long time but these days, all have become digital and so are invoices. Reconciling digital invoices is currently a man-made job but this can be automated to avoid men spending hours on browsing through several invoices and noting things down in a ledger. This project deals with detecting three important classes from the invoices. They are:

  1. Invoice number
  2. Billing Date
  3. Total amount

APPLICATIONS:

This project can be applicable to any field that is currently going through all the bills manually to jot it down in a ledger.

TECH STACK:

Language: Python Object detection : YOLO V4 Text Recognition : Tesseract OCR LSTM Environment: Google Colab

APPROACH:

  1. Setting up and Installation to run Yolov4 Downloading AlexeyAB's famous repository and we will be adjusting the Makefile to enable OPENCV and GPU for darknet and then build darknet.
  2. Downloading pre-trained YOLOv4 weights YOLOv4 has been trained already on the coco dataset which has 80 classes that it can predict. We will take these pretrained weights to see how it gives result on some of the images.
  3. Creating display functions to display the predicted class
  4. Data collection and Labeling with LabelImg To create a custom object detector, we need a good dataset of images and labels for better training of the model so we will label our images using annotation tools LabelImg
  5. Configuring Files for Training This step involves configuring custom .cfg, obj.data, obj.names, train.txt and test.txt files. a. Configuring all the needed variables based on class in the config file b. Creating obj.names and obj.data files i. obj.names : Classes to be detected ii. obj.data: c. Configuring train.txt and test.txt
  6. Download pre-trained weights for the convolutional layers
  7. Training Custom Object Detector
  8. Evaluating the model using Mean Average precision
  9. Predict image classes and save the co-ordinates separately
  10. Detecting text from the predicted class a. Importing pytesseract and setting environment variable for english trained data b. Getting list of predicted files from the directory c. Using tesseract pretrained LSTM model to extract the text d. Fine tuning the LSTM model

invoice-automatic-scanning-and-text-predictions's People

Contributors

vineet1409 avatar

Stargazers

 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.