xorsuyash / docparser Goto Github PK
View Code? Open in Web Editor NEWThis project forked from shrivastava95/docparser
A multilingual document parser that processes PDFs. Built using Google's open source Tesseract OCR, and OpenAI's CLIP (Contrastive Language Image Pretraining).
License: MIT License