This project provides a plugin interface for Apache PDFBox. Through it, PDFBox can use third-party OCR libraries such as Tesseract to perform OCR.
jahewson/ocr-plugin (forked from dimuthuupe/ocr-plugin)
OCR Plugin for PDFBox (GSoC 2014)