bertsky Goto Github PK
Name: Robert Sachunsky
Type: User
Name: Robert Sachunsky
Type: User
convert PubLayNet data into METS/PAGE-XML
Automatically fix PAGE-XML order inconsistencies in regions, lines and words
Evaluate plausibility of a page's segmentation
Run tesseract with the tesserocr bindings with @OCR-D's interfaces
Demo processor to illustrate OCR-D Python API
OCR-D wrapper for arbitrary coords-preserving image operations
Python-based tools for document analysis and OCR
Fork of the project with patches by @bertsky
A suite of batches and tools for OCR tasks.
Page to PAGE Layout Analysis Tool
PAGE XML format collection for document image page content and more
Text page dewarping using a "cubic sheet" model
Synthesizing and manipulating 2048x1024 images with conditional GANs
Core libraries by the PRImA Research Lab
Java based viewer for PAGE XML files (layout + text content). Also supports ALTO XML, FineReader XML, and HOCR.
Automatically exported from code.google.com/p/pylepthonica
python bindings for LSD - Line Segment Detector.
Binarize document images
Detect textlines in document images
Sequence to Sequence Learning with Keras
Computation using data flow graphs for scalable machine learning
Tesseract Open Source OCR Engine (main repository)
A Python wrapper for the tesseract-ocr API
Train Tesseract LSTM with make
This repository save the stylesheet and workaround for transforming the properitary PAGE XML file from Transkribus (https://transkribus.eu/Transkribus) into a PAGE XML valid format (https://www.primaresearch.org/schema/PAGE/gts/pagecontent/ newest version from 2019-07-16
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.