ckorzen / pdf-text-extraction-benchmark Goto Github PK
View Code? Open in Web Editor NEWA project about benchmarking and evaluating existing PDF extraction tools on their semantic abilities to extract the body texts from PDF documents, especially from scientific articles.
License: MIT License