CERMINE is a Java library and a web service (cermine.ceon.pl) for extracting metadata and content from PDF files containing academic publications. CERMINE is written in Java at Centre for Open Science at Interdisciplinary Centre for Mathematical and Computational Modelling, University of Warsaw.
The code is licensed under GNU Affero General Public License version 3.
How to cite CERMINE:
D. Tkaczyk, L. Bolikowski, A. Czeczko, and K. Rusek. A modular metadata extraction system for born-digital articles. In 10th IAPR International Workshop on Document Analysis Systems, pages 11โ16, 2012.