Content ExtRactor and MINEr

Welcome to CERMINE - Content ExtRactor and MINEr

Upload PDF file

Upload a PDF file containing scientific article:

Or process one of the example files: Example #1 (PDF), Example #2 (PDF), Example #3 (PDF)

About the service

CERMINE is a Java library and a web service for extracting metadata and content from scientific articles in born-digital form. The system analyses the content of a PDF file and attempts to extract information such as:

  • Title of the article
  • Journal information (title, etc.)
  • Bibliographic information (volume, issue, page numbers, etc.)
  • Authors and affiliations
  • Keywords
  • Abstract
  • Bibliographic references


This is an experimental service, and result may be not accurate. Uploaded file will be used only for metadata extraction, we do not store uploaded files. Accepted file format - *.pdf, maximum file size is 25 MB.


CERMINE is licensed under GNU Affero General Public License version 3.