Welcome to CERMINE - Content ExtRactor and MINEr
About the service
CERMINE is a Java library and a web service for extracting metadata and content from scientific articles in born-digital form. The system analyses the content of a PDF file and attempts to extract information such as:
- Title of the article
- Journal information (title, etc.)
- Bibliographic information (volume, issue, page numbers, etc.)
- Authors and affiliations
- Keywords
- Abstract
- Bibliographic references
Limitations
This is an experimental service, and result may be not accurate. Uploaded file will be used only for metadata extraction, we do not store uploaded files. Accepted file format - *.pdf, maximum file size is 25 MB.
License
CERMINE is licensed under GNU Affero General Public License version 3.