Skip to content

A python service to retrieve metadata extract keywords from scientific papers

Notifications You must be signed in to change notification settings

jsmatias/aiod-paper-metadata-extractor

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

AIoD Paper Metadata Extractor

Extracts metadata from PDF files.

  1. First it gets the DOI from the text using regex
  2. Makes a get request to an API to retrieve the metadata of this specific paper.
  3. Cross validates the DOI by matching the retrieved title with the text content of the PDF
  4. Tries to extract the key words from different sources in this order: a. From the PDF metadata b. From the text itself using a regex pattern

About

A python service to retrieve metadata extract keywords from scientific papers

Topics

Resources

Stars

Watchers

Forks