Create a script for transforming PDF to text #1
Labels
good first issue
Good for newcomers
hacktoberfest 🍁
https://hacktoberfest.digitalocean.com/
help wanted
Extra attention is needed
add in src/papers/data/ a script that transforms the papers (in data/papers/raw) from a pdf to a text.
The outcome should be saved in data/papers/interim/ folder.
Use pdfminer.six or any other service to do so.
Optional: use a pre-made docker container as a service for it.
The text was updated successfully, but these errors were encountered: