text-gen-arxiv-papers

These are the raw files for the GitHub Pages site: https://arnicas.github.io/text-gen-arxiv-papers.

The code is kind of a nightmare, but it is gradually being cleaned up and checked in. Basically, I do most of the processing manually using pandas, since Jekyll is pretty bad at what I needed. It pretty much needs a giant refactor.

The file scrape.py contains the search strings and saves a pickle of the latest data from arXiv.
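As a rough illustration of the "save a pickle" step, here is a minimal sketch. It is not the code in scrape.py: the search strings, the record fields, and the helper names `save_results` and `load_results` are all hypothetical, and it uses the standard-library `pickle` module on a plain list of dicts to stay dependency-free, whereas the real pickle may well hold a pandas object.

```python
import pickle
from pathlib import Path

# Hypothetical search strings; the real ones live in scrape.py.
SEARCH_STRINGS = ["text generation", "story generation"]


def save_results(records, path):
    """Pickle a list of paper records to `path` and return it as a Path."""
    path = Path(path)
    with path.open("wb") as fh:
        pickle.dump(records, fh)
    return path


def load_results(path):
    """Read back whatever `save_results` wrote."""
    with Path(path).open("rb") as fh:
        return pickle.load(fh)
```

Only load pickles you created yourself, since unpickling untrusted data can execute arbitrary code.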

The file build_pages.py takes the pickle as an argument and processes it. It also requires certain files and directories to be in place. I'll try to document more and clean it up for reuse.
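Until that is documented, here is a rough sketch of what "takes the pickle as an argument" could look like. Everything in it is an assumption rather than the actual build_pages.py: the argument name `pickle_path`, the `published` field, and the grouping by year are made up for illustration, and the pickle is assumed to hold a list of dicts.

```python
import argparse
import pickle
from collections import defaultdict


def parse_args(argv=None):
    """Parse the command line; the single positional argument is the pickle path."""
    parser = argparse.ArgumentParser(description="Build pages from a scraped pickle.")
    parser.add_argument("pickle_path", help="path to the pickle written by the scraper")
    return parser.parse_args(argv)


def group_by_year(records):
    """Group records by the first four characters of a hypothetical 'published' date string."""
    groups = defaultdict(list)
    for rec in records:
        groups[rec["published"][:4]].append(rec)
    return dict(groups)


def main(argv=None):
    args = parse_args(argv)
    with open(args.pickle_path, "rb") as fh:
        records = pickle.load(fh)
    # Print a per-year count as a stand-in for the real page-building step.
    for year, recs in sorted(group_by_year(records).items()):
        print(f"{year}: {len(recs)} papers")
```

Calling `main(["latest.pkl"])` would load that (hypothetical) file and print one count per year. The real script presumably writes out page files instead.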