We use the data from the Iron March forum leak, which has been packaged into an R library by Brendan Knapp.
TODO: update conda env files with latest packages
The env.yml file can be used to recreate, with Anaconda, the Python environment used to run the project.
The env-history.yml file is the conda environment generated with the conda env export --from-history
command.
After the Python packages are installed, run nltk_prerequisites.py
to install the necessary NLTK stop words and punctuation datasets.
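
For readers who want to see what this step amounts to, here is a minimal sketch of a script along these lines. It is not the contents of nltk_prerequisites.py. The resource names "stopwords" and "punkt" are assumptions about what "stop words and punctuation datasets" refers to, and the function name install_nltk_data is hypothetical.

```python
# Hypothetical sketch; the real nltk_prerequisites.py may differ.
# Assumed NLTK resource identifiers (not confirmed by the project):
REQUIRED_RESOURCES = ("stopwords", "punkt")


def install_nltk_data(download=None):
    """Download each assumed resource and report success per resource.

    `download` is any callable taking a resource name and returning a
    truthy value on success. When omitted, nltk.download is used.
    """
    if download is None:
        import nltk  # imported lazily so the function can be exercised without NLTK

        download = nltk.download
    return {name: bool(download(name)) for name in REQUIRED_RESOURCES}
```

Calling install_nltk_data() with no arguments fetches the resources through nltk.download, which needs network access the first time it runs. Passing a custom callable makes it possible to check the logic offline.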