A Jupyter notebook to explore, analyse and visualise Betfair historic data using PySpark.
The following packages should be installed.
pip install pyspark==2.3.3
pip install bz2file
pip install glob2
pip install jupyter
pip install matplotlib
pip install findspark
pip install boto3
See requirements.txt
for list of installed packages.
cd
to the folder that contains the project. Then,
(betfair) C:\betfair> jupyter notebook betfair_analysis.ipynb
Historical data may be downloaded from the Betfair website as follows.
- Request and downloaded the data you want to analyse from this area of Betfair website, https://historicdata.betfair.com/#/mydata
- Create a folder
data
in the project folder. - The
data.tar
file should be opened using your choice of file compression tool - for example Z-Zip. Using the tool, navigate todata.tar\C:\data\xds\historic\BASIC\
, and then extract to the folders to the project'sdata
folder.
https://historicdata.betfair.com/Betfair-Historical-Data-Feed-Specification.pdf