This is a Docker-hosted analysis for Australian weather prediction, based on this dataset.
The aim of this project is to create a Docker environment consisting of:
- Spark Master: Build machine learning preprocessing, analysis, and pipeline tools.
- Hadoop DFS: Access and store large data files.
- Jupyter Notebook: IDE for conducting data analysis.
Run the command below in your console to host the container. Make sure Docker and docker-compose are installed on your computer.
sudo sh init.sh
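If the script fails right away, a missing prerequisite is a likely cause. The snippet below is a minimal sketch for checking that `docker` and `docker-compose` are on your PATH. The helper name `missing_tools` is hypothetical and is not part of this repository.

```shell
#!/bin/sh
# Hypothetical helper (not part of this repository):
# print the name of every tool in the argument list that is not on PATH.
missing_tools() {
    for tool in "$@"; do
        command -v "$tool" >/dev/null 2>&1 || echo "$tool"
    done
}

# Check the two prerequisites this README mentions.
missing=$(missing_tools docker docker-compose)
if [ -n "$missing" ]; then
    echo "Missing prerequisites:" $missing
else
    echo "docker and docker-compose found"
fi
```

The check only confirms that the commands exist. It does not verify that the Docker daemon is running or that your user has permission to use it.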
To run the container separately, without copying the Australian weather prediction dataset, use the command below.
sudo docker-compose up -d