A Docker-hosted Australian weather prediction analysis with PySpark and Hadoop DFS

stevenlimcorn/australian-weather-prediction

australian-weather-prediction

This is a Docker-hosted Australian weather prediction analysis based on this dataset.

The aim of this project is to create a Docker environment consisting of:

  • Spark Master: Builds the machine learning preprocessing, analysis, and pipeline tools.
  • Hadoop DFS: Stores large data files and provides access to them.
  • Jupyter Notebook: IDE for conducting data analysis.

Docker Container Design

[Figure: architecture design]

Run the command below in your console to host the container. Make sure Docker and docker-compose are installed on your computer.

sudo sh init.sh
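
Before starting the container, it can help to confirm that both prerequisites are actually on your PATH. The snippet below is a minimal sketch and not part of this repository: the helper name `missing_tools` is hypothetical, and it only checks that the `docker` and `docker-compose` commands can be found, not that the Docker daemon is running.

```shell
#!/bin/sh
# Hypothetical helper (not shipped with this repository).
# Prints the name of every given command that is not found on PATH.
# Returns 0 when all commands are present, 1 otherwise.
missing_tools() {
    status=0
    for tool in "$@"; do
        if ! command -v "$tool" >/dev/null 2>&1; then
            echo "$tool"
            status=1
        fi
    done
    return "$status"
}

# Check the two prerequisites named above before starting the container.
if missing="$(missing_tools docker docker-compose)"; then
    echo "docker and docker-compose found; ready to run: sudo sh init.sh"
else
    # $missing is left unquoted on purpose so the names print on one line.
    echo "Missing prerequisites:" $missing
fi
```

The check only reports what is missing and never exits, so it is safe to paste into an interactive shell.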

To run the container separately, without copying the Australian weather prediction dataset, use the command below.

sudo docker-compose up -d 
