Skip to content

A collection of data analysis projects done using PySpark via Jupyter notebooks.

Notifications You must be signed in to change notification settings

DIYBigData/spark-data-analysis-projects

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

17 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Spark Data Analysis Projects

These are various Apache Spark data analysis projects done in Jupyter notebooks. Some of these analyses were conducted on the ODROID XU4 mini cluster, which the more recent ones are being performed on the Personal Compute Cluster. Since the XU4 mini cluster is a significantly constrained system, the projects done there are limited in scope. If you are looking to repeat some of these projects, the Personal Compute Cluster versions are more current.

About

A collection of data analysis projects done using PySpark via Jupyter notebooks.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages