A curated list of awesome Apache Spark packages and resources.
-
Updated
Oct 24, 2024 - Shell
A curated list of awesome Apache Spark packages and resources.
R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks
Analyzing the safety (311) dataset published by Azure Open Datasets for Chicago, Boston and New York City using SparkR, SParkSQL, Azure Databricks, visualization using ggplot2 and leaflet. Focus is on descriptive analytics, visualization, clustering, time series forecasting and anomaly detection.
Azure Databricks - Advent of 2020 Blogposts
Taller SparkR para las Jornadas de Usuarios de R
Practice and Workshop on BigData and Cloud Computing using Docker Containers and OpenNebula. HDFS, hadoop and spark+R
Mirror of https://gitlab.com/zero323/dlt
Slides and lab material for the talk R for HPC and big data at http://rsummer.data-analysis.at
Fit a Cubist regression model on StackOverflow data and make predictions in a distributed manner with SparkR
Docker images for testing SparkR builds
Taller Big Data con Apache Spark + R desde Databricks cloud
R workloads running at scale on Google Cloud
A curated list of essential cheatsheets for data analysis, visualization and machine learning using R or Python
Self-service modeling analysis tool based on R language and big data. It integrates SparkR, Rserve, and Mlib machine learning libraries
Sample Codes of Spark using R programming
Add a description, image, and links to the sparkr topic page so that developers can more easily learn about it.
To associate your repository with the sparkr topic, visit your repo's landing page and select "manage topics."