Data cleaning, pre-processing, and Analytics on a million movies using Spark and Scala.
-
Updated
May 19, 2021 - Scala
Data cleaning, pre-processing, and Analytics on a million movies using Spark and Scala.
Routines and data structures for using isarn-sketches idiomatically in Apache Spark
Link Prediction is about predicting the future connections in a graph. In this project, Link Prediction is about predicting whether two authors will be collaborating for their future paper or not given the graph of authors who collaborated for atleast one paper together.
Repository contains various examples of Spark ApI i.e RDD, DataFrame, Structured Streaming etc
Full poc on spark 2, Spark RDD, Spark DStream, Spark SQL, Spark Datasets & DataFrames & Spark Structured Streaming [SCALA][SPARK]
WITSML Frames provides a DataFrame-centric view over WITSML data and prepares them for Apache Spark based machine learning and deep learning.
SageFrames brings together Sage ERP cloud platform and DataFrame-based data science. Transforming purchase & sales data into insights, recommendations and more with Apache Spark has never been easier.
This Application covers Apache Spark, Dataframes and Scala/Java.
scala spark dataframes example for Machin Learning feature pruning
a tiny sample about how to build a parquet file from whatever you need.
Spark SQL Example
Add a description, image, and links to the dataframes topic page so that developers can more easily learn about it.
To associate your repository with the dataframes topic, visit your repo's landing page and select "manage topics."