#

apache-spark

Apache Spark is an open source distributed general-purpose cluster-computing framework. It provides an interface for programming entire clusters with implicit data parallelism and fault tolerance.

Here are 16 public repositories matching this topic...

spark-notebook / spark-notebook

Interactive and Reactive Data Science using Scala and Spark.

data-science reactive scala spark apache-spark notebook

Updated May 16, 2023
JavaScript

cuebook / cuelake

Use SQL to build ELT pipelines on a data lakehouse.

sql apache-spark etl pipelines data-engineering data-lake data-transfer delta data-integration upsert elt data-pipeline datalake data-ingestion spark-sql zeppelin-notebook apache-iceberg lakehouse incremental-updates

Updated May 25, 2022
JavaScript

itsjafer / jupyterlab-sparkmonitor

JupyterLab extension that enables monitoring launched Apache Spark jobs from within a notebook

spark apache-spark jupyter pyspark jupyterlab jupyterlab-extension jupyter-lab

Updated Dec 27, 2022
JavaScript

MongoExpUser / Shale-Reservoir-DNN-and-Drilling-Rare-Events-Graph

Shale-Reservoir-DNN and Drilling-Rare-Events-Graph

javascript python java sql big-data apache-spark graph-algorithms tensorflow sklearn node-js apache-drill c-cpp-napi-addon

Updated Oct 11, 2023
JavaScript

astrolabsoftware / astrolabsoftware.github.io

Website for AstroLab

research apache-spark functional-programming cluster-computing

Updated Apr 12, 2023
JavaScript

thiagocoutinhor / burning-books

SSH Driven Spark Notebook

ssh scala apache-spark notebook

Updated Dec 12, 2022
JavaScript

JohnnyFoulds / nyc-job-exploration

This project uses Apache Spark to explore the popular New York City Current Job Postings Kaggle dataset.

scala apache-spark docker-container word-cloud apache-zeppelin

Updated Sep 7, 2019
JavaScript

felicitybui1 / tinder_of_food

This project is a Flask interactive web application that displays a map of New York City and allows users to query it, along with a recommendation algorithm that matches suppliers to restuarants. The application uses a combination of Python, html, css, and Javascript. The data is stored using Apache Spark and MongoDB.

javascript css python html apache-spark mongodb flask-application

Updated May 25, 2024
JavaScript

MaximLevchenko / MongoDb-ElasticStack-Kibana-Visualisation

This project showcases the use of NoSQL technologies and usage of Elastic Stack for comprehensive data processing and visualization.

visualization docker elasticsearch elasticstack logstash big-data apache-spark mongodb replication docker-compose sharding

Updated Jul 26, 2024
JavaScript

leetoo / myInvestor

System for stock prediction, analysis and investment.

docker akka cassandra apache-spark akka-cluster

Updated Dec 6, 2017
JavaScript

akarsh3007 / youtubedatasentimentalanalysis

redis scala apache-spark sentiment-analysis redis-server bigdata flask-application java-8 apache-kafka d3js python27

Updated Aug 13, 2019
JavaScript

Zahidul-Islam / InsightDataEngineering

λtrace - Performance Optimization tool for AWS Lambda Function

nodejs python vuejs apache-spark aws-s3 apache-airflow dvc

Updated Jan 9, 2023
JavaScript

alivcor / airavat

Airavat is a metric interceptor and a job watchdog for Spark Applications. It also features an interactive UI which shows all Spark Applications running, jobs and SQL Queries along with their metrics.

scala spark apache-spark metrics spark-sql

Updated Mar 19, 2021
JavaScript

Laura9505 / Markdown-Blog-Editor

online blog website

nodejs javascript angular express scala apache-spark mongodb mysql-database html-css jsp-servlet apache-tomcat

Updated Jan 6, 2019
JavaScript

jlloh / sparkui-react

Alternative UI for Spark Web UI that scrapes the Spark Web APIs

apache-spark reactjs

Updated Dec 11, 2022
JavaScript

mihir09 / BigData-Analysis

Scrapped and Analyzed Twitter data using Spark. Run Spark queries on Millions of tweets and trained models for sentiment analysis.

spark apache-spark bigdata twitter-sentiment-analysis spark-sql snscrape

Updated Aug 25, 2023
JavaScript

Created by Matei Zaharia

Released May 26, 2014

Followers: 424 followers
Repository: apache/spark
Website: spark.apache.org
Wikipedia: Wikipedia

Related Topics