Real-time Machine Learning with Apache Spark on Twitter Public Stream
Updated Apr 27, 2017 - Python
A Simple Real-Time Detector of DDoS Attacks with Apache Kafka and Spark Streaming
A Realtime Stock Portfolio Manager built using Apache Distributed Technologies!
A real-time sales data analysis application using Spark Structured Streaming, with Kafka as the messaging system, PostgreSQL as storage for processed data, and Superset for the dashboard.
Big Data Project - SSML - Spark Streaming for Machine Learning
Track trending hashtags on Twitter in real time.
Big data computing tasks conducted with PySpark. The problems involve MapReduce and Streaming algorithms.
Developed a real-time streaming analytics pipeline using Apache Spark to calculate and store KPIs for e-commerce sales data, including total volume of sales, orders per minute, rate of return, and average transaction size. Used Spark Streaming to read data from Kafka, Spark SQL to calculate KPIs, and Spark DataFrame to write KPIs to JSON files.
This project gets data from the Spotify API, ingests it into Kafka for streaming, and processes it with Spark Streaming. All of this runs on Azure.
A streaming app and a dashboard for visualizing cryptocurrency data fetched from the CoinGecko API. The streaming app retrieves real-time cryptocurrency information using Spark Streaming and stores it in a PostgreSQL database.