Streaming Retail Analysis

This repo helps as introduction into data streaming processing. Inside, you will find out how to perform streaming using Apache Trident (Java) and Apache Spark (Scala API).

Both projects take a data input with invoice data (both purchases and cancellations). Data is sent to a Kafka topic by a simulator, which reads a csv file line by line. Each line represents a product purchase or cancellation within an invoice.

Apache Spark

It is located inside the spark_streaming/ folder. The setup and run instructions are in another readme file on that folder.

Apache Storm Trident

It is located inside the kafka_trident/ folder. The setup and run instructions are in another readme file on that folder.

Name		Name	Last commit message	Last commit date
Latest commit History 29 Commits
assets		assets
kafka_trident		kafka_trident
resources		resources
spark_streaming		spark_streaming
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Streaming Retail Analysis

Apache Spark

Apache Storm Trident

About

Releases

Packages

Languages

jgoodman8/streaming-retail-analysis

Folders and files

Latest commit

History

Repository files navigation

Streaming Retail Analysis

Apache Spark

Apache Storm Trident

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages