Fast data pipeline

The data pipeline designed in this project leverages the cutting edge fast data/reactive web solutions such as Play framework, Akka actor, Kafka, Zookeeper, Spark streaming, Cassandra, Elasticsearch, Kibana & Docker containers. The focus of this talk will be around spark streaming using kafka as data source and Elasticsearch and Cassandra as data sink.

Two type of data flow in this pipeline

Application logs
Event sourcing events

Spark streams data in real time from data backplane i.e. Kafka. The streamed data is transformed either to be indexed into Elasticsearch (application logs to be displayed in dashboard kibana) or to be saved in Cassandra (event sourcing data).

Name		Name	Last commit message	Last commit date
Latest commit History 52 Commits
cassandra		cassandra
elasticSearch		elasticSearch
kafka		kafka
kibana		kibana
playApp		playApp
sparkStreaming		sparkStreaming
.gitignore		.gitignore
Pipeline.PNG		Pipeline.PNG
Readme.md		Readme.md
docker-compose.yml		docker-compose.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Fast data pipeline

About

Releases

Packages

Languages

HimanshuArora1234/FastData-Pipeline

Folders and files

Latest commit

History

Repository files navigation

Fast data pipeline

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages