GitHub

What is GoByBus ?

Go by Bus is an application for storing/analyzing communication data from Warsaw's open data platform.
It uses current GPS location of trams, bus stops locations & timetables to achieve following goals:

Store current & historical tram positions for analysis and ML [done]
Store timetables data for analysing line delays [done]
Visualize current & historical locations of queried tram using Google Maps API [done]
Current tram locations as a stream of data from Kafka [done]
Finding anomalies in traffic and calculate communication delays in Spark [TODO 1]
Stores nearest & historical weather info thanks to yr.no API and enriches delay analysis [TODO 2]
Visualizes timetables for selected line [TODO 3]

Tech stack

We are using Microservices with Java 8 + Spring Cloud based on Docker
CQRS architecture is applied (heart of system is Apache Kafka)
Apache Kafka for real-time locations stream
Configuration is stored in central Spring Config Service
Service logs + GC logs are connected to ELK (ElasticSearch + LogStash + Kibana). But no visualisations yet.
Data storing done in MongoDB
Docker as a container service, and docker-compose for getting up the environment for now.
Apache Spark as a main data analysis tool - module SparkPositionAnalyzer need a lot of development thought
Simple long-time-running master version is deployed to AWS using docker-machine
Gradle as a build tool

Nearest tasks

Refactor and develop more completed Spark queries
Introduce new datasource - Weather data from yr.no
Create separate service for timetables data based on GraphQL
Introduce cross-service user tracking with Zipkin
Prepare Kibana log visualizations
Introduce more complex orchestrating tool ei. Kubernetes
Introduce node monitoring - Zabbix

Running

Install Docker and docker-compose
Increase vm.max_map_count for your machine due to ELK requirements
Create your account and generate API key on Warsaw's open data platform.
Put your API key in secret-keys.properties file in main dir as a WARSAW_API_KEY= property
docker compose up -d in main dir
Have fun :)

Bare in mind that solution is pretty complexed and drains a lot of resources. On i7 + 16 GB RAM it's ok. Some clustering will be introduced in future for sure

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
GoByBus-BusStopCrawler		GoByBus-BusStopCrawler
GoByBus-Commons		GoByBus-Commons
GoByBus-ConfigServer		GoByBus-ConfigServer
GoByBus-Dockerfiles		GoByBus-Dockerfiles
GoByBus-LocationCrawler		GoByBus-LocationCrawler
GoByBus-LocationViewer		GoByBus-LocationViewer
GoByBus-Registry		GoByBus-Registry
GoByBus-Web		GoByBus-Web
SparkPositionAnalyzer		SparkPositionAnalyzer
gradle/wrapper		gradle/wrapper
.gitignore		.gitignore
README.md		README.md
Scripts.md		Scripts.md
analisis-pack.yml		analisis-pack.yml
aws-crawler-pack.yml		aws-crawler-pack.yml
bitbucket-pipelines.yml		bitbucket-pipelines.yml
build.gradle		build.gradle
docker-compose.dev.yml		docker-compose.dev.yml
docker-compose.yml		docker-compose.yml
gradlew		gradlew
gradlew.bat		gradlew.bat
kafka.properties		kafka.properties
locations-mongo.properties		locations-mongo.properties
settings.gradle		settings.gradle

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

What is GoByBus ?

Tech stack

Nearest tasks

Running

About

Releases

Packages

Languages

Hejwo/GoByBus

Folders and files

Latest commit

History

Repository files navigation

What is GoByBus ?

Tech stack

Nearest tasks

Running

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages