The system simulates VISA-transaction data management for analytics. It must receive data from an input data stream (Main.py) that simulates real-time data arrival. The data must be distributed to two types of clients with different needs:
- An internal analysis team: they can run geographical queries or use a GIS interface (such as QGIS).
- External analysis companies: they need data streams composed of one or more shop categories matching their interests (see the consumer sketch below).
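For an external company, one way to compose a per-interest stream is to subscribe a single Kafka consumer to several shop-category topics at once. A minimal sketch using kafka-python; the category names "food" and "travel" are hypothetical examples, and the broker address is the one used in the commands below:

    import json
    from kafka import KafkaConsumer

    # Subscribe to every shop-category topic the company is interested in
    # ("food" and "travel" are hypothetical category names).
    consumer = KafkaConsumer(
        "food", "travel",
        bootstrap_servers="192.168.1.28:9092",
        value_deserializer=lambda v: json.loads(v.decode("utf-8")),
    )

    for message in consumer:
        # Each message is one transaction belonging to a subscribed category.
        print(message.topic, message.value)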
Start the MongoDB server:
mongod --port 27018 --dbpath /Users/[your-username]/Documents/hackathon/mongo_data --replSet "hackathon"
Connect to server:
mongo --port 27018
Make the MongoDB node primary (in the mongo shell):
rs.initiate()
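To verify the replica set from Python, a minimal sketch with pymongo; the port and replica set name match the commands above, everything else is illustrative:

    from pymongo import MongoClient

    # Connect to the single-node replica set started above.
    client = MongoClient("localhost", 27018, replicaSet="hackathon")

    # replSetGetStatus raises an error if the replica set is not initiated yet.
    status = client.admin.command("replSetGetStatus")
    print(status["myState"])  # 1 means this node is PRIMARY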
Start ZooKeeper:
zookeeper-server-start /usr/local/etc/kafka/zookeeper.properties
Start the Kafka server:
kafka-server-start /usr/local/etc/kafka/server.properties
Create topic:
kafka-topics --create --zookeeper 192.168.1.28:2181 --replication-factor 1 --partitions 1 --topic nome_topic
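Topics can also be created from Python instead of the CLI, which is convenient when one topic per shop category is needed. A sketch using kafka-python's admin client; the category list is a hypothetical example:

    from kafka.admin import KafkaAdminClient, NewTopic

    admin = KafkaAdminClient(bootstrap_servers="192.168.1.28:9092")

    # One topic per shop category (hypothetical category names).
    categories = ["food", "travel", "electronics"]
    admin.create_topics([
        NewTopic(name=c, num_partitions=1, replication_factor=1)
        for c in categories
    ])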
Producer console:
kafka-console-producer --broker-list 192.168.1.28:9092 --topic nome_topic
Consumer console:
kafka-console-consumer --bootstrap-server 192.168.1.28:9092 --topic nome_topic --from-beginning
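The same smoke test can be done from Python instead of the console tools. A minimal sketch with kafka-python; the topic name and payload are placeholders:

    import json
    from kafka import KafkaProducer, KafkaConsumer

    producer = KafkaProducer(
        bootstrap_servers="192.168.1.28:9092",
        value_serializer=lambda v: json.dumps(v).encode("utf-8"),
    )
    producer.send("nome_topic", {"hello": "kafka"})
    producer.flush()

    consumer = KafkaConsumer(
        "nome_topic",
        bootstrap_servers="192.168.1.28:9092",
        auto_offset_reset="earliest",  # same effect as --from-beginning
        consumer_timeout_ms=5000,      # stop iterating after 5 s of silence
    )
    for message in consumer:
        print(message.value)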
Start the producer script:
python /Users/[your-username]/Documents/code_kafka_changestreams/kafkaproducer.py
Start the consumer script:
python /Users/[your-username]/Documents/code_kafka_changestreams/kafkaconsumer.py
data_in
mongo_in
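As a rough idea of the pattern behind kafkaproducer.py in this setup: watch a MongoDB change stream and forward each inserted document to Kafka. A hedged sketch; the database, collection, and topic names here are assumptions, not necessarily the script's actual ones:

    import json
    from pymongo import MongoClient
    from kafka import KafkaProducer

    client = MongoClient("localhost", 27018, replicaSet="hackathon")
    collection = client["hackathon"]["transactions"]  # assumed names

    producer = KafkaProducer(
        bootstrap_servers="192.168.1.28:9092",
        value_serializer=lambda v: json.dumps(v, default=str).encode("utf-8"),
    )

    # Change streams require a replica set, which is why rs.initiate() ran above.
    with collection.watch() as stream:
        for change in stream:
            if change["operationType"] == "insert":
                producer.send("nome_topic", change["fullDocument"])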
You can find the instructions in the following file:
./SQL/postegre_creation.sql
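The SQL file can be applied with psql, or from Python with psycopg2. A minimal sketch; the connection parameters are placeholders for your local setup:

    import psycopg2

    # Connection parameters are assumptions; adjust to your local PostgreSQL.
    conn = psycopg2.connect(dbname="hackathon", user="postgres",
                            password="postgres", host="localhost")
    with conn, conn.cursor() as cur:
        # Execute the whole creation script in one transaction.
        with open("./SQL/postegre_creation.sql") as f:
            cur.execute(f.read())
    conn.close()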
To run the pipeline, start:
- Main.py from its folder (this simulates data arrival)
- postgresql.postgreconsumer.py (reads the Kafka stream)
- postegresql.analysis_society.py (uses the raw interface to execute the 3 given queries)
- mongo_router.router.py (classifies the input stream into shop_category topics; see the sketch after this list)
- mongo_router.receive_interests (loads the given_user stream into MongoDB)
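As a hedged sketch of the routing step performed by mongo_router.router.py: consume the raw input stream and republish each transaction to the topic named after its shop category. The input topic name and the shop_category field name below are assumptions:

    import json
    from kafka import KafkaConsumer, KafkaProducer

    consumer = KafkaConsumer(
        "data_in",  # assumed name of the raw input topic
        bootstrap_servers="192.168.1.28:9092",
        value_deserializer=lambda v: json.loads(v.decode("utf-8")),
    )
    producer = KafkaProducer(
        bootstrap_servers="192.168.1.28:9092",
        value_serializer=lambda v: json.dumps(v).encode("utf-8"),
    )

    for message in consumer:
        tx = message.value
        # Route each transaction to the topic of its shop category
        # ("shop_category" is an assumed field name).
        producer.send(tx["shop_category"], tx)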