#Rettiwt- Rebuilding the Twitter Platform
This is the project I carried out during the seven-week Insight Data Engineering Fellows Program which is designed for people with strong knowledge of computer science fundamentals and coding skills to transition to data engineering by giving them a space to get hands-on experience building out distributed platforms on AWS using open source technologies.
Rettiwt is a twitter-like social media platform that delivers real-time streaming data from twitter api to all the simulated users. Basicly you can login using user id from 0 to 99999 to see their home page. Chech it out at daoyu.online and here is another video demo.
##Code Structure
- cassandra_connector
A side project written in JAVA to simulate users' names and followers and to compare reading rate from normalized schema and denormalized schema. - cassandra_schema
CQL language of schema in Cassandra database. - flask
A frontend implementation using flask(This framework is sooo convenient!!!) written in python. Bootstrap was applied for the web design. - kafka
A kafka producer written in python. Used tweepy to call twitter api to get real-time tweets and pipe them into kafka. - spark_stream
A Spark streaming process written in Scala which delivers the tweets. Used spark-cassandra-connector and lettuce to connect Cassandra and Redis.