Skip to content
/ Rettiwt Public

A twitter-like social media platform that deal streaming data in real time

Notifications You must be signed in to change notification settings

DaoyuT/Rettiwt

Repository files navigation

#Rettiwt- Rebuilding the Twitter Platform This is the project I carried out during the seven-week Insight Data Engineering Fellows Program which is designed for people with strong knowledge of computer science fundamentals and coding skills to transition to data engineering by giving them a space to get hands-on experience building out distributed platforms on AWS using open source technologies.
Rettiwt is a twitter-like social media platform that delivers real-time streaming data from twitter api to all the simulated users. Basicly you can login using user id from 0 to 99999 to see their home page. Chech it out at daoyu.online and here is another video demo.

##Data Pipeline alt tag

##Code Structure

  • cassandra_connector
    A side project written in JAVA to simulate users' names and followers and to compare reading rate from normalized schema and denormalized schema.
  • cassandra_schema
    CQL language of schema in Cassandra database.
  • flask
    A frontend implementation using flask(This framework is sooo convenient!!!) written in python. Bootstrap was applied for the web design.
  • kafka
    A kafka producer written in python. Used tweepy to call twitter api to get real-time tweets and pipe them into kafka.
  • spark_stream
    A Spark streaming process written in Scala which delivers the tweets. Used spark-cassandra-connector and lettuce to connect Cassandra and Redis.

About

A twitter-like social media platform that deal streaming data in real time

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published