Skip to content

get the real-time data from stock API (for this project I'm using dummy data) then stream it into datalake and finally into data ware house using kafka

Notifications You must be signed in to change notification settings

ArkanNibrastama/stream-pipeline-stock-analytics

Repository files navigation

Build Stream Pipeline for Stock Analysis

This repository contain my project about building streaming pipeline to ingest stock data into data lake (AWS S3), after that move the data automatically using snowpipe and load it into data waerhouse (Snowflake).

Architecture

Architecture

Run locally

  • Clone the project.

    git clone https://github.com/ArkanNibrastama/stream-pipeline-stock-analytics
  • Install all the dependencies.

    pip install -r requirements.txt
  • Set up the Snowflake and snowpipe (you can find the full explanation in the last section).

  • Fill the blank variable with your own data
    example:

    API_KEY = "{/YOUR API KEY}"
    AWS_ACCESS_KEY = "{/YOUR ACCESS KEY}"
    AWS_ACCESS_SECRET_KEY = "{/YOUR SECRET ACCESS KEY}"
  • Run the producer

    python producer.py
  • Run the consumer

    python consumer.py

Full explanation

To make better understand of this repository, you can check my linkedin post about this project Build Real-time Stock Analytics Data Pipeline.

About

get the real-time data from stock API (for this project I'm using dummy data) then stream it into datalake and finally into data ware house using kafka

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages