Skip to content

An ETL pipeline for two stocks (ANZ, CBA) available from Yahoo Finance.

Notifications You must be signed in to change notification settings

James-Akerman/yahoo-finance-anz-cba-etl

Repository files navigation

Yahoo Finance ANZ/CBA ETL

Overview of the Project

This is an ETL (extract, transform, load) process which scrapes several pages of the most recent data from the Yahoo Finance website regarding the the CBA (Commonwealth Bank of Australia) and ANZ (Australia and New Zealand Banking Group Limited) stocks.

yahoo-finance


This ETL process creates a database in MongoDB containing two collections for both stocks, and two stock historical data csv files for both stocks and saves them in the location of the cloned repository.

Database Structure

Database Structure


Example collection (Income Statements)

Income Statements


Tools/Packages used

  • Python
    • datetime
    • splinter
    • BeautifulSoup
    • selenium import webdriver
    • pandas
    • pymongo
  • MongoDB

How to use

  • Clone the respostiory to your Desktop.
  • Pip install all the modules listed in Tools/Packages used.
  • open jupyter notebook in git bash and open the Complete ETL Process.Ipynb notebook.
  • change the user variable on line 30.

Change User name


Resouces used for this project

About

An ETL pipeline for two stocks (ANZ, CBA) available from Yahoo Finance.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •