Skip to content

thakurfurqaan/python-etl

Repository files navigation

Python-ETL

This is a Python library for creating end-to-end ETL pipelines. It is currently under active development.

Dependencies

  1. Pandas == ?
  2. PyArrow == ?
  3. PySpark == ?

Features

Data Extraction

  1. FTP (ftplib)
  2. sFTP (paramiko)
  3. API
  4. Webscraping
  5. Databases (with listeners)
  6. Queues (ActiveMQ, RabbitMQ, SQS)

Data Transformation

  1. Pandas

Data Loading

  1. Upserting data using Pandas.
  2. Triggers

Maintainers

Furqaan Thakur

About

This is a python-based library for ETL processes.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published