Skip to content
View evanmathew's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report evanmathew

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
evanmathew/README.md

evanmathew

🌐 Connect with me:

evansajumathew evansajumathew evansajumathew


💫 About Me:

I'm a working Analyst with over 2 years of experience in data analysis and data engineering.

With extensive experience in tools such as Azure, Linux, Python,Docker, Apache Airflow, CI/CD, MySQL, Power BI, and Excel, I have successfully led multiple end-to-end data analytics projects. My journey includes working as an Analyst at Globallogic India, where I leveraged data to drive strategic decisions, enhance customer experiences, and streamline operational workflows.

Currently, I focus on developing and honing my data engineer skills

🔗 Explore My Work

🌱 What I’m Learning

I’m currently focused on advancing my knowledge in Data Engineering, particularly in tools and technologies like Apache Kafka, Spark, and Cloud Platforms.

💼 Portfolio & Resources

💻 Tech Stack:

Data Engineering Tools

Apache Spark Apache Kafka Apache Airflow Snowflake

Programming Languages

Python HTML5 CSS3 PowerShell Bash Script

Databases

MySQL Postgres Oracle

Data Visualization and Analysis

Matplotlib Plotly Pandas NumPy

Version Control and Containerization

Git GitHub Docker

Cloud and Operating Systems

AWS Linux

Design and Prototyping

Canva Adobe Illustrator Adobe XD Figma

Pinned Loading

  1. ETL-University-Course-Extraction-Using-Spark-Snowflake ETL-University-Course-Extraction-Using-Spark-Snowflake Public

    This project automates the extraction of university course details (e.g., schedules, professors, course codes) from text files using Regex pattern and SpaCy NLP Model and , processes them using PyS…

    Python

  2. euro-2024-kafka-pinot-pipeline euro-2024-kafka-pinot-pipeline Public

    This project implements a real-time data pipeline for EURO 2024 football data, utilizing Apache Kafka for streaming, Apache Pinot for fast querying, and Apache Superset for data visualization. The …

    Python

  3. Reddit_ETL_DE Reddit_ETL_DE Public

    This project demonstrates a complete data pipeline for extracting, transforming, and loading (ETL) Reddit data into an Amazon Redshift data warehouse. The pipeline uses various AWS services and too…

    Python 1

  4. Data-Analysis-Projects Data-Analysis-Projects Public

    This repository hosts multiple data analysis projects, showcasing a variety of real-time and batch processing pipelines. Each project highlights different tools and technologies, offering comprehen…

    Jupyter Notebook 1

  5. evanmathew evanmathew Public

    1

  6. netflix_sql_data_analysis netflix_sql_data_analysis Public

    This project explores the Netflix dataset using SQL to answer complex analytical questions. It involves data cleansing, aggregation, ranking, and advanced SQL techniques to uncover insights such as…