End-to-End MLOps: From Data to Deployment

Introduction

Welcome to the End-to-End MLOps Project for Spelling Orthographic Correction Automation! This repository serves as a demonstration of a complete end-to-end MLOps solution designed to streamline and enhance the development, deployment, and upkeep of machine learning models dedicated to spelling orthographic correction. In this context, MLOps represents the fusion of machine learning processes with DevOps principles, delivering a framework that guarantees the repeatability, scalability, and full automation of tasks throughout the entire lifecycle of our orthographic correction model.

Project Overview

This project offers a meticulously designed and structured pipeline tailored specifically for machine learning initiatives, encompassing every aspect of the process from initial data preprocessing to the ultimate deployment of our spelling orthographic correction model. Our primary objective is to facilitate seamless cooperation and synergy among data scientists, machine learning engineers, and operations teams. This synergy is geared towards optimizing the entire workflow, resulting in an exceptionally efficient and dependable deployment process for our spelling orthographic correction model.

🚀 Features

Data Versioning: DVC for version control 📊📦
Model Training: BERT-based spell correction 📝🤖
Secure Storage: AWS S3 artifact security 🛡️🗃️
User Interface: Flask web app for correction 💬🌐
Project Improvement: User feedback-driven enhancements 🔄📈👥
Deployment: Docker for consistent deployment 🚀🐳
Hosting: AWS ECR/EC2, custom domain 🌐🏢🌐
Continuous Deployment: GitHub Actions for automation ⚙️🔄🚀
Monitoring: Grafana & AWS CloudWatch 📈🔍📊

Tech Stack

The MLOps project utilizes the following main tools and libraries:

NLTK (Natural Language Toolkit) 🧠: an open-source NLP library for data processing.
Spello: a library having a pretrained model for spelling correction.
Keras: a deep learning framework for building and training neural networks.
DVC (Data Version Control) 📈: a version control system for data sets and machine learning models.
Flask 🤖: a lightweight web framework for creating APIs.
Docker 🐳: a containerization platform for packaging applications.
Amazon EC2 ☁️: cloud-based virtual machines for deployment.
AWS CloudWatch 📊: a cloud monitoring and observability platform.
Grafana 📈: a monitoring and observability platform.

Prerequisites

Before you begin, make sure you have the following in place:

AWS Account: You need an AWS account to access EC2, ECR, and S3 services.
Docker: Make sure you have Docker installed on your local machine.
Python: Ensure you have Python (version 3.6 or 3.8) installed.

Architecture

Data Source

Spelling Corrector | Kaggle

Getting Started

To get started with this project, follow these instructions to set up your environment and start working with the MLOps pipeline.

Installation

To set up and run this project on your local machine, follow these steps:

Clone the repository:

git clone https://github.com/IbLahlou/SpellX

Navigate to the project directory:

cd SpellX

Install project dependencies:

pip install -e .
pip install -r requirements.txt

Run the project:

python ./main.py

Start the Flask API:

cd api
flask run

Now, the project is installed and running locally on your machine.

Workflows

Update config.yaml
Update secrets.yaml [Optional]
Update params.yaml
Update the entity
Update the configuration manager in src config
Update the components
Update the pipeline
Update the main.py
Update the dvc.yaml

Contributing

If you would like to contribute to this project, please fork the repository, make your changes, and submit a pull request. We welcome contributions from the community!

Name		Name	Last commit message	Last commit date
Latest commit History 68 Commits
.dvc		.dvc
.github/workflows		.github/workflows
.idea		.idea
api		api
config		config
logs		logs
readme_assets		readme_assets
spellX.egg-info		spellX.egg-info
src		src
.dockerignore		.dockerignore
.dvcignore		.dvcignore
.gitignore		.gitignore
README.md		README.md
dockerfile		dockerfile
dvc.yaml		dvc.yaml
main.py		main.py
params.yaml		params.yaml
requirements.txt		requirements.txt
setup.py		setup.py
template.py		template.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

End-to-End MLOps: From Data to Deployment

Introduction

Project Overview

🚀 Features

Tech Stack

Prerequisites

Architecture

Data Source

Getting Started

Installation

Workflows

Contributing

About

Releases

Packages

Contributors 2

Languages

IbLahlou/SpellX

Folders and files

Latest commit

History

Repository files navigation

End-to-End MLOps: From Data to Deployment

Introduction

Project Overview

🚀 Features

Tech Stack

Prerequisites

Architecture

Data Source

Getting Started

Installation

Workflows

Contributing

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages