This is the source code to go along with the blog article
Concept Drift and Model Decay in Machine Learning
Concept drift is a drift of labels over time for essentially the same data. It causes the decision boundary for new data to diverge from that of a model built on earlier data/labels. Scoring randomly sampled new data can detect the drift, letting us trigger the expensive re-label/re-train tasks on an as-needed basis.
The simulation below produces an example of concept drift. It depends on:

- numpy
- scikit-learn
- matplotlib
The following command simulates concept drift in a 2-d, 2-class setting where the linear decision boundary rotates by 0.2 degrees with each new batch of data.
pipenv run python ./concept-drift.py 1000
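For reference, here is a minimal sketch of that kind of simulation: points are labelled by a linear boundary through the origin, the boundary rotates by 0.2 degrees with each batch, and a model fit to the first batch slowly decays. The batch size, number of batches, and choice of LogisticRegression are illustrative assumptions, not taken from concept-drift.py.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score

rng = np.random.default_rng(0)

def labels_for(X, angle_deg):
    """Class = which side of a line through the origin, tilted by angle_deg."""
    theta = np.radians(angle_deg)
    normal = np.array([-np.sin(theta), np.cos(theta)])  # normal to the boundary
    return (X @ normal > 0).astype(int)

# Fit a model to the initial batch, whose true boundary is at 0 degrees.
X0 = rng.uniform(-1.0, 1.0, size=(1000, 2))
model = LogisticRegression(max_iter=1000).fit(X0, labels_for(X0, 0.0))

# Each new batch has the same feature distribution, but the true boundary
# has rotated a further 0.2 degrees, so the labels drift away from the model.
for batch in range(1, 201):
    angle = 0.2 * batch
    X = rng.uniform(-1.0, 1.0, size=(1000, 2))
    acc = accuracy_score(labels_for(X, angle), model.predict(X))
    if batch % 50 == 0:
        print(f"batch {batch:3d}: true boundary at {angle:4.1f} deg, accuracy {acc:.3f}")
```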
The morphing of the decision boundary can be detected by scoring the model on a sample of the new data. When the model's predictive ability sinks below a threshold, that loss is the signal to re-label the data and re-train the model.
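The sketch below shows one way that trigger might be wired up. The `label_fn` argument stands in for the (expensive) labelling step, and the 0.9 threshold, 100-point sample, and classifier choice are assumptions for illustration, not values taken from this repo.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score

ACCURACY_THRESHOLD = 0.9  # assumed cutoff for "predictive ability has sunk"
SAMPLE_SIZE = 100         # assumed number of new points we are willing to label cheaply

def monitor_and_retrain(model, new_batches, label_fn, rng):
    """Score each new batch on a small labelled sample; re-train only when needed."""
    for X in new_batches:
        # Cheap check: label a random sample of the batch and score the model on it.
        idx = rng.choice(len(X), size=SAMPLE_SIZE, replace=False)
        sample_acc = accuracy_score(label_fn(X[idx]), model.predict(X[idx]))
        if sample_acc < ACCURACY_THRESHOLD:
            # Expensive path: re-label the whole batch and re-train the model.
            model = LogisticRegression(max_iter=1000).fit(X, label_fn(X))
    return model
```

Only the sampled points need fresh labels on the cheap path; the full re-label/re-train cost is paid only when the sampled accuracy actually drops below the threshold.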