outdpik: Fundamental toolkit for outlier analysis and visualization


What is it?

Outdpik is an open-source Python package that provides several methods for outlier detection. It aims to be the fundamental high-level package for this purpose, and it also offers visualization methods for outlier analysis.

Main Features

Here are just a few of the things that outdpik does well:

  • Supports NumPy arrays and pandas DataFrames
  • Multiple outlier detection techniques that can be combined
  • Powerful visualizations
  • Flexible selection of one or more columns for the analysis
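
To illustrate what combining detection techniques can mean, here is a minimal, dependency-free sketch. It is not outdpik's implementation: the helper names (`iqr_flags`, `zscore_flags`, `combine`), the thresholds, and the sample values are all hypothetical, and the two rules (interquartile range and z-score) are assumed only as common examples of outlier tests.

```python
from statistics import mean, pstdev, quantiles


def iqr_flags(values, k=1.5):
    """Flag values outside [Q1 - k*IQR, Q3 + k*IQR] (hypothetical helper)."""
    q1, _, q3 = quantiles(values, n=4, method="inclusive")
    spread = q3 - q1
    low, high = q1 - k * spread, q3 + k * spread
    return [v < low or v > high for v in values]


def zscore_flags(values, threshold=2.0):
    """Flag values whose absolute z-score exceeds `threshold` (hypothetical helper)."""
    mu, sigma = mean(values), pstdev(values)
    if sigma == 0:
        # All values are identical, so nothing can be an outlier.
        return [False] * len(values)
    return [abs(v - mu) / sigma > threshold for v in values]


def combine(flag_lists, mode="all"):
    """Combine per-technique flags: "all" keeps values every technique
    flags, "any" keeps values flagged by at least one technique."""
    agree = all if mode == "all" else any
    return [agree(flags) for flags in zip(*flag_lists)]


values = [10, 12, 11, 13, 12, 11, 95]
both = combine([iqr_flags(values), zscore_flags(values)], mode="all")
outliers = [v for v, flagged in zip(values, both) if flagged]
print(outliers)
```

Requiring agreement (`mode="all"`) is the more conservative choice; `mode="any"` casts a wider net at the cost of more false positives.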

Where to get it

The source code is currently hosted on GitHub at: https://github.com/DanielPuentee/outdpik

The installer for the latest released version is available from the Python Package Index (PyPI):

# PyPI
pip install outdpik

How to use outdpik

Examples of configuring and running outdpik:

import outdpik as outdp
outdp = outdp()

Next, detect the outliers. The call returns a dictionary that maps each numeric feature to its outlier instances:

outliers_dict = outdp.outliers(df=df, cols="all")
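
The shape of such a result can be sketched without the library. The example below is an assumption-laden illustration, not outdpik's code: `iqr_outliers` is a hypothetical helper, a plain dict of column name to values stands in for the DataFrame, and the interquartile-range rule is assumed as the detection method. Non-numeric columns are skipped, mirroring the "dictionary of numeric features" described above.

```python
from statistics import quantiles


def iqr_outliers(table, cols="all", k=1.5):
    """Return {column: [outlier values]} for the numeric columns of `table`.

    `table` is a dict of column name -> list of values (a stand-in for a
    DataFrame). Hypothetical helper, not part of outdpik.
    """
    if cols == "all":
        names = list(table)
    elif isinstance(cols, str):
        names = [cols]  # a single column name
    else:
        names = list(cols)

    result = {}
    for name in names:
        values = table[name]
        # Skip non-numeric columns (bool is excluded on purpose).
        if not all(
            isinstance(v, (int, float)) and not isinstance(v, bool)
            for v in values
        ):
            continue
        q1, _, q3 = quantiles(values, n=4, method="inclusive")
        spread = q3 - q1
        low, high = q1 - k * spread, q3 + k * spread
        result[name] = [v for v in values if v < low or v > high]
    return result


data = {
    "x": [10, 12, 11, 13, 12, 11, 95],
    "y": [1.0, 1.2, 0.9, 1.1, 1.0, 1.05, 0.95],
    "label": ["a", "b", "c", "d", "e", "f", "g"],
}
outliers_dict = iqr_outliers(data)
print(outliers_dict)
```

Here only column `x` contains a value outside the IQR fences; `y` maps to an empty list and `label` is omitted because it is not numeric.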

Plotting the outliers of a single column:

outdp.plot_outliers(df=df, col="x")

Figure: strip plot of the detected outliers.


Dependencies

License

This project is licensed under a GNU license; see the LICENSE file for details.

Documentation

The official documentation is hosted on: https://outdpik.readthedocs.io/en/latest/

Development

Want to contribute? Great! Open a discussion on GitHub in this repository and we will answer as soon as possible.