Compute Natural Breaks in Python (Fisher-Jenks algorithm)
-
Updated
Jun 23, 2024 - Python
Compute Natural Breaks in Python (Fisher-Jenks algorithm)
A toolset to test data classification engines that generates mock data in various file formats, sizes and data profiles.
Python Data Loss Prevention (DLP) SDK - Nightfall Developer Platform
📊 数据挖掘常用算法:关联分析Apriori算法,数据分类决策树算法,数据聚类K-means算法
BinGuru is an open-source Typescript package to bin/classify data using 18 established binning methods, including a new method, resiliency.
Cartography of Genomic Interactions Enables Deep Analysis of Single-Cell Expression Data (Nature Communications, 2023)
Scan directories, exports, and backups for sensitive data (like PII and API keys) with Nightfall's data loss prevention (DLP) APIs. Discover what lives at-rest in your data silos.
Visual Knowledge Discovery demo tools for interactively visualizing, exploring, and identifying complex n-D data patterns in multivariate CSV data, to visualize machine learning classifier models.
Two differrent approach to predict Churn customers and finding out important variables that drives churn
Build visual machine learning models with multidimensional general line coordinate visualizations by interactive classification and synthetic data generation tools.
ELT (Extract, Load, Transform) process of accelerometer/gyroscope events with Apache Spark (w/ Structured Streaming) and TimescaleDB
Discover ROPAC, a novel rule-based classifier we proposed. Here, you'll find the code, data, and original paper detailing this data classification algorithm.
Neural Network Deep learning specialization course offered via Coursera
Given the name of a property or attribute like 'BrandName' or 'AmountReceived', try to predict a data type like String, Boolean, Integer...
A cross-platform tool for building machine learning models with General Line Coordinates lossless data visualizations, analyzing classifier errors, and improving classification with assistive computational tools with the goal of defining robust visual model representations as hyperblocks.
This data analysis notebook demonstrates lossless, lossy visualizations techinques, and classification methods. We demonstrate analysis of scientific data on hot-swappable datasets.
This project classify images from the CIFAR-10 dataset. The dataset consists of airplanes, dogs, cats, and other objects.
Data classification defines and categorizes data according to its type, sensitivity, and value
Add a description, image, and links to the data-classification topic page so that developers can more easily learn about it.
To associate your repository with the data-classification topic, visit your repo's landing page and select "manage topics."