Skip to content

Various Datasets for Machine Learning Research & Teaching

Notifications You must be signed in to change notification settings

akmand/datasets

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

30 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

datasets

A collection of public datasets for machine learning research & teaching.

Dataset descriptions:

https://github.com/akmand/datasets/blob/main/github_dataset_descriptions.ipynb

Automated script to prepare these datasets for predictive modeling:

https://github.com/akmand/datasets/blob/main/prepare_dataset_for_modeling_github.py

Convention:

  • First row is the header row.
  • Target feature is the last column.
  • There might be missing values.
  • Categorical features are encoded as strings - so that pd.get_dummies() work correctly.

About

Various Datasets for Machine Learning Research & Teaching

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published