Chinese and English NER datasets, English-Chinese machine translation datasets, and a Chinese word segmentation dataset.
Updated Feb 3, 2021 - Python
A malware classifier dataset built from the header field values of Portable Executable (PE) files
A large, free audio sample database (10M words pronounced), a test bed for voice activity detection algorithms and for single-syllable word recognition
jazznet dataset of piano patterns for music audio machine learning research
2D Geometric shapes generator
We currently maintain 488 data sets as a service to the machine learning community. You may view all data sets through our searchable interface. For a general overview of the Repository, please visit our About page. For information about citing data sets in publications, please read our citation policy. If you wish to donate a data set, please c…
A duplicate-free variant of the CIFAR test set.
UCLA Dining Hall Menus Dataset
Corpus of Coq code related to MathComp including several machine-readable representations
Extract Japanese characters database.
Classification dataset for distinguishing cat and dog images
OpenFrameworks program that generates training data from font-faces installed on your Mac.
Corpus of manually classified comments for machine learning (filtering offensive comments)
Marktplaats.nl (Dutch Classifieds) Listing Scraper
CSV datasets for ML/AI models from captured network traffic during ZAP scanning with web applications like Django, Flask, React, Vue and Spring - Anti-Nex training datasets
Simple task for mixed image-graph data
Given a product name, the Python program downloads all the images. It also handles pagination.
Generate captchas for ML tasks in parallel.
Tools for a course on deep learning in physics research
sentence polarity dataset v1.0 (includes sentence polarity dataset README v1.0): 5331 positive and 5331 negative processed sentences / snippets. Introduced in Pang/Lee ACL 2005. Released July 2005.