Add OpenSLR dataset #2173

Merged
merged 12 commits into from
Apr 12, 2021

cahya-wirawan (Contributor) commented Apr 6, 2021

OpenSLR (https://openslr.org/) is a site devoted to hosting speech and language resources, such as training corpora for speech recognition and software related to speech recognition. OpenSLR lists around 80 speech datasets; this PR currently includes only nine of them: SLR41, SLR42, SLR43, SLR44, SLR63, SLR64, SLR65, SLR66 and SLR69 (Javanese, Khmer, Nepali, Sundanese, Malayalam, Marathi, Tamil, Telugu and Catalan, respectively). I can add the other speech datasets gradually later.
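For reference, the resource IDs and languages listed above can be written as a small lookup table. This is only a sketch: the pairing assumes the two lists in the description are in the same order, and `language_for` and `load_openslr` are hypothetical helpers, not part of this PR. `load_openslr` further assumes the dataset is registered as `"openslr"`, that config names match the resource IDs, and that a `"train"` split exists; none of that is confirmed in this thread.

```python
# Resource IDs from the PR description, paired with the languages in the
# order the description lists them (assumed to correspond one-to-one).
OPENSLR_LANGUAGES = {
    "SLR41": "Javanese",
    "SLR42": "Khmer",
    "SLR43": "Nepali",
    "SLR44": "Sundanese",
    "SLR63": "Malayalam",
    "SLR64": "Marathi",
    "SLR65": "Tamil",
    "SLR66": "Telugu",
    "SLR69": "Catalan",
}


def language_for(resource_id: str) -> str:
    """Return the language of a resource ID covered by this PR (hypothetical helper)."""
    key = resource_id.upper()
    if key not in OPENSLR_LANGUAGES:
        supported = ", ".join(sorted(OPENSLR_LANGUAGES))
        raise ValueError(f"{resource_id!r} is not included; supported: {supported}")
    return OPENSLR_LANGUAGES[key]


def load_openslr(resource_id: str, split: str = "train"):
    """Hypothetical loader. Assumes the dataset name "openslr", config names
    equal to the resource IDs, and a "train" split."""
    language_for(resource_id)  # reject IDs this PR does not cover
    from datasets import load_dataset  # imported lazily; needs the `datasets` package

    return load_dataset("openslr", resource_id.upper(), split=split)
```

Under those assumptions, `load_openslr("SLR41")` would fetch the Javanese subset, and an unsupported ID such as `"SLR12"` fails early with a `ValueError` before any download starts.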

lhoestq (Member) left a comment

This is really cool, thank you!

datasets/openslr/README.md (outdated, resolved)
datasets/openslr/README.md (outdated, resolved)
datasets/openslr/openslr.py (outdated, resolved)
datasets/openslr/openslr.py (outdated, resolved)
datasets/openslr/openslr.py (outdated, resolved)
datasets/openslr/README.md (outdated, resolved)
cahya-wirawan and others added 3 commits April 7, 2021 11:53
Co-authored-by: Quentin Lhoest <42851186+lhoestq@users.noreply.github.com>
Co-authored-by: Quentin Lhoest <42851186+lhoestq@users.noreply.github.com>
openslr.py: updated the description and removed unused variable.
cahya-wirawan mentioned this pull request Apr 8, 2021
lhoestq (Member) left a comment

LGTM, thanks!

I just added a missing section to the README and updated the language tags.

lhoestq merged commit 1f8be07 into huggingface:master Apr 12, 2021