-
Notifications
You must be signed in to change notification settings - Fork 116
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
* Added chunking * Dynamic chunking * Raise error in jinja template * Format errors * Fix test * Return sample when format is wrong * Remove template failures counter, doenst work with multi-proc * Addressing comments * Handle text proc asr error in dataset.py * removing extra prints * Make process upload split recurisve * Add more comments * More comments * Use None instead of empty quotes. Type issue resolved * Chunked dataset subclass * HF readme integration * format * Add dataset version to load * Remove empty audio * Moved asr error try catch to get_transcribe_sample * Remove total samples processed * Change continuation to text * Some fixes * Fix more bugs * Address comments * Address comments * Fix import format * Extra filter method * Check empty columns filter * Add empty column check
- Loading branch information
Showing
7 changed files
with
616 additions
and
80 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.