LibriCSS recipe #4321
Whew! That's quite a changeset!
I did not mark nits in files that otherwise needed no attention, so as not to drown you in the changes. I'm sure I missed more than a couple of things, but I hope this is already helpful.
egs/libri_css/s5_mono/local/convert_rttm_to_utt2spk_and_segments.py
@kkm000 Thanks, this was very helpful! I have made (almost) all the changes you suggested.
Thanks! @kkm you can merge if you think it's OK.
…On Thu, Nov 12, 2020 at 5:03 AM Desh Raj ***@***.***> wrote:
@kkm000 <https://github.com/kkm000> Thanks this was very helpful! I have
made (almost) all the changes you suggested.
A few glitches introduced in the change.
egs/libri_css/s5_mono/local/diarization/post_process_css_rttm.py
You make me 😳 :) By the way, there is a nice book called Pure Bash Bible. A lot of good tricks!
Keeping this note as a reminder: change the call to grep back to calling find(1) as an external process, and we're done!
I'm fine with hosting it but please prepare the doc files that are needed.
y.
…On Thu, Nov 12, 2020 at 10:57 PM Daniel Povey ***@***.***> wrote:
***@***.**** commented on this pull request.
------------------------------
In egs/libri_css/s5_mono/local/data_prep_css.sh
<#4321 (comment)>:
> + wget -c --load-cookies /tmp/cookies.txt \
+ "https://docs.google.com/uc?export=download&confirm=$(wget --quiet \
+ --save-cookies /tmp/cookies.txt --keep-session-cookies --no-check-certificate \
+ 'https://docs.google.com/uc?export=download&id=1Piioxd5G_85K9Bhcr8ebdhXx0CnaHy7l' \
+ -O- | sed -rn 's/.*confirm=([0-9A-Za-z_]+).*/\1\n/p')&id=1Piioxd5G_85K9Bhcr8ebdhXx0CnaHy7l" \
+ -O for_release.zip && rm -rf /tmp/cookies.txt
I think it's a good idea to mirror on OpenSLR but we should check what
@jtrmal <https://github.com/jtrmal> thinks.
In any case, IMO we shouldn't let this delay merging.
Thank you @desh2608, that was a helluva work! LGTM, merging.
This recipe contains multi-speaker diarization + ASR for long recordings. There are 2 scenarios:
s5_mono: The input is a single-channel mixed recording (possibly containing overlapping speech).
s5_css: The input is continuously separated audio streams. By "continuous" separation, we mean a method that performs separation on windowed segments, such that the same speaker may be split across different audio streams in different windows.

For diarization, we use x-vectors and spectral clustering. For ASR, we use a TDNNF-based model (pretrained model from Librispeech) fine-tuned on Librispeech + simulated RIRs. Evaluation is in terms of cpWER (similar to CHiME-6). More details are in the README.md file.
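For readers unfamiliar with cpWER (concatenated minimum-permutation WER), here is a minimal sketch of the idea. It is not the scoring code from this recipe: the function names `edit_distance` and `cp_wer` and the dict-of-lists input layout are assumptions made for illustration. Each speaker's utterances are concatenated, every assignment of hypothesis speakers to reference speakers is scored, and the assignment with the fewest word errors is kept.

```python
from itertools import permutations


def edit_distance(ref, hyp):
    """Word-level Levenshtein distance between two token lists."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            curr.append(min(
                prev[j] + 1,             # deletion
                curr[j - 1] + 1,         # insertion
                prev[j - 1] + (r != h),  # substitution or match
            ))
        prev = curr
    return prev[-1]


def cp_wer(ref_by_spk, hyp_by_spk):
    """cpWER for one recording.

    Both arguments map a speaker label to that speaker's list of
    utterance strings (hypothetical layout, chosen for this sketch).
    """
    # Concatenate all utterances of each speaker into one word sequence.
    refs = [" ".join(utts).split() for utts in ref_by_spk.values()]
    hyps = [" ".join(utts).split() for utts in hyp_by_spk.values()]

    # Pad the shorter side with empty "speakers" so that a missed or
    # extra speaker is scored as pure deletions or insertions.
    n = max(len(refs), len(hyps))
    refs += [[] for _ in range(n - len(refs))]
    hyps += [[] for _ in range(n - len(hyps))]

    total_ref_words = sum(len(r) for r in refs)
    if total_ref_words == 0:
        raise ValueError("reference contains no words")

    # Exhaustive search over speaker assignments; fine for a handful of
    # speakers, but factorial in the number of speakers.
    best_errors = min(
        sum(edit_distance(r, h) for r, h in zip(refs, perm))
        for perm in permutations(hyps)
    )
    return best_errors / total_ref_words
```

Because the speaker labels produced by diarization are arbitrary, the permutation search is what makes the metric independent of label names. For recordings with many speakers, an assignment solver would be a more practical choice than brute force.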