Decoding with time slots and with a confidence #713
Comments
We already have the option to return timestamps during decoding.
Which option should I use to print the timestamp and the confidence of every word? Utterance-level confidence is less important; it should be roughly equivalent to the average of all word confidences in the utterance, excluding silences and UNKs.
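To make the utterance-level definition above concrete, here is a minimal sketch of averaging word confidences while skipping silences and UNKs. It is not k2 or Sherpa API: the function name `utterance_confidence`, the `(word, confidence)` pair layout, and the `<sil>` / `<unk>` labels are all assumptions made for illustration, and the real labels depend on the model's vocabulary.

```python
from typing import Iterable, Optional, Set, Tuple

# Hypothetical labels for silence and unknown words; adjust to the
# symbols actually used by your lexicon or token table.
SKIP_WORDS: Set[str] = {"<sil>", "<unk>"}


def utterance_confidence(
    words: Iterable[Tuple[str, float]],
    skip: Set[str] = SKIP_WORDS,
) -> Optional[float]:
    """Average the per-word confidences, ignoring silences and UNKs.

    `words` is an iterable of (word, confidence) pairs. Returns None
    when no scorable word remains after filtering.
    """
    kept = [conf for word, conf in words if word.lower() not in skip]
    if not kept:
        return None
    return sum(kept) / len(kept)


if __name__ == "__main__":
    hyp = [("hello", 0.9), ("<unk>", 0.1), ("<sil>", 0.99), ("world", 0.7)]
    print(utterance_confidence(hyp))
```

Returning `None` for an utterance with no scorable words avoids reporting a misleading 0.0 or 1.0 for pure silence.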
Please see this PR #598. It returns the timestamp for every word.
Hi, I tried to obtain confidences in Sherpa.
Observations:
but when I use a model that is trained for another language:
I would expect lower probabilities. The problem is that the wrong model cannot capture the true tokens, but it does capture some tokens that are acoustically present in the audio, and these captured tokens may come with high probability. A simple example: the probability of the ref comes out at ~0.8 and the probability of the hyp at ~0.5. I think missing tokens have to be penalized in some way.
Since we don't have ground truth available during decoding, it is hard to know when and where we have missed tokens.
Hello guys,
Is it possible in K2 to extract the start time and end time of every decoded (hypothesized) word, along with a recognition confidence for every word and for every full utterance?
Thanks,
AlexG.