Otsu threshold sometimes fails #44

Open
ttencate opened this issue May 26, 2020 · 0 comments
Labels
area:data Data and processing: audio recordings, photos kind:bug Something isn't working
Comments

@ttencate
Owner

Example:

./master.py --store_audio_files --debug_recording_ids xc:295996 --debug_utterances --debug_otsu_threshold

The histogram looks like a single Gaussian, not bimodal, because the recording is almost all utterance and no silence.

See if we can detect such cases and, when they happen, just trim from the beginning of the recording instead. Perhaps simply when more than 80% of the recording is classified as utterance?
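
A minimal sketch of what that check could look like, not the project's actual code. It assumes the recording has already been reduced to one energy value per frame, and it reads "trim from the beginning" as "keep the first `clip_frames` frames". The names `otsu_threshold`, `select_utterance_frames`, `max_utterance_fraction` and `clip_frames` are all hypothetical, as is the bin count. Whether the 80% fraction reliably flags the unimodal case is not something this sketch establishes.

```python
def otsu_threshold(values, num_bins=64):
    """Return the threshold that maximizes the between-class variance."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return lo  # All values identical: nothing to separate.
    width = (hi - lo) / num_bins

    # Build a histogram of the values.
    counts = [0] * num_bins
    for v in values:
        counts[min(int((v - lo) / width), num_bins - 1)] += 1
    centers = [lo + (i + 0.5) * width for i in range(num_bins)]

    total = len(values)
    total_sum = sum(c * x for c, x in zip(counts, centers))

    best_threshold, best_variance = lo, -1.0
    weight_below, sum_below = 0, 0.0
    # Try a split after each bin and keep the best one.
    for i in range(num_bins - 1):
        weight_below += counts[i]
        sum_below += counts[i] * centers[i]
        weight_above = total - weight_below
        if weight_below == 0 or weight_above == 0:
            continue
        mean_below = sum_below / weight_below
        mean_above = (total_sum - sum_below) / weight_above
        # Proportional to the between-class variance.
        variance = weight_below * weight_above * (mean_below - mean_above) ** 2
        if variance > best_variance:
            best_variance = variance
            best_threshold = lo + (i + 1) * width
    return best_threshold


def select_utterance_frames(energies, max_utterance_fraction=0.8, clip_frames=100):
    """Return (mask, used_fallback), where mask marks the frames to keep.

    Assumption: if more than max_utterance_fraction of the frames land above
    the Otsu threshold, the split is not trusted and the first clip_frames
    frames are kept instead.
    """
    if not energies:
        return [], False
    threshold = otsu_threshold(energies)
    mask = [e > threshold for e in energies]
    if sum(mask) / len(mask) > max_utterance_fraction:
        keep = min(clip_frames, len(energies))
        return [True] * keep + [False] * (len(energies) - keep), True
    return mask, False
```

Here a recording with an even mix of quiet and loud frames keeps its Otsu mask, while one with 90% loud frames falls back to the first `clip_frames` frames.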

@ttencate ttencate added this to the Version 1.0 milestone May 26, 2020
@ttencate ttencate added kind:bug Something isn't working area:data Data and processing: audio recordings, photos labels May 26, 2020