Make force alignment accessible from pocketsphinx_batch and the ps_decoder API #144
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
This provides a simple (maybe too simple) API for doing force alignment as well as a command-line interface for it via pocketsphinx_batch. This works like any other kind of search, you do:
In pocketsphinx_batch there is -alignctl, -aligndir and -alignext, these point you to a control file with transcription files (one file per utterance), the directory and file extensions.
The transcription is expected to be whitespace-separated tokens. It will add the
<s>
and</s>
tokens for you, which may or may not be the right thing to do (perhaps we should just add them if they aren't present).This will only do word alignments even though it is capable of doing more than that, because that's all the ps_seg_iter interface allows. We should probably fix that. In the near term I will add output to TextGrid files to the batch interface so we can get the phone segmentation that way, and also be drop-in compatible with the Montreal Force Aligner.