Support CTC Beam Search Decoder (KenLM Lexicon) #2072

carolineechen · 2021-12-14T16:52:34Z

Add support for a CTC Beam Search Decoder with KenLM language model support and lexicon constraint

bindings from flashlight / wav2letter project
decoder API similar to fairseq's KenLMDecoder
factory function for constructing the decoder (while core decoder API is not finalized)

mthrok · 2021-12-14T17:03:00Z

Thanks for the PR.

A) Can you import this diff and see if there is any unexpected memory issue?

Then, Could you split the PR (maybe leaving this PR as-is and make new ones) into

The code ported from FL with only minor cosmetic changes
The custom code you added for torchaudio (C++ and Python)
Build update. (CMake and stuff)

RE 1. we can land it with simple review without actual build process as long as A) does not yield any issue.
Then 2 and 3 become easier to handle.

facebook-github-bot · 2021-12-14T18:54:33Z

@carolineechen has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

nateanl · 2021-12-14T19:00:28Z

Looks like the CI test failures are from Python 3.6. Rebasing the PR should resolve it.

decoder api decoder api + files updates remove files fix docstring

nateanl · 2021-12-16T15:06:18Z

torchaudio/csrc/decoder/README.md

@@ -0,0 +1,39 @@
+# Flashlight Decoder Binding


This file can be put somewhere more explicit IMO. For example, CONTRIBUTING.md or in prototype directory.

nateanl · 2021-12-16T15:07:43Z

torchaudio/prototype/ctc_decoder.py

@@ -0,0 +1,222 @@
+import torch


The factory function is not imported in prototype/__init__.py. Could you add kenlm_lexicon_decoder to it?

prototype/__init__.py should not import anything (with external library dependencies).
I missed it when I was reviewing emformer/rnn-t, but they are pure-python implementation, and they won't cause segmentation fault, so they are exceptionally okay.

If ctcdecoder is imported in prototype/__init__.py, then KenLM becomes mandatory dependency for using anything in prototype module. It will force user who do not care about this decoder but want to use other prototype module to install KenLM, which is inconvenient.

I see. If users want to try prototype feature, are they expected to add it to __init__.py manually?

Nope. If we put everything in a separate submodule under prototype, and as long as prototype.__init__.py does not import this submodule, users should be able to perform import normally (from torchaudio.prototype.ctc_decoder import foo), while we mitigate the risk of broken dependency.

torchaudio └── prototype ├── __init__.py # does not import ctc_decoder module └── ctc_decoder ├── __init__.py # Initialize extension module here └── ctc_decoder.py

Got it. Sounds good to me.

torchaudio/prototype/ctc_decoder.py

mthrok · 2021-12-16T22:10:12Z

torchaudio/csrc/decoder/src/decoder/lm/KenLM.cpp

+
+#include <stdexcept>
+
+#include <kenlm/lm/model.hh>


Suggested change

#include <kenlm/lm/model.hh>

#include "lm/model.hh"

Although this is not wrong, but the upstream KenLM defines the header structure this way, so we need to use the local include here. It causes an error like this https://fburl.com/sandcastle/jivo6e9y

Actually I had a similar issue when installing torchaudio with ctc decoder. The kenlm repository has to be inside the audio directory in order to be built. You can try by installing KenLM outside of audio directory and build torchaudio.

Do you have any idea how to mitigate this issue?

I have some ideas for build process, but I need to experiment it as it has to be compatible with local build/CI build/fbcode build.

would "kenlm/lm/model.hh" work? the current suggestion "lm/model.hh" causes my local build to fail

facebook-github-bot · 2021-12-17T04:36:50Z

@carolineechen has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

Summary: part of #2072 -- splitting up the PR for easier review Add C++ files from [flashlight](https://github.com/flashlight/flashlight) that are needed for building CTC decoder w/ Lexicon and KenLM support Note: the code here will not be compiled until the build process is changed (future PR) Pull Request resolved: #2075 Reviewed By: mthrok Differential Revision: D33186825 Pulled By: carolineechen fbshipit-source-id: 5b69eea7634f3fae686471d988422942bb784cd9

Summary: part of #2072 -- splitting up the PR for easier review Add C++ files for binding CTC decoder functionality for Python Note: the code here will not be compiled until the build process is changed Pull Request resolved: #2079 Reviewed By: mthrok Differential Revision: D33196286 Pulled By: carolineechen fbshipit-source-id: 9fe4a8635b60ebfb594918bab00f5c3dccf96bd2

Summary: After all the C++ code from #2072 are added, this commit will enable decoder/KenLM integration in the build process. Pull Request resolved: #2078 Reviewed By: carolineechen Differential Revision: D33198183 Pulled By: mthrok fbshipit-source-id: 9d7fa76151d06fbbac3785183c7c2ff9862d3128

Summary: Part of #2072 -- splitting up PR for easier review This PR adds Python decoder API and basic README Pull Request resolved: #2089 Reviewed By: mthrok Differential Revision: D33299818 Pulled By: carolineechen fbshipit-source-id: 778ec3692331e95258d3734f0d4ab60b6618ddbc

Summary: part of pytorch#2072 -- splitting up the PR for easier review Add C++ files from [flashlight](https://github.com/flashlight/flashlight) that are needed for building CTC decoder w/ Lexicon and KenLM support Note: the code here will not be compiled until the build process is changed (future PR) Pull Request resolved: pytorch#2075 Reviewed By: mthrok Differential Revision: D33186825 Pulled By: carolineechen fbshipit-source-id: 5b69eea7634f3fae686471d988422942bb784cd9

Summary: part of pytorch#2072 -- splitting up the PR for easier review Add C++ files for binding CTC decoder functionality for Python Note: the code here will not be compiled until the build process is changed Pull Request resolved: pytorch#2079 Reviewed By: mthrok Differential Revision: D33196286 Pulled By: carolineechen fbshipit-source-id: 9fe4a8635b60ebfb594918bab00f5c3dccf96bd2

Summary: After all the C++ code from pytorch#2072 are added, this commit will enable decoder/KenLM integration in the build process. Pull Request resolved: pytorch#2078 Reviewed By: carolineechen Differential Revision: D33198183 Pulled By: mthrok fbshipit-source-id: 9d7fa76151d06fbbac3785183c7c2ff9862d3128

Summary: Part of pytorch#2072 -- splitting up PR for easier review This PR adds Python decoder API and basic README Pull Request resolved: pytorch#2089 Reviewed By: mthrok Differential Revision: D33299818 Pulled By: carolineechen fbshipit-source-id: 778ec3692331e95258d3734f0d4ab60b6618ddbc

* Add Profiling PyTorch workloads with the Instrumentation and Tracing Technology (ITT) API recipe

pytorch-probot bot added the ciflow/default label Dec 14, 2021

facebook-github-bot added the CLA Signed label Dec 14, 2021

carolineechen force-pushed the ctc-decoder-binding branch from b3cb134 to 9981f77 Compare December 14, 2021 18:49

Caroline Chen and others added 12 commits December 14, 2021 14:04

binding flashlight decoder

f314d36

decoder api

5484ce4

decoder api decoder api + files updates remove files fix docstring

refine api/README

9050fea

rebase and update cmake build

4b39cce

cleanup

91f59bd

rebase

7204e3d

Clean up CMakeLists

bbfb35e

separate decoder build

e83377d

update file organization

d848ade

removed USE_KENLM variable

e8ac10a

remove unused files and bindings

03da689

minor changes

cf32cb4

carolineechen force-pushed the ctc-decoder-binding branch from 9981f77 to cf32cb4 Compare December 14, 2021 19:04

nateanl reviewed Dec 16, 2021

View reviewed changes

carolineechen force-pushed the ctc-decoder-binding branch 2 times, most recently from 0f17a0a to 6080f39 Compare December 16, 2021 20:45

mthrok reviewed Dec 16, 2021

View reviewed changes

api/docs modifications

5981015

carolineechen force-pushed the ctc-decoder-binding branch from 6080f39 to 5981015 Compare December 17, 2021 01:16

carolineechen mentioned this pull request Dec 17, 2021

Add C++ files for CTC decoder #2075

Closed

mthrok mentioned this pull request Dec 17, 2021

Add FL Decoder / KenLM integration to build process #2078

Closed

carolineechen mentioned this pull request Dec 17, 2021

Add C++ files for CTC decoder bindings #2079

Closed

carolineechen mentioned this pull request Dec 20, 2021

Add Python CTC decoder API #2089

Closed

carolineechen closed this Dec 29, 2021

carolineechen deleted the ctc-decoder-binding branch December 29, 2021 20:39

carolineechen added the prototype label Jan 24, 2022

mthrok pushed a commit to mthrok/audio that referenced this pull request Dec 13, 2022

Add ITT recipe (pytorch#2072)

adda5fe

* Add Profiling PyTorch workloads with the Instrumentation and Tracing Technology (ITT) API recipe

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support CTC Beam Search Decoder (KenLM Lexicon) #2072

Support CTC Beam Search Decoder (KenLM Lexicon) #2072

carolineechen commented Dec 14, 2021

mthrok commented Dec 14, 2021

facebook-github-bot commented Dec 14, 2021

nateanl commented Dec 14, 2021

nateanl Dec 16, 2021

nateanl Dec 16, 2021

mthrok Dec 16, 2021

nateanl Dec 16, 2021 •

edited

Loading

mthrok Dec 16, 2021

nateanl Dec 16, 2021

mthrok Dec 16, 2021

nateanl Dec 16, 2021

mthrok Dec 16, 2021

carolineechen Dec 17, 2021

facebook-github-bot commented Dec 17, 2021

Support CTC Beam Search Decoder (KenLM Lexicon) #2072

Support CTC Beam Search Decoder (KenLM Lexicon) #2072

Conversation

carolineechen commented Dec 14, 2021

mthrok commented Dec 14, 2021

facebook-github-bot commented Dec 14, 2021

nateanl commented Dec 14, 2021

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

nateanl Dec 16, 2021 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

facebook-github-bot commented Dec 17, 2021

nateanl Dec 16, 2021 •

edited

Loading