GitHub - standyyyy/RapidASR: 商用级开源语音自动识别程序库，开箱即用，全平台支持，中英文混合识别。A Cross-platform implementation of ASR inference. It's based on ONNXRuntime and FunASR. We provide a set of easier APIs to call ASR models.

Rapid ASR

flowchart LR

A([wav]) --RapidVad--> B([各个小段的音频]) --RapidASR--> C([识别的文本内容]) --RapidPunc--> D([最终识别内容])

详情

2023-02-25
- 添加C++版本推理，使用onnxruntime引擎，预/后处理代码来自： FastASR
2023-02-14 v2.0.3 update:
- 修复librosa读取wav文件错误
- 修复fbank与torch下fbank提取结果不一致bug
2023-02-11 v2.0.2 update:
- 模型和推理代码解耦（rapid_paraformer和resources）
- 支持批量推理（通过resources/config.yaml中batch_size指定）
- 增加多种输入方式（Union[str, np.ndarray, List[str]]）
2023-02-10 v2.0.1 update:
- 添加对输入音频为噪音或者静音的文件推理结果捕捉。

Name		Name	Last commit message	Last commit date
Latest commit History 133 Commits
cpp_onnx		cpp_onnx
python		python
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md