Skip to content

Navigation Menu

Explore
By company size
By use case
By industry
View all solutions
Topics
- AI
- DevOps
- Security
- Software Development
- View all
Explore
- GitHub Sponsors
  Fund open source developers
- The ReadME Project
  GitHub community articles
Repositories
- Enterprise platform
  AI-powered developer platform
Available add-ons
Pricing

Search code, repositories, users, issues, pull requests...

Search

Clear

Search syntax tips

Provide feedback

We read every piece of feedback, and take your input very seriously.

Include my email address so I can be contacted

Saved searches

Use saved searches to filter your results more quickly

Name

Query

To see all available qualifiers, see our documentation.

You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session.

Dismiss alert

k2-fsa / sherpa-onnx Public

Notifications You must be signed in to change notification settings
Fork 467
Star 4k

Code
Issues 201
Pull requests 22
Discussions
Actions
Projects
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Security
Insights

Releases: k2-fsa/sherpa-onnx

Releases · k2-fsa/sherpa-onnx

Release v1.9.23

25 Apr 04:29

csukuangfj

This commit was created on github.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

Release v1.9.23

What's Changed

fix a typo in building language ID apk by @csukuangfj in #795
Add jieba for Chinese TTS models by @csukuangfj in #797
Increase CED's max frame length to 3000 by @csukuangfj in #798
Fix the last character not being recognized for streaming paraformer … by @csukuangfj in #799
Refactor TTS Android code to support jieba for Chinese TTS models by @csukuangfj in #800
wget 续传 by @bubao in #801
Refactor the JNI interface to make it more modular and maintainable by @csukuangfj in #802
Fix CI tests by @csukuangfj in #804
Refactor Java APIs by @csukuangfj in #806
Add Java API for non-streaming ASR by @csukuangfj in #807
Add dict_dir arg to c api to support Chinese TTS models using jieba by @csukuangfj in #809

Full Changelog: v1.9.22...v1.9.23

Contributors

csukuangfj and bubao

Assets 19

Loading

All reactions

v1.9.22

19 Apr 10:40

csukuangfj

This commit was created on github.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

v1.9.22

What's Changed

Replace torchaudio with soundfile in python-api-examples by @gtf35 in #765
Add C API for punctuation by @csukuangfj in #768
Add Kotlin API for audio tagging by @csukuangfj in #770
Adding warm up for Zipformer2 by @manickavela29 in #766
Fix display for sherpa-onnx-microphone by @csukuangfj in #773
Fix code style issues by @csukuangfj in #774
Add score function to speaker identification by @chiiyeh in #775
Add Android demo for audio tagging by @csukuangfj in #776
Add WearOS demo for audio tagging by @csukuangfj in #777
Add JNI support for spoken language identification by @csukuangfj in #782
Add Android demo for spoken language identification using Whisper multilingual models by @csukuangfj in #783
Support CED models by @csukuangfj in #792
Add Python API example for CED audio tagging. by @csukuangfj in #793
Release v1.9.22 by @csukuangfj in #794

New Contributors

@gtf35 made their first contribution in #765

Full Changelog: v1.9.19...v1.9.22

Contributors

csukuangfj, gtf35, and 2 other contributors

Assets 30

Loading

amirf147, ling989, chmod740, and Wangyabin-sss reacted with hooray emoji

jame25 and Wangyabin-sss reacted with rocket emoji

All reactions

🎉 4 reactions
🚀 2 reactions

5 people reacted

v1.9.19

13 Apr 11:10

csukuangfj

This commit was created on github.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

v1.9.19

What's Changed

Fix a punctuation bug by @csukuangfj in #764

Full Changelog: v1.9.18...v1.9.19

Contributors

csukuangfj

Assets 55

Loading

All reactions

v1.9.18

13 Apr 08:35

csukuangfj

This commit was created on github.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

v1.9.18

What's Changed

Fix building OpenFst on Windows. by @csukuangfj in #744
Fix go API examples with portaudio on Windows. by @csukuangfj in #746
Support audio tagging using zipformer by @csukuangfj in #747
Add C++ microphone examples for audio tagging by @csukuangfj in #749
Add SHERPA_ONNX_GITHUB by @bubao in #750
Fix a bug in mean calculation of 'ys_probs' by @aask1357 in #748
Add Python API and Python examples for audio tagging by @csukuangfj in #753
Add C API for audio tagging by @csukuangfj in #754
[feature] Configurable padding length by @manickavela29 in #755
Use batch size 1 in generating subtitles. by @csukuangfj in #756
Fix WebAssembly for kws by @csukuangfj in #758
Support adding punctuations to the speech recogntion result by @csukuangfj in #761
Add Python API for punctuation models. by @csukuangfj in #762
Release v1.9.18 by @csukuangfj in #763

New Contributors

@bubao made their first contribution in #750
@aask1357 made their first contribution in #748
@manickavela29 made their first contribution in #755

Full Changelog: v1.9.17...v1.9.18

Contributors

csukuangfj, bubao, and 2 other contributors

Assets 32

Loading

All reactions

punctuation-models

12 Apr 10:46

csukuangfj

punctuation-models

This commit was created on github.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

punctuation-models

Use batch size 1 in generating subtitles. (#756)

Assets 5

Loading

All reactions

audio-tagging-models

09 Apr 08:04

csukuangfj

audio-tagging-models

This commit was created on github.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

audio-tagging-models

Please see also

https://k2-fsa.github.io/sherpa/onnx/audio-tagging/

Assets 9

Loading

yuyun2000, ssf008, and Daviderikmollberg reacted with hooray emoji

rilshok reacted with rocket emoji

All reactions

🎉 3 reactions
🚀 1 reaction

4 people reacted

v1.9.17

08 Apr 13:38

csukuangfj

This commit was created on github.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

v1.9.17

What's Changed

Support heteronyms in Chinese TTS by @csukuangfj in #738
Add VAD examples using ALSA for recording by @csukuangfj in #739
Fix releasing GIL by @csukuangfj in #741
Support Chinese heteronyms on Android for TTS. by @csukuangfj in #742

Full Changelog: v1.9.16...v1.9.17

Contributors

csukuangfj

Assets 51

Loading

All reactions

v1.9.16

05 Apr 12:25

csukuangfj

This commit was created on github.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

v1.9.16

What's Changed

Fix building wasm in CI by @csukuangfj in #720
Add more piper models for text-to-speech by @csukuangfj in #725
Fix microphone privacy config by @yujinqiu in #727
Add language identification swiftui demo by @yujinqiu in #729
Add HLG decoding for streaming CTC models by @csukuangfj in #731
Add C API for streaming HLG decoding by @csukuangfj in #734
return timestamps for WebAssembly by @csukuangfj in #737

Full Changelog: v1.9.15...v1.9.16

Contributors

yujinqiu and csukuangfj

Assets 62

Loading

All reactions

v1.9.15

29 Mar 12:08

csukuangfj

This commit was created on github.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

v1.9.15

What's Changed

Add C API for spoken language identification. by @csukuangfj in #695
Add Swift API for spoken language identification. by @csukuangfj in #696
Add C# API for spoken language identification by @csukuangfj in #697
Use a single thread when loading models by @csukuangfj in #703
Build Android TTS APKs for coqui-ai/TTS models by @csukuangfj in #704
Support including TTS conditionally. by @csukuangfj in #699
add openfst.cmake file by @hantengc in #707
Add Golang API for VAD by @csukuangfj in #708
Sign released TTS APKs by @csukuangfj in #710
Add Golang API for spoken language identification. by @csukuangfj in #709
Fix ios-swift to remove invalid references by @csukuangfj in #713
delete incorrect logs by @longshiming in #714
Added progress for callback of tts generator by @leohuang2013 in #712
Add C API for speaker embedding extractor. by @csukuangfj in #711
Add Go API for speaker identification by @csukuangfj in #718
Release v1.9.15 by @csukuangfj in #719

New Contributors

@hantengc made their first contribution in #707
@leohuang2013 made their first contribution in #712

Full Changelog: v1.9.14...v1.9.15

Contributors

leohuang2013, longshiming, and 2 other contributors

Assets 62

Loading

gilcu3, Soebb, datkat21, and JohnClaw reacted with hooray emoji

All reactions

🎉 4 reactions

4 people reacted

v1.9.14

24 Mar 14:59

csukuangfj

This commit was created on github.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

v1.9.14

What's Changed

Support spoken language identification with whisper by @csukuangfj in #694

Full Changelog: v1.9.13...v1.9.14

Contributors

csukuangfj

Assets 60

Loading

All reactions

Previous 1 2 3 4 5 6 7 … 11 12 Next

Footer

© 2025 GitHub, Inc.

Footer navigation

Terms
Privacy
Security
Status
Docs
Contact

You can’t perform that action at this time.