From 1ef25a7936d70453b863ba24001ab9d7dadd24ab Mon Sep 17 00:00:00 2001 From: =?UTF-8?q?Piotr=20=C5=BBelasko?= Date: Wed, 25 Sep 2024 15:53:40 -0400 Subject: [PATCH] Add inference optimization blog post announcement to README MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Signed-off-by: Piotr Żelasko --- README.md | 10 ++++++++++ 1 file changed, 10 insertions(+) diff --git a/README.md b/README.md index 9b019d3ac175..f229cfb637ea 100644 --- a/README.md +++ b/README.md @@ -78,6 +78,16 @@
Speech Recognition +
+ + + Accelerating Leaderboard-Topping ASR Models 10x with NVIDIA NeMo + (2024/09/24) + + NVIDIA NeMo team released a number of inference optimizations for CTC, RNN-T, and TDT models that resulted in up to 10x inference speed-up. + These models now exceed an inverse real-time factor (RTFx) of 2,000, with some reaching RTFx of even 6,000. +

+