Add inference optimization blog post announcement to README (#10623)

Signed-off-by: Piotr Żelasko <petezor@gmail.com>
NVIDIA · Sep 25, 2024 · e35a659
1 parent e8304d6
Showing 1 changed file with 10 additions and 0 deletions.
diff --git a/README.md b/README.md
@@ -78,6 +78,16 @@
 
 <details open>
   <summary><b>Speech Recognition</b></summary>
+    <details>
+      <summary>
+        <a href="https://developer.nvidia.com/blog/accelerating-leaderboard-topping-asr-models-10x-with-nvidia-nemo/">
+          Accelerating Leaderboard-Topping ASR Models 10x with NVIDIA NeMo
+        </a> (2024/09/24)
+      </summary>
+      The NVIDIA NeMo team released a number of inference optimizations for CTC, RNN-T, and TDT models that resulted in up to a 10x inference speed-up.
+      These models now exceed an inverse real-time factor (RTFx) of 2,000, with some reaching an RTFx as high as 6,000.
+      <br><br>
+    </details>
     <details>
       <summary>
         <a href="https://developer.nvidia.com/blog/new-standard-for-speech-recognition-and-translation-from-the-nvidia-nemo-canary-model/">