Latency metrics - Time to first token, Tokens per second #964
Closed
nikcaryo-super
started this conversation in
Ideas
Replies: 3 comments 5 replies
-
Hi @nikcaryo-super -- just wanted to let you know that we have now started to work on this and will ship soon. Will ping you once it's live. |
Beta Was this translation helpful? Give feedback.
1 reply
-
Hi @nikcaryo-super, both of the metrics are available by now on the generations table :) |
Beta Was this translation helpful? Give feedback.
0 replies
-
i want to measure time to first token metric, i am using ollama, how do i do it? |
Beta Was this translation helpful? Give feedback.
4 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
really useful as latency metrics for Voice AI applications
Beta Was this translation helpful? Give feedback.
All reactions