Replies: 2 comments 1 reply
-
Hey @dwstevens! I'm Dosu, a bot here to help you out with bugs, questions, and more. Let's tackle this issue together while we wait for a human maintainer. I found a similar discussion regarding token cost counting when using Langchain's LLM cache. In that case, it was noted that the Langfuse UI might show costs and tokens for both cached and non-cached calls, which could be a discrepancy in how Langfuse interprets or displays cached results [1]. Additionally, there is an open issue tracking this behavior, suggesting that the current implementation might not accurately reflect cost savings from prompt caching [2]. If you're experiencing similar issues, it might be related to this known discrepancy.
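If you want to see this for yourself, here is a minimal sketch of the setup being discussed, assuming the `langchain`, `langchain-community`, `langchain-openai`, and `langfuse` packages plus `OPENAI_API_KEY` and `LANGFUSE_*` credentials in the environment; the model name and prompt below are arbitrary placeholders, not anything specific from this thread:

```python
# Minimal sketch: run the same prompt twice with Langchain's LLM cache
# enabled and the Langfuse callback attached, then compare the two
# generations in the Langfuse UI to see how the cached call is costed.
from langchain.globals import set_llm_cache
from langchain_community.cache import InMemoryCache
from langchain_openai import ChatOpenAI
from langfuse.callback import CallbackHandler  # import path may differ by Langfuse SDK version

set_llm_cache(InMemoryCache())  # enable Langchain's in-memory LLM cache

handler = CallbackHandler()  # picks up LANGFUSE_* env vars for credentials
llm = ChatOpenAI(model="gpt-4o-mini")  # placeholder model

# The first call hits the provider; the second identical call is served
# from the cache. Per the discussion above, the cached call may still be
# shown with the same token counts and cost in the Langfuse UI.
for _ in range(2):
    llm.invoke("What is prompt caching?", config={"callbacks": [handler]})
```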
-
This is a very near-term product update that's coming, thanks for sharing that this is important to you!
-
Wondering if Langfuse can accurately calculate the cost when prompt caching is used.