OpenAI Prompt Caching: Add cached_tokens parameter in usage_metadata of AI response #27334
Closed
ShubhamMaddhashiya-bidgely started this conversation in Ideas
1 comment · 5 replies
- I too would like to see this feature in LangChain, if it doesn't already exist.
Feature request
OpenAI recently introduced a prompt caching feature that is automatically available for newer models. With this update, OpenAI's responses now include details on how many cached tokens were used for a request.
The current response schema includes the following structure:
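(Sketched below for illustration; the token counts are made up, and the field names follow OpenAI's prompt caching documentation.)

```python
# Shape of the `usage` object returned by the Chat Completions API when
# prompt caching applies; `prompt_tokens_details.cached_tokens` reports how
# many prompt tokens were served from the cache.
usage = {
    "prompt_tokens": 2006,
    "completion_tokens": 300,
    "total_tokens": 2306,
    "prompt_tokens_details": {
        "cached_tokens": 1920,
    },
    "completion_tokens_details": {
        "reasoning_tokens": 0,
    },
}
```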
Surfacing these fields would align LangChain's capabilities with OpenAI's latest API features and give users greater visibility into prompt efficiency and performance.
Motivation
Adding support for these new fields, specifically the cached_tokens parameter, would let users monitor cache hit rates and optimize their token usage more effectively.
Proposal (If applicable)
I'm not sure whether this enhancement is already in development, but if not, I would love to contribute to implementing it.
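As a rough sketch of what this could look like from the caller's side, assuming a hypothetical cached_tokens key inside usage_metadata (the actual key name and placement would be up to the maintainers):

```python
from langchain_openai import ChatOpenAI

llm = ChatOpenAI(model="gpt-4o-mini")

# Repeat the same long prompt so the second call can hit OpenAI's prompt cache
# (caching applies to prompts of roughly 1024+ tokens on supported models).
long_prompt = "Summarize the following text:\n" + "lorem ipsum dolor sit amet " * 300
llm.invoke(long_prompt)
second = llm.invoke(long_prompt)

# Hypothetical: if the cached token count were surfaced in usage_metadata,
# callers could track cache hit rates like this.
usage = second.usage_metadata or {}
cached = usage.get("cached_tokens", 0)        # proposed field, not yet in LangChain
prompt_tokens = usage.get("input_tokens", 0)  # existing usage_metadata field
hit_rate = cached / prompt_tokens if prompt_tokens else 0.0
print(f"prompt cache hit rate: {hit_rate:.1%}")
```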