
ChatBedrockConverse#stream not streaming response for model ids with cross region inference when bind_tools is used #239

Closed
renjiexu-amzn opened this issue Oct 14, 2024 · 3 comments · Fixed by #242

Comments

@renjiexu-amzn
Contributor

To reproduce: the following code produces the same behavior as invoke (the full response arrives at once); if the .bind_tools line is commented out, the response is streamed properly.

from langchain_aws import ChatBedrockConverse
from langchain_core.tools import tool

@tool(response_format="content_and_artifact")
def simple_calculator(a: int, b: int):
    """Use this tool to calcuate the sum of two integers.

    Args:
        a (int): The first integer.
        b (int): The second integer.

    Returns:
        int: The sum of the two integers.
    """
    return a + b

llm = ChatBedrockConverse(
    model="us.anthropic.claude-3-sonnet-20240229-v1:0",
    temperature=0,
    top_p=1,
    max_tokens=4096,
    region_name="us-west-2"
).bind_tools(tools=[simple_calculator])

a = llm.stream(
    input=[
        ("human", "Hello"),
    ],
)

full = next(a)

for x in a:
    print(x)
    full += x

print(full)
@langcarl langcarl bot added the investigate label Oct 14, 2024
@renjiexu-amzn
Contributor Author

renjiexu-amzn commented Oct 14, 2024

The root cause is that the logic for inferring the provider from the model/model ID doesn't handle cross-region inference profile IDs properly; for those IDs the provider is the second element after splitting on ".", not the first.
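
For illustration, a naive split on "." (a sketch of the inference logic, not the actual langchain-aws code) picks up the region prefix instead of the provider for these IDs:

model_id = "us.anthropic.claude-3-sonnet-20240229-v1:0"

print(model_id.split(".")[0])  # "us" -- the cross-region prefix, not a provider
print(model_id.split(".")[1])  # "anthropic" -- the actual provider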

The workaround is to explicitly pass the provider value when setting up ChatBedrockConverse:

from langchain_aws import ChatBedrockConverse
from langchain_core.tools import tool

@tool(response_format="content_and_artifact")
def simple_calculator(a: int, b: int):
    """Use this tool to calcuate the sum of two integers.

    Args:
        a (int): The first integer.
        b (int): The second integer.

    Returns:
        int: The sum of the two integers.
    """
    return a + b

llm = ChatBedrockConverse(
    model="us.anthropic.claude-3-sonnet-20240229-v1:0",
    temperature=0,
    top_p=1,
    max_tokens=4096,
    region_name="us-west-2",
    provider="anthropic"
).bind_tools(tools=[simple_calculator])

a = llm.stream(
    input=[
        ("human", "Hello"),
    ],
)

full = next(a)

for x in a:
    print(x)
    full += x

print(full)

@3coins 3coins changed the title ChatBedrockConverse#stream not streaming the response if has bind_tools ChatBedrockConverse#stream not streaming response for model ids with cross region inference when bind_tools is used Oct 15, 2024
@3coins
Collaborator

3coins commented Oct 15, 2024

@renjiexu-amzn
Thanks for reporting this issue. The Converse API has many different ways to specify a model: a mix of ARNs, foundation model IDs, inference profile IDs, and plain model IDs. While we can look at a long-term solution that supports and identifies each of these formats, a short-term fix to support inference profile IDs (without hard-coding regions) is to look at how many parts the model ID has. Here is a quick attempt at this:

def get_provider(model_id: str) -> str:
    parts = model_id.split(".")
    return parts[1] if len(parts) == 3 else parts[0]

assert "meta" == get_provider("meta.llama3-2-3b-instruct-v1:0") # mode id
assert "meta" == get_provider("us.meta.llama3-2-3b-instruct-v1:0") # inference profile id

Let me know if the above works for you, and if you want to open a PR to make the change.

An alternate solution could be to use the Bedrock API to get more info about the model. I am not sure whether the Bedrock API returns the provider info for all models, so we would have to verify that. This solution would also need some more consideration around calling the API only once, during initialization of the chat class.
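
As a rough sketch of that alternative (assuming the Bedrock control-plane GetFoundationModel API, which returns a providerName in modelDetails; whether it covers every model is the verification mentioned above):

import boto3

def lookup_provider(model_id: str, region_name: str) -> str:
    # Hypothetical helper; would be called once during chat class initialization.
    bedrock = boto3.client("bedrock", region_name=region_name)
    parts = model_id.split(".")
    # Strip a cross-region prefix such as "us." so GetFoundationModel receives a foundation model ID.
    base_model_id = ".".join(parts[1:]) if len(parts) == 3 else model_id
    details = bedrock.get_foundation_model(modelIdentifier=base_model_id)
    return details["modelDetails"]["providerName"].lower()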

@sidatcd

sidatcd commented Nov 18, 2024

These inference profile model IDs are validated by their region prefixes.
There is one available for APAC now. Can we include that in the validation list? A sketch of what that could look like is below.
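
A minimal sketch (hypothetical names; langchain-aws may not keep an explicit prefix list like this):

# Hypothetical prefix list; "apac" added per this comment.
KNOWN_INFERENCE_PROFILE_PREFIXES = {"us", "eu", "apac"}

def get_provider(model_id: str) -> str:
    parts = model_id.split(".")
    # Cross-region inference profile IDs look like "<region-prefix>.<provider>.<model>".
    if len(parts) > 2 and parts[0] in KNOWN_INFERENCE_PROFILE_PREFIXES:
        return parts[1]
    return parts[0]

assert get_provider("apac.anthropic.claude-3-sonnet-20240229-v1:0") == "anthropic"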
