
ValueError: Missing some input keys: - passed data requires input keys if containing { } #13055

Closed
Kaotic3 opened this issue Nov 8, 2023 · 4 comments
Labels
🤖:bug Related to a bug, vulnerability, unexpected error with an existing feature · Ɑ: models Related to LLMs or chat model modules

Comments


Kaotic3 commented Nov 8, 2023

System Info

Python 3.11.5
Langchain (pip show) 0.0.327
Windows OS
Visual Studio Code

Who can help?

@eyurtsev

Information

  • The official example notebooks/scripts
  • My own modified scripts

Related Components

  • LLMs/Chat Models
  • Embedding Models
  • Prompts / Prompt Templates / Prompt Selectors
  • Output Parsers
  • Document Loaders
  • Vector Stores / Retrievers
  • Memory
  • Agents / Agent Executors
  • Tools / Toolkits
  • Chains
  • Callbacks/Tracing
  • Async

Reproduction

I searched and was surprised this has not come up.

I am using LangChain for a RAG workflow. When I send a document whose content contains { }, it throws a missing-key error. The document content is treated the same way as a normal prompt template, where "question {question}" implies an input key of 'question', so every { } in the document is reported as a separate missing key.

For example, my data contains this:

"...1 2 ------------------------------------ {w14 w15 w16se w16cid w16 w16cex w16sdtdh wp14}{DP}{AD}{S::}"

It will say that we are missing numerous keys:

ValueError: Missing some input keys: {'AD', 'w14 w15 w16se w16cid w16 w16cex w16sdtdh wp14', ...}

Now, I can clean the data before sending it (a sketch of what that could look like follows), but I was wondering whether it should behave like this at all, given that the braces arrive inside the document as content rather than as part of the template.
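For illustration, the cleaning step could look roughly like this. escape_braces is a hypothetical helper name, and the sketch assumes the document text will later end up inside a template that goes through exactly one str.format pass:

def escape_braces(text: str) -> str:
    # Double every brace so str.format treats it as a literal character.
    return text.replace("{", "{{").replace("}", "}}")

cleaned = escape_braces("...1 2 --- {w14 w15 w16se}{DP}{AD}{S::}")
# cleaned == "...1 2 --- {{w14 w15 w16se}}{{DP}}{{AD}}{{S::}}"
# After one formatting pass this renders back to the original single braces.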

I use the "FewShotPromptTemplate" to create a prompt which includes a "Suffix" and my suffix is:

def get_suffix():
    return """
    Document: {content}
    Question: {question}
    """

Here content is the content of the document that contains the { } set out above.

I build the prompt like this:

prompt_template = FewShotPromptTemplate(
    examples=examples,
    example_prompt=get_prompt_template(example_template, example_variables),
    prefix=prefix,
    suffix=suffix,
    input_variables=input_variables,
)
prompt = prompt_template.format(question=question, context=context)
return prompt

I also did a test using another piece of code:

document_context = text_response + "{AD}"
prompt = ChatPromptTemplate.from_template("my_specific_prompt: {document}.\n{format_instructions}")
formatted_prompt = prompt.format(**{"document": document_context, "format_instructions": output_parser.get_format_instructions()})

This introduces a stray {AD} into the text response. It did not fail: it messed up the results, but it did not cause any missing input key errors.

So this may be limited to the FewShotPromptTemplate?
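For reference, this matches how Python's plain str.format behaves: values substituted into a placeholder are not re-scanned for further placeholders, so braces inside a value are harmless, while braces in the template string itself are parsed as placeholders.

template = "Document: {document}"
# Braces inside the *value* are left alone; the output keeps "{AD}" literally.
print(template.format(document="text with {AD} inside"))

# Braces inside the *template* are parsed as placeholders:
# "Document: text with {AD} inside".format()  # raises KeyError: 'AD'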

Expected behavior

I would have expected anything passed in via a placeholder to be treated as plain text: not parsed for further keys embedded inside it, and not raising an error when those keys cannot be found.

Maybe I am wrong, but that is what I would have expected, and it is what appears to happen when using ChatPromptTemplate.from_template.

dosubot bot commented Nov 8, 2023

🤖

Hello @Kaotic3,

I'm here to help! I'm an AI bot dedicated to assisting with issues related to LangChain. I can help you debug issues, answer your questions, and guide you on how to become a better contributor. While we wait for a human maintainer, let's see if we can tackle this issue together. I'm analyzing your problem now and will get back to you with a more detailed response soon.

@dosubot dosubot bot added Ɑ: models Related to LLMs or chat model modules 🤖:bug Related to a bug, vulnerability, unexpected error with an existing feature labels Nov 8, 2023

dosubot bot commented Nov 8, 2023

🤖

Based on the information you've provided and the context of similar issues in the LangChain repository, it seems that the issue you're experiencing is related to how Python's built-in string formatting mechanism works. When you use curly braces {} in a string that you're formatting with the format method, Python interprets anything inside the braces as a variable to be replaced. If a key in the template is not found in the keyword arguments, a KeyError is raised.

In your case, the document content you're passing to the FewShotPromptTemplate contains curly braces, which Python is interpreting as variables. Since these variables are not defined in your code, you're seeing a ValueError: Missing some input keys.

There are a few potential solutions to this issue:

  1. Escape the curly braces in your document content: You can escape the curly braces in your document content by doubling them. Here's how you can modify your code:
document_content = "...1 2 ------------------------------------ {{w14 w15 w16se w16cid w16 w16cex w16sdtdh wp14}}{{DP}}{{AD}}{{S::}}"

By doubling the curly braces, you're instructing Python to treat them as literal characters in the string, rather than as placeholders for variables.

  2. Use a different template format: The FewShotPromptTemplate class includes a template_format parameter that specifies the format of the prompt template. The options are 'f-string' and 'jinja2', and the choice determines how curly brackets are treated. With 'f-string' (the default), single curly braces mark variables, so any { } in the text is parsed as a placeholder. With 'jinja2', variables are written with double braces such as {{ content }}, so single braces in the document content are left alone. You might consider using 'jinja2' as your template format (see the note after this list about rewriting your own placeholders in jinja2 syntax).
prompt_template = FewShotPromptTemplate(
    examples=examples,
    example_prompt=get_prompt_template(example_template, example_variables),
    prefix=prefix,
    suffix=suffix,
    input_variables=input_variables,
    template_format='jinja2'
)
  3. Upgrade to the latest version of LangChain: The latest version of LangChain is v0.0.331rc2, and you're currently using v0.0.327. It's possible that the issue you're experiencing has been fixed in a more recent version of LangChain.
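One thing to keep in mind with option 2: when template_format='jinja2', the placeholders in your own prefix and suffix must also use jinja2's double-brace syntax. The suffix from the original post would be rewritten roughly as:

def get_suffix():
    return """
    Document: {{ content }}
    Question: {{ question }}
    """

Single braces in the document content are then left untouched by the jinja2 renderer.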

I hope this helps! If you have any further questions or if the issue persists, please let me know.

This response is meant to be useful and save you time. It is not meant to be a precise solution, but rather a starting point for your own research.

Help me be more useful! Please leave a 👍 if this is helpful and 👎 if it is irrelevant.

If you want to continue the conversation, start your reply with @dosu-bot.


Kaotic3 commented Nov 8, 2023

Thanks, jinja2 worked perfectly.

@Kaotic3 Kaotic3 closed this as completed Nov 8, 2023
@Kaotic3 Kaotic3 reopened this Nov 8, 2023
@Kaotic3 Kaotic3 closed this as completed Nov 8, 2023
@humphreyde


I got the same problem: there are many "{x}..{xx}" patterns in my system prompt. How can I handle this?
