Support for Pydantic Model as Signature InputField type #1904

rohitgarud · 2024-12-08T05:29:57Z

Currently, only a Signature OutputField can have a Pydantic model as its type. Adding one to an InputField is not reflected in the system prompt.

okhat · 2024-12-08T13:11:26Z

Hey @rohitgarud, it works though, right? It just doesn't describe the schema. We don't need to describe the schema of the input, since we show the input value to the model.

rohitgarud · 2024-12-08T13:39:17Z

Yes, @okhat, it works. I was wondering whether adding descriptions of the fields in the input Pydantic model could help the model use the provided information better. Was the decision to leave out the InputField schema based on experimentation, or on the judgment that the data itself is already available to the model? If you have not tried it already, I will try adding the schema to the input field using a custom adapter and check.

okhat · 2024-12-08T14:24:50Z

It was an analytic decision. Why should we show the schema to the model, if it sees the actual structure + values anyway?

However, I'm interested in whether there are edge cases where showing the input schema is important.
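
One candidate edge case, sketched under assumptions: a serialized value carries field names and values, but not the per-field descriptions that live only in the schema. The `Patient` model and its fields below are hypothetical, and the sketch uses only Pydantic v2 calls (`model_dump_json`, `model_json_schema`). It illustrates the information gap and is not how DSPy formats inputs.

```python
from pydantic import BaseModel, Field


class Patient(BaseModel):
    """Hypothetical input model, used only for illustration."""

    age: int = Field(description="Age in completed years")
    hr: int = Field(description="Resting heart rate in beats per minute")


patient = Patient(age=54, hr=61)

# What the model sees when only the value is shown.
value_text = patient.model_dump_json()

# What the schema adds: the description attached to each field.
schema = Patient.model_json_schema()
descriptions = {
    name: prop.get("description", "")
    for name, prop in schema["properties"].items()
}

print(value_text)
print(descriptions)
```

With an abbreviated field name such as `hr`, the value alone does not say what unit or quantity is meant, while the schema does.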

rohitgarud · 2024-12-08T18:48:57Z

What would you prefer:

1. multiple (say 5-6) InputFields with descriptions
2. single InputField with Pydantic model and show the schema using a custom adapter

okhat · 2024-12-08T21:00:59Z

Of course the first one :D

rohitgarud · 2024-12-09T06:38:17Z

In that case, what happens if the data for an InputField is optional and missing? I will be testing this, but I wanted to get your intuition on it.
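
A small sketch of the optional-and-missing case with a single Pydantic input, assuming Pydantic v2 and a hypothetical `Ticket` model: an unset optional field either serializes as an explicit `None` or disappears from the value entirely with `exclude_none=True`, while the schema still lists it. This only shows what the serialized input looks like; how a language model reacts to it still needs testing.

```python
from typing import Optional

from pydantic import BaseModel, Field


class Ticket(BaseModel):
    """Hypothetical input model with one optional field."""

    title: str = Field(description="Short summary of the problem")
    priority: Optional[str] = Field(
        default=None, description="One of low, medium, high, if known"
    )


ticket = Ticket(title="Login page times out")

# Default serialization keeps the missing field as an explicit None.
with_null = ticket.model_dump()

# exclude_none drops it, so the value alone no longer mentions the field.
without_null = ticket.model_dump(exclude_none=True)

# The schema lists the field either way.
schema_fields = sorted(Ticket.model_json_schema()["properties"])

print(with_null)
print(without_null)
print(schema_fields)
```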

rohitgarud · 2024-12-09T09:04:17Z

> What would you prefer:
>
> 1. multiple (say 5-6) InputFields with descriptions
> 2. single InputField with Pydantic model and show the schema using a custom adapter

Going against your suggestion, I went with option 2 😄 by doing this in the custom adapter (a little hacky, but I will improve it later):

import enum
import inspect
from typing import Literal

from pydantic import TypeAdapter

# ProcessSchema and get_dspy_field_type are defined elsewhere (not shown here).


def field_metadata(field_name, field_info):
    """Build the prompt annotation for a single signature field."""
    type_ = field_info.annotation

    if type_ is str:
        desc = ""
    elif type_ is bool:
        desc = "must be True or False"
    elif type_ in (int, float):
        desc = f"must be a single {type_.__name__} value"
    elif inspect.isclass(type_) and issubclass(type_, enum.Enum):
        desc = f"must be one of: {'; '.join(type_.__members__)}"
    elif hasattr(type_, "__origin__") and type_.__origin__ is Literal:
        desc = f"must be one of: {'; '.join([str(x) for x in type_.__args__])}"  # noqa: E501
    else:
        # Any other type (e.g. a Pydantic model): describe it by its JSON schema.
        desc = "must be parseable according to the following JSON schema: "
        processed_schema = ProcessSchema(
            schema=TypeAdapter(type_).json_schema()
        ).transform_schema()
        desc += processed_schema

    # Prefix non-empty descriptions with an indented note; keep empty ones empty.
    desc = (" " * 8) + f"# note: the value you produce {desc}" if desc else ""

    # For input fields, reword the note to describe the input, not the output.
    if get_dspy_field_type(field_info) == "input":
        desc = desc.replace(
            "# note: the value you produce must be", "#note: input will be"
        )
        desc = desc.replace("parseable according to the", "having")

    return f"{{{field_name}}}{desc}"

Rationale:

- Fields and descriptions of inputs and outputs stay in a single location inside the models, keeping the program cleaner.
- Nested Pydantic models can be used for input.
- Because I process the schema in the custom adapter before injecting it into the system prompt, the increase in the number of tokens is manageable.
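
`ProcessSchema` is not shown in this thread, so the following is only a guess at the kind of preprocessing that keeps the token count down: a hypothetical `compact_schema` helper that inlines `$defs` references and drops `title` annotations from a JSON schema dict before serializing it. It uses only the standard library and assumes the schema has no recursive references.

```python
import json


def compact_schema(schema: dict) -> str:
    """Inline $defs references, drop titles, and return compact JSON.

    Hypothetical helper, not the ProcessSchema class from this thread.
    Assumes the schema has no recursive references.
    """
    defs = schema.get("$defs", {})

    def simplify(node):
        if isinstance(node, list):
            return [simplify(item) for item in node]
        if not isinstance(node, dict):
            return node
        if "$ref" in node:
            # "#/$defs/Address" -> look up "Address" and inline it.
            name = node["$ref"].split("/")[-1]
            return simplify(defs[name])
        return {
            key: simplify(value)
            for key, value in node.items()
            # Drop string-valued "title" annotations, but keep a property
            # that happens to be named "title" (its value is a dict).
            if key != "$defs" and not (key == "title" and isinstance(value, str))
        }

    return json.dumps(simplify(schema), separators=(",", ":"))


# Hand-written schema in the shape Pydantic emits for a nested model.
schema = {
    "title": "User",
    "type": "object",
    "properties": {
        "name": {"title": "Name", "type": "string"},
        "address": {"$ref": "#/$defs/Address"},
    },
    "required": ["name", "address"],
    "$defs": {
        "Address": {
            "title": "Address",
            "type": "object",
            "properties": {"city": {"title": "City", "type": "string"}},
            "required": ["city"],
        }
    },
}

compact = compact_schema(schema)
print(compact)
```

Field descriptions are kept on purpose, since they are the part of the schema the value alone does not convey.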

Please let me know your thoughts on this

thomasahle · 2024-12-10T15:42:37Z

Sometimes a codebase may already have a bunch of pydantic types it uses. It's inconvenient to "unpack" them into a signature, rather than just using them directly. So maybe some automatic "unpacking formatter" is not a bad idea.
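
A rough sketch of what such an unpacking step could look like, assuming Pydantic v2: walk `model_fields` on an existing model and emit one flat spec per field, which a formatter could then turn into individual input fields. `unpack_model` and `Order` are hypothetical names, and wiring the result into a DSPy signature is left out because this thread does not show that part.

```python
from pydantic import BaseModel, Field


def unpack_model(model_cls: type[BaseModel]) -> list[dict]:
    """List each top-level field of a Pydantic model as a flat spec."""
    specs = []
    for name, info in model_cls.model_fields.items():
        specs.append(
            {
                "name": name,
                "type": info.annotation,
                "desc": info.description or "",
                "required": info.is_required(),
            }
        )
    return specs


class Order(BaseModel):
    """Hypothetical model that already exists in a codebase."""

    item: str = Field(description="Name of the ordered item")
    quantity: int = Field(default=1, description="Number of units")


specs = unpack_model(Order)
for spec in specs:
    print(spec)
```

This only handles top-level fields; nested models would need either recursion or a fallback to showing their schema.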

rohitgarud · 2024-12-11T04:43:35Z

Thank you @thomasahle, I would really appreciate it if you could review this approach for an unpacking formatter. It works really well, even with smaller models. I think we can still use the Structured Output feature from the providers. The usage is shown above, in the ProcessSchema call.
