Typed dict extract fields #1253

skrawcz · 2024-12-07T18:50:02Z

Adds typeddict support for extract_fields

from typing import TypedDict
from hamilton.function_modifiers import extract_fields

class MyDict(TypedDict):
    foo: str
    bar: int

@extract_fields()
def some_function()->MyDict:
    return MyDict(foo="s", bar=1)

The above will automatically extract the fields foo and bar.

You can also do:

from typing import TypedDict
from hamilton.function_modifiers import extract_fields

class MyDict(TypedDict):
    foo: str
    bar: int

@extract_fields({"foo": str})
def some_function()->MyDict:
    return MyDict(foo="s", bar=1)

To only expose a subset of the fields.

Changes

extract_fields decorator behavior -> it can now handle TypedDict

How I tested this

locally
unit tests

Notes

fixes @extract_fields - support TypedDict as return type #1252

Checklist

PR has an informative and human-readable title (this will be pulled into the release notes)
Changes are limited to a single goal (no scope creep)
Code passed the pre-commit check & code is left cleaner/nicer than when first encountered.
Any change in functionality is tested
New functions are documented (with a description, list of inputs, and expected output)
Placeholder code is flagged / future TODOs are captured in comments
Project documentation has been updated if adding/changing functionality.

ellipsis-dev

❌ Changes requested. Reviewed everything up to 1782dcf in 33 seconds

More details

Looked at 130 lines of code in 2 files
Skipped 0 files when reviewing.
Skipped posting 2 drafted comments based on config settings.

1. tests/function_modifiers/test_expanders.py:365

Draft comment:
Consider adding a test case to check that an InvalidDecoratorException is raised when the specified fields do not match the TypedDict fields. This will ensure that the validation logic for TypedDicts is thoroughly tested.
Reason this comment was not posted:
Comment looked like it was already resolved.

2. hamilton/function_modifiers/expanders.py:777

Draft comment:
The logic for checking if fields is a subset of the TypedDict fields is repeated. Consider refactoring this into a separate function to adhere to the DRY principle.
Reason this comment was not posted:
Marked as duplicate.

Workflow ID: wflow_6h5DrbBG53z73ihx

Want Ellipsis to fix these issues? Tag @ellipsis-dev in a comment. You can customize Ellipsis with 👍 / 👎 feedback, review rules, user-specific overrides, quiet mode, and more.

ellipsis-dev · 2024-12-08T02:56:39Z

hamilton/function_modifiers/expanders.py

+                _validate_extract_fields(self.fields)
+            else:
+                # check that fields is a subset of TypedDict that is defined
+                typed_dict_fields = typing.get_type_hints(output_type)


The logic for checking if fields is a subset of the TypedDict fields is repeated. Consider refactoring this into a separate function to adhere to the DRY principle.

ellipsis-dev

👍 Looks good to me! Incremental review on f0bd568 in 17 seconds

More details

Looked at 20 lines of code in 1 files
Skipped 0 files when reviewing.
Skipped posting 2 drafted comments based on config settings.

1. hamilton/function_modifiers/expanders.py:777

Draft comment:
The _validate_extract_fields function call has been moved to ensure it validates self.fields after it is potentially set from a TypedDict. This change ensures that the fields are validated correctly when using TypedDict.
Reason this comment was not posted:
Confidence changes required: 10%
The PR modifies the validate method in extract_fields to handle TypedDict. The change moves the _validate_extract_fields call to ensure it is executed after setting self.fields when TypedDict is used. This ensures validation is performed on the correct fields. The change seems correct and aligns with the PR description.

2. hamilton/function_modifiers/expanders.py:783

Draft comment:
The error message here is misleading since TypedDict is also supported. Consider updating it to reflect that TypedDict is a valid return type.

                f"For extracting fields, output type must be a dict, typing.Dict, or TypedDict, not: {output_type}"

Reason this comment was not posted:
Comment was on unchanged code.

Workflow ID: wflow_K8SLBPmr15irZq3O

You can customize Ellipsis with 👍 / 👎 feedback, review rules, user-specific overrides, quiet mode, and more.

elijahbenizzy

One minor point

hamilton/function_modifiers/expanders.py

zilto · 2024-12-08T20:31:43Z

Could we loosen the constraints to enable Mapping type?

skrawcz · 2024-12-08T20:55:10Z

Could we loosen the constraints to enable Mapping type?

if it's quick to do. Otherwise can punt for another PR.

ellipsis-dev

👍 Looks good to me! Incremental review on 4e8e239 in 30 seconds

More details

Looked at 78 lines of code in 2 files
Skipped 0 files when reviewing.
Skipped posting 3 drafted comments based on config settings.

1. tests/function_modifiers/test_expanders.py:361

Draft comment:
Consider returning an instance of MyDictInheritance in return_dict to better simulate a real-world scenario and ensure the test covers both validation and extraction logic.
Reason this comment was not posted:
Confidence changes required: 50%
The test function test_extract_fields_validate_happy_inheritance is testing the validation of the extract_fields decorator with a TypedDict that involves inheritance. The test is correctly set up to ensure that the decorator can handle subclass relationships. However, the test function return_dict returns an empty dictionary, which might not be the best way to test the functionality. It would be more appropriate to return an instance of MyDictInheritance to better simulate a real-world scenario. This would ensure that the test is not only checking the validation logic but also the actual extraction logic.

2. tests/function_modifiers/test_expanders.py:369

Draft comment:
Consider returning an instance of MyDictInheritanceBadCase in return_dict to better simulate a real-world scenario and ensure the test covers both validation and extraction logic.
Reason this comment was not posted:
Confidence changes required: 50%
The test function test_extract_fields_validate_not_subclass is designed to test the validation logic of the extract_fields decorator when the field type is not a subclass of the expected type. The test is correctly set up to ensure that the decorator raises an exception in this scenario. However, similar to the previous test, the function return_dict returns an empty dictionary. It would be more appropriate to return an instance of MyDictInheritanceBadCase to better simulate a real-world scenario and ensure the test covers both validation and extraction logic.

3. hamilton/function_modifiers/expanders.py:772

Draft comment:
The validate function in extract_fields is complex and would benefit from comments explaining the logic, especially around handling TypedDicts and field validation.
Reason this comment was not posted:
Comment was on unchanged code.

Workflow ID: wflow_qBjhjNI8059Htw65

You can customize Ellipsis with 👍 / 👎 feedback, review rules, user-specific overrides, quiet mode, and more.

This in response to #1252. We should be able to handle typeddict better. This sketches some ideas: 1. field validation should happen in .validate() not the constructor. 2. extract_fields shouldn't need fields if the typeddict is the annotation type. 3. we properly check that typeddict can be a return type.

skrawcz linked an issue Dec 7, 2024 that may be closed by this pull request

@extract_fields - support TypedDict as return type #1252

Closed

skrawcz mentioned this pull request Dec 7, 2024

@extract_fields - support TypedDict as return type #1252

Closed

skrawcz force-pushed the typed_dict_extract_fields branch from 413c282 to 594f4e9 Compare December 8, 2024 00:00

skrawcz marked this pull request as ready for review December 8, 2024 02:55

ellipsis-dev bot reviewed Dec 8, 2024

View reviewed changes

skrawcz requested a review from elijahbenizzy December 8, 2024 02:59

ellipsis-dev bot reviewed Dec 8, 2024

View reviewed changes

elijahbenizzy reviewed Dec 8, 2024

View reviewed changes

hamilton/function_modifiers/expanders.py Outdated Show resolved Hide resolved

ellipsis-dev bot reviewed Dec 11, 2024

View reviewed changes

elijahbenizzy approved these changes Dec 11, 2024

View reviewed changes

skrawcz force-pushed the typed_dict_extract_fields branch 2 times, most recently from 2f3e9b9 to 4637bbd Compare December 12, 2024 07:13

skrawcz added 4 commits December 11, 2024 23:13

Adds typeddict tests

17261c2

Adding validation to cover all extract_fields paths

7266a26

Adds Typeddict Extract fields subclass type check and test for it

d3495fc

skrawcz force-pushed the typed_dict_extract_fields branch from 4637bbd to d3495fc Compare December 12, 2024 07:14

skrawcz merged commit 622866a into main Dec 12, 2024
20 of 24 checks passed

skrawcz deleted the typed_dict_extract_fields branch December 12, 2024 07:34

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Typed dict extract fields #1253

Typed dict extract fields #1253

skrawcz commented Dec 7, 2024 •

edited

Loading

ellipsis-dev bot left a comment

ellipsis-dev bot Dec 8, 2024

ellipsis-dev bot left a comment

elijahbenizzy left a comment

zilto commented Dec 8, 2024

skrawcz commented Dec 8, 2024

ellipsis-dev bot left a comment

Typed dict extract fields #1253

Typed dict extract fields #1253

Conversation

skrawcz commented Dec 7, 2024 • edited Loading

Changes

How I tested this

Notes

Checklist

ellipsis-dev bot left a comment

Choose a reason for hiding this comment

ellipsis-dev bot Dec 8, 2024

Choose a reason for hiding this comment

ellipsis-dev bot left a comment

Choose a reason for hiding this comment

elijahbenizzy left a comment

Choose a reason for hiding this comment

zilto commented Dec 8, 2024

skrawcz commented Dec 8, 2024

ellipsis-dev bot left a comment

Choose a reason for hiding this comment

skrawcz commented Dec 7, 2024 •

edited

Loading