[minor] Recursive to/from_dict #1602
Conversation
Adds a create_from_dict function that takes the output of HasDict.to_dict and turns it into a live object, similar to our earlier to_object() method on ProjectHDFio. Adds an instantiate class method to HasDict to support this; it allows HasDictfromHDF to work and will be useful for dataclasses in the future. HasDict.to_dict now goes over the contents of what is returned from _to_dict and automatically converts any HasDict/HasHDF objects it finds. I haven't used this in downstream code yet to keep the change small, but in principle this will allow GenericJob/DataContainer to stop calling to_dict on their children explicitly and let the generic interface handle it. The rest of the changes are renaming everything to _from_dict/_to_dict and normalizing the argument name to obj_dict.
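As a rough illustration of the round trip described above: a minimal sketch, in which all class names, the registry, and the NAME key are simplified stand-ins for the actual pyiron_base machinery, not the real API.

```python
# Toy registry so create_from_dict can map a type tag back to a class;
# the real PR resolves types differently.
_REGISTRY = {}


class HasDict:
    """Subclasses implement only _to_dict()/_from_dict()."""

    def __init_subclass__(cls, **kwargs):
        super().__init_subclass__(**kwargs)
        _REGISTRY[cls.__name__] = cls

    @classmethod
    def instantiate(cls, obj_dict):
        # hypothetical hook: create an empty object before filling it
        return cls()

    def to_dict(self):
        # recursively convert nested HasDict children to dicts
        obj_dict = {
            k: (v.to_dict() if isinstance(v, HasDict) else v)
            for k, v in self._to_dict().items()
        }
        obj_dict["NAME"] = type(self).__name__
        return obj_dict

    @classmethod
    def from_dict(cls, obj_dict):
        # nested objects are restored *before* the subclass _from_dict runs
        converted = {
            k: (create_from_dict(v) if isinstance(v, dict) and "NAME" in v else v)
            for k, v in obj_dict.items()
        }
        obj = cls.instantiate(converted)
        obj._from_dict(converted)
        return obj


def create_from_dict(obj_dict):
    return _REGISTRY[obj_dict["NAME"]].from_dict(obj_dict)


class Executable(HasDict):
    def __init__(self, path=""):
        self.path = path

    def _to_dict(self):
        return {"path": self.path}

    def _from_dict(self, obj_dict):
        self.path = obj_dict["path"]


class Job(HasDict):
    def __init__(self):
        self.executable = Executable()

    def _to_dict(self):
        # no explicit executable.to_dict() call needed
        return {"executable": self.executable}

    def _from_dict(self, obj_dict):
        # by this point the executable is a live object again
        self.executable = obj_dict["executable"]


job = Job()
job.executable.path = "/usr/bin/lammps"
restored = create_from_dict(job.to_dict())
print(restored.executable.path)  # -> /usr/bin/lammps
```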
if "executable" in obj_dict.keys() and obj_dict["executable"] is not None:
    self._executable = obj_dict["executable"]
I am a bit surprised about this part: when is the dictionary of the executable converted to the executable object? Is this already happening when reading the executable from the HDF5 file?
In this new setup, HasDict.from_dict converts nested objects back from their dictionary form before the implementation's HasDict._from_dict is called. So by the time GenericJob._from_dict runs and looks for the executable, Executable._from_dict has already run.
In principle the reverse also works, i.e. it is no longer necessary to call to_dict on one's children that are returned in the obj_dict.
I think it makes sense to document this at least in the HasDict class, so developers understand how to derive their own classes from HasDict and that they only have to implement the _to_dict() and _from_dict() interface. Finally, I am wondering if it makes sense to overload the __getstate__() and __setstate__() functions in the HasDict class. Then we can slowly transition from calling to_dict() and from_dict() to using the __getstate__() and __setstate__() interface to reload the pyiron objects. This would simplify the interface for developers who just want to integrate a new code. As long as their classes can be pickled, we can store their classes in our jobs. If they want to optimise the performance and benefit from the hierarchical nature of the HDF5 file, they can overload the __getstate__() and __setstate__() methods, for example by attaching a dataclass to their class which they use for data storage, or by attaching a data container.
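The transition proposed here could look something like the following sketch, where pickling is routed through the dict interface; Server and its field are invented for illustration and are not the actual pyiron_base classes.

```python
import pickle


class HasDict:
    """Sketch: __getstate__/__setstate__ delegate to the dict interface."""

    def _to_dict(self):
        raise NotImplementedError

    def _from_dict(self, obj_dict):
        raise NotImplementedError

    def __getstate__(self):
        # pickle (and copy) see the plain dictionary representation
        return self._to_dict()

    def __setstate__(self, state):
        # pickle restores the object by feeding the dict back in
        self._from_dict(state)


class Server(HasDict):
    def __init__(self, cores=1):
        self.cores = cores

    def _to_dict(self):
        return {"cores": self.cores}

    def _from_dict(self, obj_dict):
        self.cores = obj_dict["cores"]


server = pickle.loads(pickle.dumps(Server(cores=8)))
print(server.cores)  # -> 8
```

With this in place, any subclass that implements _to_dict()/_from_dict() is automatically picklable, which is the simplification for external code integrators described above.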
def load(inner_dict):
    if not isinstance(inner_dict, dict):
        return inner_dict
    if not all(
        k in inner_dict for k in ("NAME", "TYPE", "OBJECT", "DICT_VERSION")
    ):
        return {k: load(v) for k, v in inner_dict.items()}
    return create_from_dict(inner_dict)
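A standalone illustration of the dispatch logic in load: only dicts carrying all four reserved metadata keys are treated as serialized objects; plain dicts are recursed into and everything else passes through. Here create_from_dict is a stub, since the real factory is defined elsewhere in this PR.

```python
RESERVED = ("NAME", "TYPE", "OBJECT", "DICT_VERSION")


def create_from_dict(inner_dict):
    # stand-in for the real factory: just report what would be restored
    return f"<restored {inner_dict['NAME']}>"


def load(inner_dict):
    if not isinstance(inner_dict, dict):
        return inner_dict
    if not all(k in inner_dict for k in RESERVED):
        # a plain dict: recurse into its values
        return {k: load(v) for k, v in inner_dict.items()}
    # all reserved keys present: this dict is a serialized object
    return create_from_dict(inner_dict)


plain = {"a": 1, "nested": {"b": 2}}
tagged = {
    "outer": {
        "NAME": "Executable",
        "TYPE": "...",
        "OBJECT": "...",
        "DICT_VERSION": "0.1",
    }
}
print(load(plain))   # -> {'a': 1, 'nested': {'b': 2}}
print(load(tagged))  # -> {'outer': '<restored Executable>'}
```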
Ok, I understand that the conversion is handled here, still I am wondering if this is what we want. I thought the intention was to move towards using __getstate__() and __setstate__(). Now that the executable and the server object use dataclasses for data storage internally, I was hoping that we could use a simple dictionary representation there, where the class is stored by storing the attached dataclass.
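The idea of storing a class by storing its attached dataclass might look like this sketch; ExecutableData and its fields are hypothetical, not the actual dataclasses in pyiron_base.

```python
import pickle
from dataclasses import asdict, dataclass


@dataclass
class ExecutableData:
    # hypothetical fields for illustration only
    name: str = ""
    version: str = ""


class Executable:
    """The attached dataclass holds the entire persistent state."""

    def __init__(self):
        self._data = ExecutableData()

    def __getstate__(self):
        # store the object as the plain dict of its dataclass
        return asdict(self._data)

    def __setstate__(self, state):
        self._data = ExecutableData(**state)


exe = Executable()
exe._data.name = "lammps"
restored = pickle.loads(pickle.dumps(exe))
print(restored._data.name)  # -> lammps
```

The stored representation is then a flat dict of primitive fields, which maps naturally onto a single HDF5 group without any nested TYPE/OBJECT metadata.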
pyiron_base/interfaces/has_dict.py
for k, v in self._to_dict().items():
    if isinstance(v, HasDict):
        child_dict[k] = v.to_dict()
    elif isinstance(v, HasHDF):
        child_dict[k] = HasDictfromHDF.to_dict(v)
    else:
        data_dict[k] = v
Just to clarify this: this part currently does not work for the Atoms object in pyiron_atomistics, as it is neither derived from HasDict nor from HasHDF, correct? It only works when the Atoms object was stored as part of a DataContainer, correct?
This allows storing "arbitrary" objects again in the input/output of TemplateJobs.
Separate PR for easier review. I will add extra unit tests, but it already worked in my local notebook tests.