Made it easier to instantiate environments #74

whitead · 2024-10-17T22:37:53Z

Added a new way to create environments based on an explicit task. Also made it possible to see available environments (although the listing is approximate).

sidnarayanan · 2024-10-17T22:50:57Z

src/aviary/env.py

+    @classmethod
+    def from_name(cls, name: str, task: str | None = None, **env_kwargs) -> Self:
+        new_cls = _get_cls_from_name(ENV_REGISTRY, name)
+        if task is not None:
+            return new_cls.from_task(task)
+        return new_cls(**env_kwargs)


Does this become a problem if a subclass uses task in its init?

Will add a check for that

ok - added a check
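The check itself is not shown in this thread. As a minimal sketch of what such a guard could look like, assuming it simply rejects a call that passes both a task and constructor keyword arguments: the `TOY_REGISTRY` dict, the `EchoEnv` class, and the error message below are hypothetical stand-ins, not the aviary code.

```python
from __future__ import annotations


class Environment:
    """Toy stand-in for the base class; not the real aviary Environment."""

    @classmethod
    def from_task(cls, task: str) -> Environment:
        raise NotImplementedError(f"{cls.__name__} does not implement from_task.")

    @classmethod
    def from_name(
        cls, name: str, task: str | None = None, **env_kwargs
    ) -> Environment:
        new_cls = TOY_REGISTRY[name]  # hypothetical registry, defined below
        if task is not None:
            if env_kwargs:
                # Assumed check: refuse to guess whether the caller wanted
                # from_task(task) or the constructor with keyword arguments.
                raise ValueError("Cannot pass both a task and environment kwargs.")
            return new_cls.from_task(task)
        return new_cls(**env_kwargs)


class EchoEnv(Environment):
    """Hypothetical environment that just stores its prompt."""

    def __init__(self, prompt: str = "", verbose: bool = False) -> None:
        self.prompt = prompt
        self.verbose = verbose

    @classmethod
    def from_task(cls, task: str) -> EchoEnv:
        return cls(prompt=task)


# Hypothetical registry mapping names directly to classes, for brevity.
TOY_REGISTRY: dict[str, type[Environment]] = {"echo": EchoEnv}
```

In this sketch, `Environment.from_name("echo", task="hi")` goes through `from_task`, while `Environment.from_name("echo", prompt="hi", verbose=True)` goes through the constructor. Because `from_name` consumes the `task` keyword itself, a subclass whose `__init__` takes a parameter named `task` would still not receive it this way.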

jamesbraza · 2024-10-17T23:16:20Z

src/aviary/env.py

        self.end_immediately = end_immediately
+        self.task = task


Can we name this self.subject = task?

I'd rather keep it consistent. What is done with the task isn't so important here, but I'd like to be able to trace its path.

Sounds good

jamesbraza · 2024-10-17T23:43:23Z

src/aviary/env.py

        self.end_immediately = end_immediately
+        self.task = task


Sounds good

jamesbraza · 2024-10-17T23:44:34Z

src/aviary/env.py

@@ -432,7 +469,7 @@ def __bool__(self) -> bool:
        return True


-def _construct_obj_from_name(registry: dict[str, tuple[str, str]], name: str, **kwargs):
+def _get_cls_from_name(registry: dict[str, tuple[str, str]], name: str):


Nice change here, like it
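The body of the helper is not shown in the diff. A minimal sketch of a lookup that returns a class rather than an instance, assuming each registry value is a `(module path, class name)` pair resolved lazily with `importlib`: the `DEMO_REGISTRY` contents and the error message are illustrative assumptions, and the function is named `get_cls_from_name` here to mark it as a stand-in.

```python
import importlib


def get_cls_from_name(registry: dict[str, tuple[str, str]], name: str) -> type:
    """Resolve a registry entry to a class without instantiating it."""
    try:
        # Assumption: values are (module path, class name) pairs.
        module_name, cls_name = registry[name]
    except KeyError:
        raise ValueError(
            f"Unknown name {name!r}; known names: {sorted(registry)}"
        ) from None
    # Import only when the entry is requested, so optional packages stay optional.
    module = importlib.import_module(module_name)
    return getattr(module, cls_name)


# Hypothetical registry that points at a standard-library class for the demo.
DEMO_REGISTRY = {"ordered": ("collections", "OrderedDict")}

ordered_cls = get_cls_from_name(DEMO_REGISTRY, "ordered")
instance = ordered_cls(a=1)  # the caller decides how to construct
```

Returning the class leaves the construction path to the caller, which is what lets `from_name` choose between `new_cls.from_task(task)` and `new_cls(**env_kwargs)`.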

Ryan-Rhys · 2024-10-18T18:05:30Z

src/aviary/env.py

@@ -248,8 +248,35 @@ async def close(self) -> None:
        """

    @classmethod
-    def from_name(cls, name: str, **env_kwargs) -> Self:
-        return _construct_obj_from_name(ENV_REGISTRY, name, **env_kwargs)
+    def from_task(cls, task: str) -> Self:


Not sure I understand the use-case for from_task. Is there a specific environment where this kind of behavior is desirable?

I understand it is for inference time?

Presumably task is not an arbitrary string as a user prompt would be? It seems as though it must correspond to a valid problem_id?

This is for inference time, so that you can have user-defined tasks instead of tasks coming from a training or eval set.

An example would help I think

See Future-House/ldp#109

This is to enable scripts/entry points so that an end user can use the environments

I left a comment above: it makes the most sense to me for environments like HotpotQA, where the question can be open-ended. However, I don't know what happens when the user passes an arbitrary problem_id to environments like GSM8K. Similarly, in cloning, I can't see where self.problem_id is used.

I missed that the problem argument is also being set as task

That now makes sense
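To make the inference-time use case concrete, here is a minimal sketch contrasting a dataset-backed environment with one built from a user-defined task. `ToyQAEnv` and its fields are hypothetical and only mirror the shape of the `from_task` overrides in this PR; the real environments take different arguments and use their own placeholder values.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass
class ToyQAEnv:
    """Hypothetical question-answering environment (not an aviary class)."""

    question: str
    correct_answer: str | None = None

    @classmethod
    def from_task(cls, task: str) -> ToyQAEnv:
        # A user-supplied task has no ground truth, so the answer is a placeholder.
        return cls(question=task, correct_answer=None)


# Training or evaluation: the question and answer come from a dataset row.
train_env = ToyQAEnv(question="Who wrote Hamlet?", correct_answer="William Shakespeare")

# Inference: the task text is whatever the end user typed.
user_env = ToyQAEnv.from_task("Which river flows through Paris?")
```

The task string ends up in the same field the agent would normally read the dataset question from, so the rest of the environment does not need to know where the question originated.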

Ryan-Rhys · 2024-10-18T18:18:02Z

LOL

albertbou92

LGTM!

Ryan-Rhys · 2024-10-18T18:23:34Z

packages/gsm8k/src/aviary/gsm8k/env.py

@@ -50,6 +50,10 @@ def __init__(
        self.check_tool = Tool.from_function(self.check_answer)
        self.tools = [self.calc_tool, self.check_tool]

+    @classmethod
+    def from_task(cls, task: str) -> "CalculatorEnv":
+        return cls(problem_id="task", problem=task, answer=0.0)


Where is self.problem_id used in the GSM8K environment aside from being exported as a dictionary in export_frame?

I don't know - it's a required argument so I just put a placeholder. Do you think I should refactor to make id optional?

Ah, I missed that task is being passed in to problem

Ryan-Rhys · 2024-10-18T18:25:37Z

packages/hotpotqa/src/aviary/hotpotqa/env.py

@@ -191,6 +191,10 @@ def __init__(
            create_tool(self.finish, "Finish"),
        ]

+    @classmethod
+    def from_task(cls, task: str) -> "HotPotQAEnv":
+        return cls(question=task, correct_answer=0.0)


I think HotpotQA makes the most sense intuitively to me, since question can be open-ended, allowing the user to pass an arbitrary question at inference time.

Ryan-Rhys · 2024-10-18T18:30:42Z

src/aviary/env.py

+        in calling an LLM. This is how the environment should be used after training
+        and in deployment. We don't take config here, because the default environment config
+        should be general for arbitrary tasks. Or, the config should be coupled to the agent
+        training (future TODO).


Might be worth adding some examples in the docstring e.g.

For the HotpotQA environment, a question not featured in the HotpotQA dataset.

For the GSM8K environment, a math word question not featured in the GSM8K dataset.

Ok - will add these in next revision set
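A minimal sketch of how the suggested examples might sit in the docstring. The wording is illustrative and the class is a toy stand-in; the revised docstring in the repository may read differently.

```python
class Environment:
    """Toy stand-in for the base class; not the real aviary Environment."""

    @classmethod
    def from_task(cls, task: str) -> "Environment":
        """Create an environment from a free-form task description.

        Examples of a task:
            - HotpotQA environment: a question not featured in the
              HotpotQA dataset.
            - GSM8K environment: a math word problem not featured in the
              GSM8K dataset.
        """
        raise NotImplementedError(f"{cls.__name__} does not implement from_task.")
```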

Added task envs

bbda6b6

dosubot bot added the size:M (This PR changes 30-99 lines, ignoring generated files) and enhancement (New feature or request) labels Oct 17, 2024

whitead requested review from jamesbraza, sidnarayanan and Ryan-Rhys October 17, 2024 22:38

Fixed str concat stuff

965c1fb

whitead requested a review from albertbou92 October 17, 2024 22:41

sidnarayanan reviewed Oct 17, 2024

Added check on specifying task

6f8a09f

whitead mentioned this pull request Oct 17, 2024

Added main entry point Future-House/ldp#109

Merged

jamesbraza reviewed Oct 17, 2024

Added more documentation about tasks and name

14940be

whitead requested review from sidnarayanan and jamesbraza October 17, 2024 23:38

jamesbraza approved these changes Oct 18, 2024

Ryan-Rhys reviewed Oct 18, 2024

albertbou92 approved these changes Oct 18, 2024

Ryan-Rhys reviewed Oct 18, 2024

whitead merged commit a20118e into main Oct 18, 2024
5 of 6 checks passed

whitead deleted the env-tass branch October 18, 2024 18:42
