Goal conditioning integration #5142
Conversation
```diff
@@ -92,6 +92,11 @@ class ScheduleType(Enum):
     LINEAR = "linear"


+class ConditioningType(Enum):
```
Like we discussed in the design doc, we probably don't need anything more than hyper, but it might still be worth keeping this just in case. For instances where the user finds out that the hypernetwork hurts performance on their task for whatever reason, it might be easier to disable it in the trainer than to rebuild their environment without goals.
Made hyper the default and no conditioning the second option.
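The resolution above (hyper as the default, with a "none" option kept as an opt-out) might look roughly like this. This is a hedged sketch: only `class ConditioningType(Enum):` and the `HYPER` default appear in the diffs in this thread, so the member names and value strings here are assumptions.

```python
from enum import Enum


class ConditioningType(Enum):
    # Default: condition the network body on the goal via a hypernetwork.
    HYPER = "hyper"
    # Opt-out: ignore goal observations in the encoder, so a user whose task
    # is hurt by the hypernetwork can disable it in the trainer config
    # instead of rebuilding their environment without goals.
    NONE = "none"


# attrs-style default, as in the NetworkSettings diff later in the thread
default = ConditioningType.HYPER
```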
I think it should be `hypernetwork` - is that too long?
Co-authored-by: Arthur Juliani <awjuliani@gmail.com>
…ity-Technologies/ml-agents into goal-conditioning-integration
```diff
@@ -115,6 +120,7 @@ def _check_valid_memory_size(self, attribute, value):
     num_layers: int = 2
     vis_encode_type: EncoderType = EncoderType.SIMPLE
     memory: Optional[MemorySettings] = None
+    conditioning_type: ConditioningType = ConditioningType.HYPER
```
Should we maybe use `goal_conditioning_type`?
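Since `conditioning_type` lives in `NetworkSettings`, it would presumably surface in the trainer YAML under `network_settings`. A hedged config sketch: the field name follows the diff at this point in the PR (the comment above suggests renaming it), the `hyper`/`none` values are inferred from the discussion, and the behavior name is hypothetical.

```yaml
behaviors:
  MyBehavior:            # hypothetical behavior name
    trainer_type: ppo
    network_settings:
      num_layers: 2
      conditioning_type: hyper   # or "none" to disable goal conditioning
```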
```diff
@@ -79,9 +94,6 @@ def forward(self, inputs: List[torch.Tensor]) -> torch.Tensor:
         """
         Encode observations using a list of processors and an RSA.
         :param inputs: List of Tensors corresponding to a set of obs.
-        :param processors: a ModuleList of the input processors to be applied to these obs.
-        :param rsa: Optionally, an RSA to use for variable length obs.
-        :param x_self_encoder: Optionally, an encoder to use for x_self (in this case, the non-variable inputs.).
```
Thanks for removing this 😅
```python
encoding = self.linear_encoder(encoded_self)
if isinstance(self.linear_encoder, ConditionalEncoder):
    goal = self.observation_encoder.get_goal_encoding(inputs)
    encoding = self.linear_encoder(encoded_self, goal)
```
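The snippet above dispatches on the encoder type: a `ConditionalEncoder` takes the goal encoding as a second argument. The core idea being integrated is that a hypernetwork generates the body encoder's weights from the goal. A minimal NumPy sketch of that mechanism, assuming illustrative names and shapes (this is not the actual ml-agents implementation):

```python
import numpy as np

rng = np.random.default_rng(0)
obs_size, goal_size, hidden_size = 8, 4, 16

# Hypernetwork parameters: map the goal encoding to a flattened
# (weight, bias) pair for the body's linear layer.
W_hyper = rng.normal(0.0, 0.1, size=(goal_size, obs_size * hidden_size + hidden_size))


def conditional_encode(encoded_self: np.ndarray, goal: np.ndarray) -> np.ndarray:
    """Generate per-goal weights, then apply them to the observation encoding."""
    params = goal @ W_hyper                                   # (B, obs*hidden + hidden)
    w = params[:, : obs_size * hidden_size].reshape(-1, hidden_size, obs_size)
    b = params[:, obs_size * hidden_size:]                    # (B, hidden)
    out = np.einsum("bho,bo->bh", w, encoded_self) + b        # batched matmul
    return np.maximum(out, 0.0)                               # ReLU


batch = 5
encoding = conditional_encode(
    rng.normal(size=(batch, obs_size)),
    rng.normal(size=(batch, goal_size)),
)
```

Each observation in the batch is thus encoded by a different linear layer, whose weights depend on that sample's goal.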
If we're going to do both Conditional and Linear, I think we should rename it to just `encoder` or `body_encoder`, but not really that necessary.
```python
@property
def total_goal_enc_size(self) -> int:
    """
    Returns the total encoding size for this ObservationEncoder.
```
Suggested change:
```diff
-    Returns the total encoding size for this ObservationEncoder.
+    Returns the total goal encoding size for this ObservationEncoder.
```
Proposed change(s)
[Do not merge, first merge the modules, then merge this into main]
Integration of the hypernetwork into the network body.
Useful links (Github issues, JIRA tickets, ML-Agents forum threads etc.)
Types of change(s)
Checklist
Other comments