Fix bug in model export with new action output nodes #4705

dongruoping · 2020-12-04T01:55:16Z

Proposed change(s)

Force the new action output and deprecated action output to be different nodes in the graph

Useful links (Github issues, JIRA tickets, ML-Agents forum threads etc.)

The deprecated action_out_deprecated is assigned to be the same as continuous or discrete action output according to action spec.

However the new action output and deprecated action output will point to the same tensor and same node in the graph, causing one of them being missing is the graph since one node can only have one name.

Solving this issue by forcing them to be different nodes in the graph.

Also fixing missing nodes in dynamic_axes.

Types of change(s)

Checklist

Added tests that prove my fix is effective or that my feature works
Updated the changelog (if applicable)
Updated the documentation (if applicable)
Updated the migration guide (if applicable)

Other comments

vincentpierre

Left a comment : rename the discrete output node before merging 🚢 🇮🇹

vincentpierre · 2020-12-04T18:20:09Z

ml-agents/mlagents/trainers/torch/model_serialization.py

        if self.policy.behavior_spec.action_spec.discrete_size > 0:
            self.output_names += ["discrete_actions", "discrete_action_output_shape"]
+            self.dynamic_axes.update({"discrete_actions": {0: "batch"}})


Since the output will actually be discrete_actions_log_probs instead of discrete_actions, we need to rename this output.

I think this will have to keep the same name for backward compat concern(old C# + new python), and we give the actual discrete action a new name like sampled_discrete_actions

* Add hybrid action capability flag (#4576) * Change BrainParametersProto to support ActionSpec (#4579) * Assign new BrainParametersProto fields based on capabilities (#4581) * ActionBuffer with hybrid actions for RemotePolicy (#4592) * Barracuda inference for hybrid actions (#4611) * Refactor BarracudaModel loader checks (#4629) * Export separate nodes for continuous/discrete actions (#4655) * Separate continuous/discrete actions in AgentActionProto (#4698) * Force different nodes for new and deprecated action output (#4705)

dongruoping added 2 commits December 3, 2020 17:44

force different nodes for new and deprecated action output

2d0d7e8

fix dynamic axis

48bb219

dongruoping requested a review from vincentpierre December 4, 2020 01:55

vincentpierre approved these changes Dec 4, 2020

View reviewed changes

dongruoping merged commit fa3e093 into develop-hybrid-actions-csharp Dec 4, 2020

delete-merged-branch bot deleted the develop-hybrid-fix-export branch December 4, 2020 18:30

github-actions bot locked as resolved and limited conversation to collaborators Dec 4, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix bug in model export with new action output nodes #4705

Fix bug in model export with new action output nodes #4705

dongruoping commented Dec 4, 2020

vincentpierre left a comment

vincentpierre Dec 4, 2020

dongruoping Dec 4, 2020

Fix bug in model export with new action output nodes #4705

Fix bug in model export with new action output nodes #4705

Conversation

dongruoping commented Dec 4, 2020

Proposed change(s)

Useful links (Github issues, JIRA tickets, ML-Agents forum threads etc.)

Types of change(s)

Checklist

Other comments

vincentpierre left a comment

Choose a reason for hiding this comment

vincentpierre Dec 4, 2020

Choose a reason for hiding this comment

dongruoping Dec 4, 2020

Choose a reason for hiding this comment