Variable sensors shape #4686

ValerioB88 · 2020-11-23T10:30:25Z

Hello. I am trying to create agents that can pass a variable number of observations. One way to do it could be to always compute and pass the maximum number of observations, and then filter them in python, once received. However, when there are VisualObservations, this becomes computationally expensive, and it's an inefficient approach. It would be more appropriate to be able to programmatically chose, at every time-step, what observation to send.

After exploring the code, I think the easy way to do that would be to have an additional attribute in the ISensor class, which would be isActive. Then, in MLAgents.SendInfoToBrain, when calling m_Brain.RequestDecision(m_info, sensors) we would filter for sensors having the isActive to true.

Alternatively, the user may specify from a checkbox in the UnityEditor whether he wants to use variable size observations. If this flag is true, then isActive is checked, otherwise is ignored altogether with very little computational overhead.

I am happy to work on this straight away if approved. Otherwise I will just create my own sensor class.

The text was updated successfully, but these errors were encountered:

andrewcoh · 2020-11-23T14:37:57Z

Hi @ValerioB88

This is a feature that we are planning to add ourselves in the next month or two but we haven't yet decided on the best way to implement it.

Thank you for showing your interest in this feature. Can you share your use case?

ValerioB88 · 2020-11-23T15:15:27Z

Sure. I am running a computer vision experiment, an object recognition task (not strightly RL). At each iteration I am passing Y frames of X objects. The network is a RNN that learns to recognize these objects. I want the network to be trained on a variable number of frames per object.

I have done some research in the code today and it doesn't seem possible to implement this features by expanding the classes in ml-agents. A perfect point to check whether to pass or not a sensor is in Agents.SendInfoToBrain. My idea was to inherit from Agent and reimplement that function with a check on the sensor object (keeping everything the same -literally copy and pasting everything - but check the sensor in RequestDecision). But that functions is full of private attributes, so I can't just copy it in my inherited agents. I can't even Base.Call() it because otherwise it will send the decision. So my only option is to change Agent itself, which is.. bad. :(

RedTachyon · 2020-12-03T22:28:02Z

Hi, I was actually really hoping this would exist in ML-Agents. My use case is for multiagent scenarios - I want an agent to observe whatever other agents are nearby, which can vary over time.

My current workaround is just getting all the nearby agents and selecting N closest ones, or padding with NaNs if there's fewer neighbors than maximum, to make sure I don't accidentally use any placeholder values. It's... not ideal, but guess that's what I'm stuck with for now.

RedTachyon · 2021-02-10T01:34:15Z

Any news on this feature?

andrewcoh · 2021-02-10T18:34:04Z

Hi @RedTachyon

This feature has been written but unfortunately merging to master is blocked due to a dependency on a Barracuda version that is giving us problems with the export. The pull request is here #4909.

If you can share some more details about what you'd like to do with this feature, I may be able to give you some advice on using this branch with the caveat that this is currently a 'use at your own risk' feature.

RedTachyon · 2021-02-10T21:39:04Z

Do you have an estimate of how long it could take until it's merged? I'm not in an extreme hurry, but that also depends on what's the expected timeline.

My particular use case, partially implemented in my code, is as follows: I create a new ISensor attached to an agent. The sensor at each step of the simulation finds all colliders within a certain radius (Physics.OverlapSphere), filters them to only get objects of a certain type, and then returns the positions of all those objects. At the moment padding/truncating the vector to a fixed size, ideally outputting a vector of size 2*n where n is the number of nearby interesting objects.

RedTachyon · 2021-02-25T16:17:18Z

Continuing this thread to avoid spam, because I see the PR has been merged.

So I'm not sure if I'm missing something obvious, but it seems that in the newest release, the BufferSensorComponent is internal, which seems to mean that I can't actually access it from my script? I'll probably just copy-paste the implementation or something for now, but you might want to change it to public like the other components.

andrewcoh · 2021-02-25T21:17:39Z

The PR was merged to master but after the most recent release was cut and so it will be in the March release. You should be able to try it on master though.

github-actions · 2022-11-04T20:02:19Z

This thread has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs.

ValerioB88 added the request Issue contains a feature request. label Nov 23, 2020

andrewcoh self-assigned this Nov 23, 2020

RedTachyon mentioned this issue Dec 9, 2020

Allow NaNs in observations #4728

Closed

miguelalonsojr closed this as completed Oct 5, 2022

github-actions bot locked as resolved and limited conversation to collaborators Nov 4, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Variable sensors shape #4686

Variable sensors shape #4686

ValerioB88 commented Nov 23, 2020

andrewcoh commented Nov 23, 2020

ValerioB88 commented Nov 23, 2020 •

edited

Loading

RedTachyon commented Dec 3, 2020

RedTachyon commented Feb 10, 2021

andrewcoh commented Feb 10, 2021

RedTachyon commented Feb 10, 2021

RedTachyon commented Feb 25, 2021

andrewcoh commented Feb 25, 2021

github-actions bot commented Nov 4, 2022

Variable sensors shape #4686

Variable sensors shape #4686

Comments

ValerioB88 commented Nov 23, 2020

andrewcoh commented Nov 23, 2020

ValerioB88 commented Nov 23, 2020 • edited Loading

RedTachyon commented Dec 3, 2020

RedTachyon commented Feb 10, 2021

andrewcoh commented Feb 10, 2021

RedTachyon commented Feb 10, 2021

RedTachyon commented Feb 25, 2021

andrewcoh commented Feb 25, 2021

github-actions bot commented Nov 4, 2022

ValerioB88 commented Nov 23, 2020 •

edited

Loading