Observation shape in the custom marl environment #823

WangJuan6 · 2024-07-31T01:51:54Z

Dear Author,

I want to use the mappo algorithm in the custom multi-agent environment.
My observation consists of two parts: agent_state and global_state. Specifically:
The dimensions of agent_state are given by H1 × W1 × C1.
The dimensions of global_state are given by H2 × W2 × C2.

I noticed that the parameters of agent_obs_shape and global_obs_shape in MAVAC can be either integers or sequences:

DI-engine/ding/model/template/mavac.py

Lines 43 to 45 in 7f95159

- agent_obs_shape (:obj:`Union[int, SequenceType]`): Observation's space for single agent, \

such as 8 or [4, 84, 84].

- global_obs_shape (:obj:`Union[int, SequenceType]`): Global observation's space, such as 8 or [4, 84, 84].

So, I set the observation shape in the config file:

model=dict(
            action_space='continuous',
            agent_num=n_agent,
            agent_obs_shape=[100, 100, 6],
            global_obs_shape=[100, 100, 6],
            action_shape=2,
        ),

However, the error:

Traceback (most recent call last):
  File "/data/projects/20240708/DI-engine/custom_procthor_env_hierarchy_top_view_multi_agent/procthor_env_ppo_subprocess_hierarchy.py", line 215, in <module>
    main()
  File "/data/projects/20240708/DI-engine/custom_procthor_env_hierarchy_top_view_multi_agent/procthor_env_ppo_subprocess_hierarchy.py", line 202, in main
    model = MAVAC(**cfg.policy.model)
  File "/data/projects/20240708/DI-engine/ding/model/template/mavac.py", line 84, in __init__
    nn.Linear(global_obs_shape, critic_head_hidden_size), activation,
  File "/home/wj/anaconda3/envs/di/lib/python3.9/site-packages/torch/nn/modules/linear.py", line 96, in __init__
    self.weight = Parameter(torch.empty((out_features, in_features), **factory_kwargs))
TypeError: empty(): argument 'size' must be tuple of SymInts, but found element of type tuple at pos 2

occurred in:

DI-engine/ding/model/template/mavac.py

Lines 83 to 88 in 7f95159

self.critic_head = nn.Sequential(

nn.Linear(global_obs_shape, critic_head_hidden_size), activation,

RegressionHead(

critic_head_hidden_size, 1, critic_head_layer_num, activation=activation, norm_type=norm_type

)

)

when I use the observation shape mentioned above.

How can I utilize custom networks, or is there an alternative method for handling sequence-type inputs other than flattening the array to 1D?

Thank you for your assistance!

Best regards,
Juan

The text was updated successfully, but these errors were encountered:

PaParaZz1 · 2024-08-04T09:26:12Z

You can custom your own networks, and pass it into the input argument of DI-engine's policy. Here is the concrete implementation of this argument.

WangJuan6 · 2024-08-05T03:20:01Z

Hi, @PaParaZz1
Thank you for your reply.
Is there a plan to set encoder parameters in MAVAC that are similar to how it’s done in VAC?

DI-engine/ding/model/template/vac.py

Line 43 in 7f95159

encoder: Optional[torch.nn.Module] = None,

I think this would be convenient for network customization.
Thanks,
Juan

PaParaZz1 · 2024-08-05T13:41:29Z

We will add this argument like that in VAC in a week.

WangJuan6 · 2024-08-06T00:36:17Z

Thanks, that’s good news! I’m looking forward to the update.

PaParaZz1 · 2024-08-14T13:44:23Z

We have added this feature in the above commit.

PaParaZz1 added discussion Discussion of a typical issue env Questions about RL environment labels Jul 31, 2024

PaParaZz1 added a commit that referenced this issue Aug 14, 2024

feature(nyz): add encoder in MAVAC (#823)

ae3ddc6

PaParaZz1 mentioned this issue Aug 14, 2024

Roadmap for DI-engine #548

Open

PaParaZz1 closed this as completed Aug 15, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Observation shape in the custom marl environment #823

Observation shape in the custom marl environment #823

WangJuan6 commented Jul 31, 2024

PaParaZz1 commented Aug 4, 2024

WangJuan6 commented Aug 5, 2024

PaParaZz1 commented Aug 5, 2024

WangJuan6 commented Aug 6, 2024

PaParaZz1 commented Aug 14, 2024

Observation shape in the custom marl environment #823

Observation shape in the custom marl environment #823

Comments

WangJuan6 commented Jul 31, 2024

PaParaZz1 commented Aug 4, 2024

WangJuan6 commented Aug 5, 2024

PaParaZz1 commented Aug 5, 2024

WangJuan6 commented Aug 6, 2024

PaParaZz1 commented Aug 14, 2024