Adding RLlib compatible Multi-agent environment #895

elpollouk · 2020-09-16T13:34:48Z

Added a new TurnBasedRllibMultiAgentEnv environment implementation that is compatible with RLlib's multi-agent environment. This environment wraps multiple MalmoEnv environment instances and sends steps to them in a turn-based sequence, so that multiple Minecraft agents can operate in a single gameplay session.
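The turn-based wrapping described above can be sketched roughly as follows. This is a minimal illustration, not the code from this PR: `TurnBasedSketch` and `CountingEnv` are hypothetical names, the wrapped environments are assumed to expose Gym-style `reset()` and `step(action)`, and the class does not subclass RLlib's `MultiAgentEnv`, so it only mimics the per-agent dictionaries and the `"__all__"` done key. Each agent's reward is held back until that agent is next asked to act, and episode termination is simplified.

```python
class CountingEnv:
    """Hypothetical stand-in for one single-agent env; not the MalmoEnv API."""

    def __init__(self):
        self.steps = 0

    def reset(self):
        self.steps = 0
        return self.steps

    def step(self, action):
        self.steps += 1
        # Echo the action back as the reward so turns are easy to trace.
        return self.steps, float(action), False, {}


class TurnBasedSketch:
    """Steps one wrapped env per call, cycling through the agents in order."""

    def __init__(self, envs):
        self.envs = dict(envs)        # agent id -> single-agent env
        self.order = list(self.envs)  # fixed turn order
        self.turn = 0
        self.last = {}                # agent id -> latest (obs, reward, done, info)

    def reset(self):
        self.turn = 0
        self.last = {
            agent: (env.reset(), 0.0, False, {})
            for agent, env in self.envs.items()
        }
        first = self.order[0]
        # Only the agent whose turn it is receives an observation.
        return {first: self.last[first][0]}

    def step(self, action_dict):
        agent = self.order[self.turn]
        self.last[agent] = self.envs[agent].step(action_dict[agent])
        episode_over = self.last[agent][2]

        # Advance the turn and report the next agent's most recent result.
        self.turn = (self.turn + 1) % len(self.order)
        nxt = self.order[self.turn]
        obs, reward, done, info = self.last[nxt]
        dones = {nxt: done, "__all__": episode_over}
        return {nxt: obs}, {nxt: reward}, dones, {nxt: info}


# Example usage with two stand-in environments.
env = TurnBasedSketch({"agent_0": CountingEnv(), "agent_1": CountingEnv()})
first_obs = env.reset()
obs_1, rewards_1, dones_1, _ = env.step({"agent_0": 1})
obs_2, rewards_2, dones_2, _ = env.step({"agent_1": 2})
```

Returning an observation only for the agent that acts next is what makes the sequence turn-based: a caller that supplies actions only for agents present in the observation dictionary will drive one agent at a time.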

In addition to the TurnBasedRllibMultiAgentEnv environment, I've also added SyncRllibMultiAgentEnv to sync Malmo's actions with their resultant observations. This is managed by sending an idle step request to each Minecraft instance after each real step request to query the resultant state of the environment.
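As a rough illustration of the idle-step idea, the sketch below follows every real step with one idle step and returns the observation from the idle step. It is an assumption-laden sketch rather than the SyncRllibMultiAgentEnv code: `SyncSketch`, `LaggyEnv`, and the `idle_action` parameter are hypothetical, and summing the two rewards is one possible choice, not necessarily the one made in this PR.

```python
class LaggyEnv:
    """Hypothetical env whose step() returns the state from BEFORE the action."""

    def __init__(self):
        self.state = 0

    def reset(self):
        self.state = 0
        return self.state

    def step(self, action):
        stale_obs = self.state
        self.state += action
        return stale_obs, 0.0, False, {}


class SyncSketch:
    """Follows each real step with an idle step to read the resulting state."""

    def __init__(self, env, idle_action):
        self.env = env
        self.idle_action = idle_action  # assumed to leave the world unchanged

    def reset(self):
        return self.env.reset()

    def step(self, action):
        obs, reward, done, info = self.env.step(action)
        if done:
            # Nothing left to query once the episode has ended.
            return obs, reward, done, info
        # The idle step's observation reflects the effect of the real action.
        obs, idle_reward, done, info = self.env.step(self.idle_action)
        return obs, reward + idle_reward, done, info


# Example usage: the raw env lags by one step, the wrapped env does not.
raw = LaggyEnv()
raw.reset()
raw_obs, _, _, _ = raw.step(5)

synced = SyncSketch(LaggyEnv(), idle_action=0)
synced.reset()
synced_obs, _, _, _ = synced.step(5)
```

The cost of this approach is one extra request per step, which is the trade-off for observations that line up with the action that produced them.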

Finally, I've included a new launcher.py script that can be used either directly or by importing into training scripts to launch multiple Minecraft instances. The script clones the Malmo directory into a new temporary directory and launches Minecraft from the new copy. This avoids issues with running multiple instances of Minecraft from the same directory causing conflicts as they try to update the same files.
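The clone-then-launch approach can be sketched as below. This is not the launcher.py from this PR: `clone_and_launch` is a hypothetical helper, and the example launches a small Python command as a stand-in for the real Minecraft start-up script so that it runs anywhere.

```python
import shutil
import subprocess
import sys
import tempfile
from pathlib import Path


def clone_and_launch(source_dir, command):
    """Copy source_dir into a fresh temp directory and run command from the copy."""
    scratch = Path(tempfile.mkdtemp(prefix="minecraft_instance_"))
    workdir = scratch / "copy"  # copytree needs a destination that does not exist yet
    shutil.copytree(source_dir, workdir)
    # Files written by the process land in its private copy, not in source_dir.
    proc = subprocess.Popen(command, cwd=str(workdir))
    return proc, workdir


# Example usage with a throwaway source directory and a stand-in command.
source = Path(tempfile.mkdtemp(prefix="fake_malmo_"))
(source / "marker.txt").write_text("placeholder\n")
write_file = [sys.executable, "-c", "open('run.log', 'w').write('started')"]
proc, workdir = clone_and_launch(source, write_file)
exit_code = proc.wait()
```

Because each instance gets its own working directory, two instances started this way never write to the same files.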


martinballa · 2020-10-02T09:36:47Z

Is it possible to add a new bash file for running the Malmo instances headless?
It should be in 'Minecraft/launchClient_headless.sh' and it should contain the following 2 lines:

#!/bin/bash
xvfb-run -a -e /dev/stdout -s '-screen 0 640x480x16' ./launchClient.sh -port "$1" -env > ../out.txt 2>&1

martinballa · 2020-12-05T10:58:10Z

I have been working with Adrian's updates for a while, and they make it much easier to run Malmo, especially on clusters. Unfortunately, when Malmo crashes, the launcher hangs and keeps printing "Waiting for N instances..." instead of throwing an exception with an error message. The problem is that each Malmo instance runs in its own process and does not send its output back to the main process.
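One way the waiting loop could fail fast is to poll each child process while waiting and raise as soon as one has exited. The sketch below is a suggestion, not the current launcher behaviour: `wait_for_instances` and the `is_ready` callback are hypothetical, and how readiness is actually detected (for example by probing a port) is left to the caller.

```python
import subprocess
import sys
import time


def wait_for_instances(procs, is_ready, timeout=300.0, poll_interval=0.5):
    """Block until every instance is ready; raise if one exits or time runs out."""
    deadline = time.monotonic() + timeout
    pending = list(range(len(procs)))
    while pending:
        for index in list(pending):
            code = procs[index].poll()  # None while the process is still running
            if code is not None:
                raise RuntimeError(
                    f"Instance {index} exited with code {code} before becoming ready"
                )
            if is_ready(index):
                pending.remove(index)
        if pending:
            if time.monotonic() > deadline:
                raise TimeoutError(f"Timed out waiting for {len(pending)} instance(s)")
            time.sleep(poll_interval)


# Example usage: a stand-in process that crashes immediately with exit code 3.
crashing = subprocess.Popen([sys.executable, "-c", "import sys; sys.exit(3)"])
try:
    wait_for_instances([crashing], is_ready=lambda index: False,
                       timeout=30.0, poll_interval=0.1)
    outcome = "ready"
except RuntimeError as err:
    outcome = str(err)
```

Capturing each instance's output, for example through a pipe or a per-instance log file, would allow the exception to carry the actual error text as well as the exit code.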

martinballa · 2020-12-05T11:20:06Z

Another minor issue I had with the launcher: when my run crashes, the Malmo instances keep running and I have to kill the java processes manually. In other cases it would be useful to keep the instances alive, as a full start-up takes a few minutes per instance. If somebody continues working on this, I think these features would be great additions to Malmo. I wanted to file them as issues, but this PR has not been approved yet.
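Both behaviours could be covered by a context manager that owns the launched processes, sketched below under stated assumptions. `managed_instances` and its `keep_alive` flag are hypothetical and not part of the launcher in this PR. The sketch only terminates the processes it started directly; if the launch script spawns java as a child process, process-group handling would also be needed, which is not shown here.

```python
import contextlib
import subprocess
import sys


@contextlib.contextmanager
def managed_instances(commands, keep_alive=False):
    """Start one process per command and stop them on exit unless keep_alive is set."""
    procs = [subprocess.Popen(command) for command in commands]
    try:
        yield procs
    finally:
        if not keep_alive:
            for proc in procs:
                if proc.poll() is None:
                    proc.terminate()  # ask politely first
            for proc in procs:
                try:
                    proc.wait(timeout=10)
                except subprocess.TimeoutExpired:
                    proc.kill()       # force it if it does not stop in time
                    proc.wait()


# Example usage: a simulated training crash still cleans up the stand-in instance.
sleeper = [sys.executable, "-c", "import time; time.sleep(60)"]
try:
    with managed_instances([sleeper]) as procs:
        raise ValueError("simulated training crash")
except ValueError:
    pass
still_running = procs[0].poll() is None
```

Passing `keep_alive=True` skips the cleanup, which matches the case where the instances should survive so that the next run can skip the slow start-up.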

Adrian O'Grady added 13 commits September 8, 2020 14:11

Added Minecraft launcher

dd617fa

Added initial RLlib training script

e8d3c84

Fixed info to be returned as a dict

a7719a1

Added RLlib compatible multi-agent env

bb8643d

Moved the SyncEnv into core.py

8fdb577

Merge branch 'elpollouk/MultiAgentEnv' of https://github.com/elpollou…

cfbb10d

…k/malmo into elpollouk/MultiAgentEnv

Added multi-agent training

c09a386

Stable RLlib training

aab8397

Switching back to turn based actions

f3d5174

Synchronised observations for turn based experiments

8871515

Removed debug info

590af3d

Removed some redundant changes

d8ccfdb

Added automatic generation of version.properties

25b7cdc
