Frames are returned incorrectly after an absolute action (e.g. teleport) #250

salaniz · 2016-08-02T16:03:18Z

I am trying to get robust frame-action pairs as mentioned in issue #231 .
Currently, I have a workaround solution by waiting for the next new observation and then accepting the first frame that arrives thereafter. See my attached Lua file as a reference.

This method seems to work for discrete actions, but not for absolute actions like teleport.
After an absolute action, I receive a frame of the agent in the previous state (e.g. prior to teleportation) instead of the new state.

If I skip the first frame after a new observation and wait for the second observation-frame pair after the action then it works for both discrete and absolute actions. However, it would be nice if it's consistent and preferably the first frame after a new observation that corresponds to the new state.

I can reproduce the problem with the attached files written in Lua. The code uses the Lua modules torch and image as well as qtlua to display the frame after each action. Run it with qlua frame_action_pairs.lua. Alternatively, it can also be run without qtlua by saving images instead of displaying them (see lines 96 and 150).
Use the boolean variable skip_first_frame to either accept the first observation-frame pair or the second.

I am using the latest release: 0.16.0

frame_action_pairs.zip

The text was updated successfully, but these errors were encountered:

timhutton · 2016-08-16T09:34:36Z

@salaniz Thanks for looking into this. Sorry it's causing problems.

When you send a command to Minecraft, it gets added to a queue. Then, on every world tick (usually every 50ms unless overclocking) the pending commands are acted on. On a separate thread the rendering is happening as quickly as possible (up to a limit of 60fps usually). Together with random delays in the network messages this makes it hard to robustly get a frame containing the outcome of the command.

I've just been looking at the tabular_q_learning.py sample to see what can be done here. With the approach there it seems robust for discrete movement. I'll have a look at the teleporting movement now.

In the meanwhile, take a look at ObservationFromRecentCommands. This gets returned after a command has been acted on, and so hopefully by taking the next frame after that it should be robust. Let me know if this helps. I'll try it too and will make sure we include a sample for this when we have a solution.

salaniz · 2016-08-16T12:06:41Z

@timhutton Thanks for the advice.

My current solution to the problem is based on how it is done in tabular_q_learning.py: I wait for a new observation to arrive and check if my current state has changed (x, y, z, pitch or yaw). If so, I accept the next frame that comes thereafter and if not, I discard the observation and wait for the next.

So far this methods seems to be robust. Both ObservationFromRecentCommands and the positional information in the frame (from #259) should provide enough additional means to associate frame-action pairs.

However, it remains that absolute actions are handled a little differently than the rest as I have noticed in #255, too.

timhutton · 2016-08-16T14:05:11Z

@salaniz Yes, there's a difference between how the absolute movement commands (tp, etc.) work and how the discrete movement commands (movenorth, etc.) work - the latter are directly applied on the Minecraft client and thus act immediately, while the former must be sent to the server to be acted upon. This introduces extra delay, which I think is part of the problem.

The same thing applies to the discrete use and attack commands, which also need to be sent to the server.

We're actively looking at how to robustly get frame-action pairs with these server-side actions.

This was referenced Aug 4, 2016

Add camera position data to returned frames #257

Closed

Frame pos #259

Merged

timhutton modified the milestone: Dolphin Aug 15, 2016

timhutton modified the milestones: Dolphin, Elk Aug 18, 2016

timhutton mentioned this issue Aug 19, 2016

Added sample showing robust frame-action pairs #292

Merged

timhutton closed this as completed in #292 Aug 23, 2016

Phantomb mentioned this issue Dec 4, 2017

Request: Indication finished discrete command #641

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Frames are returned incorrectly after an absolute action (e.g. teleport) #250

Frames are returned incorrectly after an absolute action (e.g. teleport) #250

salaniz commented Aug 2, 2016

timhutton commented Aug 16, 2016 •

edited

Loading

salaniz commented Aug 16, 2016

timhutton commented Aug 16, 2016

Frames are returned incorrectly after an absolute action (e.g. teleport) #250

Frames are returned incorrectly after an absolute action (e.g. teleport) #250

Comments

salaniz commented Aug 2, 2016

timhutton commented Aug 16, 2016 • edited Loading

salaniz commented Aug 16, 2016

timhutton commented Aug 16, 2016

timhutton commented Aug 16, 2016 •

edited

Loading