Title | Action Type | Action Shape | Action Values | Observation Shape | Observation Values | Average Total Reward | Import |
---|---|---|---|---|---|---|---|
Duplicated Input | Discrete | (3,) | [(0, 1), (0, 1), (0, base-1)] | (1,) | (0, base) | | `from gym.envs.algorithmic import duplicated_input` |
The task is to return every nth character (where n = duplication) from the input tape. This task was originally used in the paper Learning Simple Algorithms from Examples.
The model has to learn:
- the correspondence between input and output symbols.
- executing the move-right action on the input tape.
The agent takes a 3-element vector as its action. The action space is (x, w, v), where:
- x is used for left/right movement on the input tape. It can take values (0, 1).
- w decides whether or not to write to the output tape. It can take values (0, 1).
- v is the value to be written on the output tape. It can take values (0, base-1).

The observation space size is (1,).
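To make the action semantics concrete, here is a minimal sketch of how a single (x, w, v) action could be applied. This is illustrative only, not the gym source: the function name and state layout (`input_tape`, `head`, `target`, `out_pos`) are assumptions, and the episode-termination rules mirror the reward schedule described below.

```python
# Minimal sketch (not the gym implementation) of one step of the dynamics.
# input_tape: list of ints in [0, base-1]; head: read position on the input tape;
# target: the expected output; out_pos: next write position on the output tape.

def apply_action(input_tape, head, target, out_pos, action):
    """Apply one (x, w, v) action; return (new_head, new_out_pos, reward, done)."""
    x, w, v = action
    reward = 0.0
    done = False
    if w == 1:  # write v to the output tape
        if out_pos < len(target) and v == target[out_pos]:
            reward = 1.0   # correct character
        else:
            reward = -0.5  # wrong character
            done = True    # a wrong write ends the episode
        out_pos += 1
        if out_pos == len(target):
            done = True    # finished writing the whole target
    head += 1 if x == 1 else -1  # move right (1) or left (0)
    return head, out_pos, reward, done
```

For example, with `input_tape = [2, 2, 4, 4]` (base=5, duplication=2) the target is `[2, 4]`, and the action `(1, 1, 2)` writes the correct first character and moves the read head right.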
Rewards:
Rewards are issued similarly to the other Algorithmic environments. Reward schedule:
- write a correct character: +1
- write a wrong character: -0.5
- run out the clock: -1
- otherwise: 0
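The schedule above can be sketched as a small, self-contained scoring function. The name `score_writes` and the exact clock handling are assumptions for illustration, not the environment's internals.

```python
# Illustrative sketch of the reward schedule, not the gym source.

def score_writes(writes, target, time_limit):
    """Total reward: +1 per correct character, -0.5 for a wrong character
    (which ends the episode), -1 if the clock runs out before finishing."""
    total, steps = 0.0, 0
    for w, t in zip(writes, target):
        steps += 1
        if steps > time_limit:
            return total - 1.0   # ran out the clock
        if w == t:
            total += 1.0         # correct character
        else:
            return total - 0.5   # wrong character ends the episode
    if len(writes) < len(target):
        return total - 1.0       # never finished writing: clock runs out
    return total
```

Under this sketch, writing the full correct target `[2, 4]` scores 2.0, while getting the second character wrong scores 1.0 - 0.5 = 0.5.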
`gym.make('DuplicatedInput-v0', base=5, duplication=2)`

- base: number of distinct characters to read/write.
- duplication: number of times each character is repeated on the input tape; each such run is converted to a single output character.
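The effect of `base` and `duplication` on the task can be shown with a small self-contained sketch that builds a duplicated input tape and its expected output. The helper name `make_task` is hypothetical; it mirrors the task description rather than the environment's own tape generator.

```python
import random

# Illustrative only: construct a duplicated input tape and the expected
# output for given base and duplication.
def make_task(length, base=5, duplication=2, seed=0):
    rng = random.Random(seed)
    symbols = [rng.randrange(base) for _ in range(length)]
    # Each symbol is repeated `duplication` times on the input tape...
    input_tape = [s for s in symbols for _ in range(duplication)]
    # ...and the target output keeps every `duplication`-th character.
    target = symbols
    return input_tape, target
```

With `duplication=2`, taking every 2nd character of the input tape recovers the target exactly.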
- v0: Initial version release (1.0.0)