GitHub - doesburg11/PredPreyGrass: A Predator-Prey-Grass multi-agent gridworld environment implemented with Farama's Gymnasium, PettingZoo and MOMAland. Featuring dynamic spawning and deletion and partial observability of agents.

Predator-Prey-Grass multi-agent reinforcement learning (MARL)

A Predator-Prey-Grass gridworld with multi-agent environments that dynamically spawn and delete partially observing agents, built on Farama's PettingZoo.

The environments

predpregrass_base.py: A single-objective multi-agent reinforcement learning (MARL) environment, trained centrally and evaluated in a decentralized manner using Proximal Policy Optimization (PPO). The learning agents, Predators (red) and Prey (blue), both expend energy moving around and replenish it by eating. Prey eat Grass (green), and Predators eat Prey when they end up on the same grid cell. In the base case, an agent obtains all the energy of the Prey or Grass it eats. Predators die of starvation when their energy reaches zero; Prey die either of starvation or by being eaten by a Predator. Learning agents reproduce asexually when eating raises their energy level above a certain threshold. Learning agents learn to execute movement actions based on their partial observations of the environment (the transparent red and blue squares, respectively, as depicted above) to maximize cumulative reward. In the base case, the single-objective rewards (for stepping, eating, dying and reproducing) are aggregated and can be adjusted in the environment configuration file.
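
The energy rules described above can be illustrated with a minimal sketch. It is not taken from the repository: the class `Agent`, the function `step_agent`, the constants `MOVE_COST` and `REPRODUCTION_THRESHOLD`, and the choice to split energy evenly between parent and offspring are all assumptions made for illustration. Being eaten by a Predator is not modeled here.

```python
from dataclasses import dataclass

# All names and numbers below are hypothetical, chosen only for illustration.
MOVE_COST = 0.1                # energy spent per step
REPRODUCTION_THRESHOLD = 10.0  # energy above which an agent reproduces


@dataclass
class Agent:
    kind: str            # "predator" or "prey"
    energy: float
    alive: bool = True


def step_agent(agent: Agent, food_energy: float = 0.0) -> list[Agent]:
    """Apply one time step to `agent` and return any offspring created."""
    agent.energy -= MOVE_COST    # moving around costs energy
    agent.energy += food_energy  # all energy of the eaten Prey or Grass is gained
    if agent.energy <= 0:
        agent.alive = False      # starvation
        return []
    if agent.energy > REPRODUCTION_THRESHOLD:
        # Asexual reproduction; the even energy split is an assumption.
        child_energy = agent.energy / 2
        agent.energy = child_energy
        return [Agent(agent.kind, child_energy)]
    return []
```

For instance, a Prey with energy 9.5 that eats Grass worth 2.0 would cross the hypothetical threshold in this sketch and produce one offspring.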

Emergent Behaviors

Training the single-objective environment predpregrass_base.py with the PPO algorithm is an example of how elaborate behaviors can emerge from simple rules in agent-based models. In the MARL example displayed above, learning agents are rewarded solely for reproduction; all other reward options are set to zero in the environment configuration. Despite this relatively sparse reward structure, maximizing these rewards results in elaborate emergent behaviors such as:

Predators hunting Prey
Prey finding and eating grass
Predators hovering around grass to catch Prey
Prey trying to escape Predators
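
The reproduction-only reward setting that produces these behaviors might look roughly like the sketch below. The dictionary `REWARDS`, its keys, the value `10.0`, and the helper `aggregate_reward` are hypothetical; the actual option names are defined in the environment configuration file.

```python
# Hypothetical reward table; the real keys live in the environment configuration file.
REWARDS = {
    "predator": {"step": 0.0, "eat": 0.0, "death": 0.0, "reproduction": 10.0},
    "prey": {"step": 0.0, "eat": 0.0, "death": 0.0, "reproduction": 10.0},
}


def aggregate_reward(kind, events):
    """Sum the rewards of all events that happened to one agent in one step."""
    table = REWARDS[kind]
    return sum(table[event] for event in events)
```

With this table, a step in which an agent only moves and eats yields zero reward, so any hunting or grazing behavior has to be learned indirectly through its effect on reproduction.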

Moreover, these learned behaviors lead to more complex emergent dynamics at the ecosystem level. The trained agents display a classic Lotka–Volterra pattern over time.
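
For readers unfamiliar with the Lotka–Volterra pattern, the sketch below integrates the textbook predator-prey equations with a simple forward Euler scheme. It is independent of the repository: the function name `lotka_volterra` and all parameter values are illustrative assumptions, and the trained agents are not governed by these equations, they only show a similar oscillating pattern.

```python
def lotka_volterra(prey0, pred0, alpha, beta, delta, gamma, dt, steps):
    """Integrate the classic Lotka-Volterra equations with forward Euler.

    d(prey)/dt = alpha * prey - beta * prey * pred
    d(pred)/dt = delta * prey * pred - gamma * pred
    """
    prey, pred = prey0, pred0
    history = [(prey, pred)]
    for _ in range(steps):
        d_prey = (alpha * prey - beta * prey * pred) * dt
        d_pred = (delta * prey * pred - gamma * pred) * dt
        prey, pred = prey + d_prey, pred + d_pred
        history.append((prey, pred))
    return history


# Illustrative parameters only; the equilibrium is prey = gamma / delta, pred = alpha / beta.
history = lotka_volterra(
    prey0=10.0, pred0=5.0,
    alpha=1.0, beta=0.1, delta=0.075, gamma=1.5,
    dt=0.001, steps=20_000,
)
```

Plotting the two columns of `history` against time shows the prey population peaking first, followed by the predator population, in repeating cycles.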

More emergent behavior and findings are described on our website.

Installation

Editor used: Visual Studio Code 1.93.1 on Linux Mint 21.3 Cinnamon

Clone the repository:

git clone https://github.com/doesburg11/PredPreyGrass.git

Open Visual Studio Code and execute:
- Press ctrl+shift+p
- Type and choose: "Python: Create Environment..."
- Choose environment: Conda
- Choose interpreter: Python 3.11.7
- Open a new terminal
- Install the package in editable mode:
```
pip install -e .
```

Install the following requirements:

```
pip install supersuit==3.9.3 
```
```
pip install tensorboard==2.18.0 
```
```
pip install stable-baselines3[extra] 
```
```
conda install -c conda-forge gcc=12.1.0
```
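
After installing, it can be useful to confirm that the pinned versions are the ones actually present in the environment. The following is a small sketch using the standard library's `importlib.metadata`; the helper `check_pins` and the dictionary `PINNED` are hypothetical names and not part of the repository.

```python
from importlib import metadata

# Pins taken from the install commands above.
PINNED = {"supersuit": "3.9.3", "tensorboard": "2.18.0"}


def check_pins(pinned, get_version=metadata.version):
    """Return {package: (expected, found)} for each mismatched or missing package."""
    problems = {}
    for name, expected in pinned.items():
        try:
            found = get_version(name)
        except metadata.PackageNotFoundError:
            found = None  # package is not installed at all
        if found != expected:
            problems[name] = (expected, found)
    return problems
```

Calling `check_pins(PINNED)` in the new environment returns an empty dictionary when both packages are installed at the pinned versions.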

Getting started

Visualize a random policy

In Visual Studio Code run: predpreygrass/single_objective/eval/evaluate_random_policy.py

Train and visualize a trained model using PPO from Stable Baselines3

Adjust parameters accordingly in:

predpreygrass/single_objective/config/config_predpreygrass.py

In Visual Studio Code run:

predpreygrass/single_objective/train/train_ppo.py

To evaluate and visualize after training, follow the instructions in:

predpreygrass/single_objective/eval/evaluate_ppo_from_file.py

[UNDER (RE)CONSTRUCTION] Batch training and evaluating in one go:

predpreygrass/single_objective/eval/_parameter_variation_train_ppo_and_evaluate.py
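
Since that script is still under reconstruction, the general idea of a parameter-variation run can be sketched independently of it. Everything below is an assumption for illustration: the keys in `BASE_CONFIG` and `SWEEP`, the helpers `config_variants` and `run_sweep`, and the `train_and_evaluate` callback are hypothetical and do not reflect the script's actual interface.

```python
from itertools import product

# Hypothetical base configuration and sweep values, for illustration only.
BASE_CONFIG = {"n_predators": 6, "n_prey": 8, "training_steps": 1_000_000}
SWEEP = {"n_predators": [4, 6], "n_prey": [8, 12]}


def config_variants(base, sweep):
    """Yield one config dict per combination of swept parameter values."""
    names = list(sweep)
    for values in product(*(sweep[name] for name in names)):
        yield {**base, **dict(zip(names, values))}


def run_sweep(base, sweep, train_and_evaluate):
    """Call `train_and_evaluate(config)` for each variant and collect the results."""
    return [(cfg, train_and_evaluate(cfg)) for cfg in config_variants(base, sweep)]
```

Passing a function that trains and then evaluates a model for one configuration as `train_and_evaluate` would run every combination in `SWEEP` in sequence.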

