Investigating Misaligned Behaviors in Multi-Agent Systems Using Sports Analytics
preprint | colab notebook | hackathon slides
Authors: Abrar Rahman, Anish Sundar
A novel multi-agent interpretability framework. Sports metaphors offer a powerful lens for interpreting chaotic multi-agent systems, where individual roles, team dynamics, and external variables shape outcomes. This project investigates AI safety in multi-agent systems, with a focus on interpretability and alignment challenges, and is inspired by how “ball-hog” behavior can harm team success despite impressive individual stats. By studying emergent patterns and coordination trade-offs, the framework aims to improve the reliability and safety of multi-agent AI systems.
Base Metrics

Metric | Definition |
---|---|
Value over Replacement | Measures how much better (or worse) an agent or human performs compared to a baseline "replacement agent." |
Team Playmaking (Assist) | Measures how effectively an agent enables downstream success for humans or other agents. |
Task Oversights (Turnover) | Quantifies the frequency and severity of mistakes introduced by an agent or human. |
Error Reduction and Recovery (Rebound) | Measures an agent's ability to recover from or correct errors introduced by others. |
Usage Rate (UR) | Proportion of the team's workload handled by an agent or human. The default cost function is the number of tasks attempted. |

Note: UR can be extended to other cost functions, depending on which constraint you want to optimize for (e.g., dollars spent, throughput measured as tokens per second (TPS), latency measured as time to first token (TTFT), user engagement, etc.).
Efficiency-Adjusted Metrics

Metric | Common Name | Definition |
---|---|---|
Efficiency-Adjusted Team Playmaking | Assist Rate | Normalizes the assist score by the agent's Usage Rate. |
Efficiency-Adjusted Task Oversights | Turnover Ratio | Normalizes the turnover rate by the agent's Usage Rate. |
Efficiency-Adjusted Error Reduction and Recovery | Rebound Rate | Evaluates how effectively an agent corrects errors relative to its Usage Rate. |
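For illustration, here is a minimal sketch of how a per-agent "box score" of these metrics could be computed from event counts. The counter names and the simple ratio-based normalization are assumptions made for the example, not the exact formulas from the preprint.

```python
from dataclasses import dataclass

@dataclass
class AgentBoxScore:
    """Hypothetical per-agent counters aggregated over one episode."""
    tasks_attempted: int  # raw workload handled by this agent
    assists: int          # actions that enabled a teammate's downstream success
    turnovers: int        # mistakes introduced by this agent
    recoveries: int       # others' errors that this agent corrected

def usage_rate(agent: AgentBoxScore, team: list[AgentBoxScore]) -> float:
    """Share of the team's workload handled by this agent
    (default cost function: number of tasks attempted)."""
    total = sum(a.tasks_attempted for a in team)
    return agent.tasks_attempted / total if total else 0.0

def efficiency_adjusted(raw_count: int, ur: float) -> float:
    """Normalize a raw count by Usage Rate (assumed here to be a simple ratio)."""
    return raw_count / ur if ur else 0.0

team = [
    AgentBoxScore(tasks_attempted=40, assists=12, turnovers=3, recoveries=5),
    AgentBoxScore(tasks_attempted=10, assists=2, turnovers=4, recoveries=1),
]

for i, agent in enumerate(team):
    ur = usage_rate(agent, team)
    print(
        f"agent {i}: UR={ur:.2f}, "
        f"assist rate={efficiency_adjusted(agent.assists, ur):.1f}, "
        f"turnover ratio={efficiency_adjusted(agent.turnovers, ur):.1f}, "
        f"rebound rate={efficiency_adjusted(agent.recoveries, ur):.1f}"
    )
```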
This repository explores the application of Collaborative Intelligence to Multi-Agent Systems (MAS), focusing on misaligned behaviors through the lens of sports analytics. The goal is to understand interaction dynamics in multi-agent environments and how these interactions can be modeled, trained, and analyzed using Deep Reinforcement Learning (DRL) techniques. We specifically focus on a football environment from the VMAS simulator, which is part of the Multi-Agent Particle Environments (MPE) suite. For those new to the concept, MPE environments are designed to enable simulations of multi-agent systems and allow experimentation with various Multi-Agent Reinforcement Learning (MARL) algorithms.
We use the `football` environment from VMAS, a vectorized implementation of multi-agent environments that can run on GPUs. This provides a massive speedup compared to the traditional PettingZoo environments, which run on CPUs. The football simulation is available only in VMAS, but sample code is included to run a simpler PettingZoo environment (e.g., `simple_tag_v3`) as a proof of concept.
Key Components:
- PettingZoo: A framework for multi-agent environments on CPUs.
- VMAS: A PyTorch-based library for vectorized MPE environments, optimized for GPUs.
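As a rough sketch, both environments can be constructed through TorchRL's wrappers along these lines; the batch size, step limit, and keyword arguments below are illustrative assumptions, so check the installed TorchRL version for the exact signatures.

```python
import torch
from torchrl.envs.libs.vmas import VmasEnv
from torchrl.envs.libs.pettingzoo import PettingZooEnv

device = "cuda" if torch.cuda.is_available() else "cpu"

# Vectorized football scenario (VMAS): many environments stepped in parallel on GPU.
football_env = VmasEnv(
    scenario="football",
    num_envs=32,              # number of vectorized environments (illustrative)
    continuous_actions=True,
    max_steps=500,            # episode length cap (illustrative)
    device=device,
)

# CPU-bound proof of concept (PettingZoo MPE): simple_tag_v3 via the parallel API.
poc_env = PettingZooEnv(
    task="simple_tag_v3",
    parallel=True,            # simultaneous-step API
    continuous_actions=True,  # forwarded to the underlying PettingZoo scenario
)

print(football_env.action_spec)  # inspect per-agent action spaces
```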
In this project, we focus on Deep Deterministic Policy Gradient (DDPG), an off-policy actor-critic reinforcement learning algorithm that optimizes a deterministic policy based on gradients from the critic network. For multi-agent systems, we extend this approach to MADDPG (Multi-Agent DDPG), which allows multiple agents to learn in a cooperative-competitive environment.
The key difference in multi-agent settings is the need for decentralized execution, where each agent has its own policy and decision-making process based solely on its local observation. However, the critic can be either centralized (global information is shared among agents) or decentralized (only local information is used), depending on the algorithm.
Centralized vs. Decentralized Critics
- MADDPG: The critic takes the global state and global action as input, meaning all agent information is shared and the training is centralized.
- IDDPG (Independent DDPG): The critic uses only local observations and actions, supporting decentralized training, where each agent operates independently.
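To make the distinction concrete, here is a minimal sketch of the two critic flavors using TorchRL's MultiAgentMLP. The agent count, observation/action sizes, and network widths are illustrative assumptions, and the spelling of the centralisation flag (`centralised`) follows the TorchRL MARL tutorials and may differ across versions.

```python
import torch
from torchrl.modules import MultiAgentMLP

n_agents = 3   # assumed team size
obs_dim = 18   # assumed per-agent observation size
act_dim = 2    # assumed per-agent (continuous) action size

# MADDPG-style critic: centralised=True lets each agent's Q-value estimate
# condition on every agent's observation and action (centralized training).
maddpg_critic = MultiAgentMLP(
    n_agent_inputs=obs_dim + act_dim,
    n_agent_outputs=1,        # one Q-value per agent
    n_agents=n_agents,
    centralised=True,
    share_params=True,        # one set of critic weights shared across agents
    depth=2,
    num_cells=256,
)

# IDDPG-style critic: centralised=False restricts each agent's critic to its
# own local observation and action (fully decentralized training).
iddpg_critic = MultiAgentMLP(
    n_agent_inputs=obs_dim + act_dim,
    n_agent_outputs=1,
    n_agents=n_agents,
    centralised=False,
    share_params=True,
    depth=2,
    num_cells=256,
)

# Both map (*batch, n_agents, obs_dim + act_dim) -> (*batch, n_agents, 1);
# only the information each agent's Q-value can see differs.
dummy = torch.randn(8, n_agents, obs_dim + act_dim)
print(maddpg_critic(dummy).shape, iddpg_critic(dummy).shape)
```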
Notebook Structure
- Hyperparameter Setup: Define hyperparameters to control the training environment and agent behavior.
- Environment Construction: Build the multi-agent environment using TorchRL’s wrapper for either PettingZoo or VMAS.
- Policy & Critic Networks: Create the actor-critic networks, exploring the trade-offs in parameter sharing and critic centralization.
- Sampling & Replay Buffer: Set up a replay buffer for storing agent interactions and sampling for training.
- Simulation & Metrics: Aggregate simulation data and compute team-level and agent-level metrics, simulating a “box score.”
- Visualization: Render the environment and visualize the agents' learned policies before and after training (requires a machine with a GUI).
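A minimal sketch of the sampling and replay-buffer step, using TorchRL's replay-buffer primitives; the capacity, batch size, and the `rollout_td` name are illustrative assumptions:

```python
from torchrl.data import LazyTensorStorage, RandomSampler, ReplayBuffer

memory_size = 100_000     # max transitions retained (illustrative)
train_batch_size = 512    # minibatch size per optimization step (illustrative)

replay_buffer = ReplayBuffer(
    storage=LazyTensorStorage(memory_size, device="cpu"),
    sampler=RandomSampler(),
    batch_size=train_batch_size,
)

# `rollout_td` stands in for the TensorDict returned by the data collector;
# flatten batch/time dimensions before storing, then sample minibatches.
# replay_buffer.extend(rollout_td.reshape(-1))
# minibatch = replay_buffer.sample()   # TensorDict of shape [train_batch_size]
```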
To set up the environment for this notebook, please install the following dependencies:
```bash
!pip3 install torchrl
!pip3 install vmas
!pip3 install "pettingzoo[mpe]==1.24.3"
!pip3 install tqdm
!pip3 install av
!apt-get install -y python3-opengl
```
Submitted to the AI Safety track for the RDI Agent MOOC Hackathon, Dec 2024