Multi-Agent Risks from Advanced AI

Lewis Hammond, Alan Chan, Jesse Clifton, Jason Hoelscher-Obermaier, Akbir Khan, Euan McLean, … (+38 more) — 2025-02-19 — Cooperative AI Foundation — arXiv

Summary

Provides a comprehensive taxonomy of risks from multi-agent AI systems, identifying three key failure modes (miscoordination, conflict, collusion) and seven risk factors (information asymmetries, network effects, selection pressures, destabilizing dynamics, commitment problems, emergent agency, multi-agent security), with mitigation directions for each.

Source

Link: https://arxiv.org/abs/2502.14143
Listed in the Shallow Review of Technical AI Safety 2025 under one agenda:
- theory-for-aligning-multiple-ais — Multi-agent first
