Multi-Agent Risks from Advanced AI
Lewis Hammond, Alan Chan, Jesse Clifton, Jason Hoelscher-Obermaier, Akbir Khan, Euan McLean, … (+38 more) — 2025-02-19 — Cooperative AI Foundation — arXiv
Summary
Provides a comprehensive taxonomy of risks from multi-agent AI systems, identifying three key failure modes (miscoordination, conflict, collusion) and seven risk factors (information asymmetries, network effects, selection pressures, destabilizing dynamics, commitment problems, emergent agency, multi-agent security), with mitigation directions for each.
Source
- Link: https://arxiv.org/abs/2502.14143
- Listed in the Shallow Review of Technical AI Safety 2025 under 1 agenda(s):
- theory-for-aligning-multiple-ais — Multi-agent first