The Alignment Waltz: Jointly Training Agents to Collaborate for Safety
Jingyu Zhang, Haozhu Wang, Eric Michael Smith, Sid Wang, Amr Sharaf, Mahesh Pasupuleti, … (+4 more) — 2025-10-09 — Meta, Johns Hopkins University
Source
- Link: https://arxiv.org/pdf/2510.08240
- Listed in the Shallow Review of Technical AI Safety 2025 under 1 agenda:
- meta — Labs (giant companies)