2506.17434 - Resource Rational Contractualism Should Guide AI Alignment
Sydney Levine, Matija Franklin, Tan Zhi-Xuan, Secil Yanik Guyot, Lionel Wong, Daniel Kilov, … (+5 more) — 2025-06-20 — MIT, Stanford, University of Washington, DeepMind, ANU — arXiv
Summary
Proposes Resource-Rational Contractualism (RRC), a framework for AI alignment that approximates agreements diverse stakeholders would endorse by using cognitively-inspired heuristics that trade computational effort for accuracy when navigating value conflicts.
Source
- Link: https://arxiv.org/abs/2506.17434
- Listed in the Shallow Review of Technical AI Safety 2025 under 1 agenda(s):
- aligning-to-the-social-contract — Multi-agent first
- Editorial blurb (verbatim):
[2506.17434 \- Resource Rational Contractualism Should Guide AI Alignment](https://arxiv.org/abs/2506.17434)