2506.17434 - Resource Rational Contractualism Should Guide AI Alignment

Sydney Levine, Matija Franklin, Tan Zhi-Xuan, Secil Yanik Guyot, Lionel Wong, Daniel Kilov, … (+5 more) — 2025-06-20 — MIT, Stanford, University of Washington, DeepMind, ANU — arXiv

Summary

Proposes Resource-Rational Contractualism (RRC), a framework for AI alignment that approximates the agreements diverse stakeholders would endorse by using cognitively inspired heuristics that trade computational effort for accuracy when navigating value conflicts.

Source