2506.17434 - Resource Rational Contractualism Should Guide AI Alignment

Sydney Levine, Matija Franklin, Tan Zhi-Xuan, Secil Yanik Guyot, Lionel Wong, Daniel Kilov, … (+5 more) — 2025-06-20 — MIT, Stanford, University of Washington, DeepMind, ANU — arXiv

Summary

Proposes Resource-Rational Contractualism (RRC), a framework for AI alignment that approximates the agreements diverse stakeholders would endorse, using cognitively inspired heuristics that trade computational effort for accuracy when navigating value conflicts.

Source

Link: https://arxiv.org/abs/2506.17434
Listed in the Shallow Review of Technical AI Safety 2025 under 1 agenda:
- aligning-to-the-social-contract — Multi-agent first
Editorial blurb (verbatim): [2506.17434 \- Resource Rational Contractualism Should Guide AI Alignment](https://arxiv.org/abs/2506.17434)
