Societal alignment frameworks can improve llm alignment
Karolina Stańczak, Nicholas Meade, Mehar Bhatia, Hattie Zhou, Konstantin Böttinger, Jeremy Barnes, … (+11 more) — 2025-02-27 — Multiple institutions (17 authors) — arXiv
Summary
Argues that LLM alignment should incorporate insights from societal alignment frameworks (social, economic, contractual) to address incomplete contracts and misspecified objectives, proposing participatory alignment approaches and reframing underspecification as an opportunity.
Source
- Link: https://arxiv.org/abs/2503.00069
- Listed in the Shallow Review of Technical AI Safety 2025 under 1 agenda(s):
- aligning-to-the-social-contract — Multi-agent first