Why Future AIs will Require New Alignment Methods

Alvin Ånestrand — 2025-10-10 — LessWrong

Summary

Introduces the concept of ‘alignment depth’ tied to task completion time, arguing that current alignment methods (HHH, deliberative alignment) that work for short tasks will be insufficient for AGI capable of completing longer tasks requiring different behavioral consistencies.

Source