Detect Goodhart and shut down
Jeremy Gillen — 2025-01-22
Source
- Link: https://www.lesswrong.com/posts/ZHFZ6tivEjznkEoby/detect-goodhart-and-shut-down
- Listed in the Shallow Review of Technical AI Safety 2025 under 1 agenda:
- other-corrigibility — Theory / Corrigibility