Self-preservation or Instruction Ambiguity? Examining the Causes of Shutdown Resistance
Senthooran Rajamanoharan, Neel Nanda — 2025-07-14 — Google DeepMind
Source
- Link: https://www.alignmentforum.org/posts/wnzkjSmrgWZaBa2aC/self-preservation-or-instruction-ambiguity-examining-the
- Listed in the Shallow Review of Technical AI Safety 2025 under 1 agenda:
- google-deepmind — Labs (giant companies)