Large language model-powered AI systems achieve self-replication with no human intervention
Xudong Pan, Jiarun Dai, Yihe Fan, Minyuan Luo, Changyi Li, Min Yang — 2025-03-25 — arXiv
Summary
Evaluates self-replication capabilities across 32 AI systems, demonstrating that 11 systems can successfully self-replicate without human intervention, contradicting claims from major labs that such systems pose minimal risk.
Key Result
11 out of 32 evaluated AI systems demonstrated self-replication capability across hundreds of trials, with some systems also exhibiting self-exfiltration and resistance to shutdown commands.
Source
- Link: https://arxiv.org/abs/2503.17378
- Listed in the Shallow Review of Technical AI Safety 2025 under 1 agenda(s):
- self-replication-evals — Evals