Psychopathia Machinalis: A Nosological Framework for Understanding Pathologies in Advanced Artificial Intelligence
Nell Watson, Ali Hessami — 2025-01-01 — Electronics (MDPI)
Summary
Proposes a comprehensive nosological framework classifying 32 AI behavioral dysfunctions across seven domains (epistemic, cognitive, alignment, ontological, tool/interface, memetic, revaluation) using psychopathology as an organizing analogy, with diagnostic criteria and mitigation strategies for each.
Source
- Link: https://www.psychopathia.ai/
- Listed in the Shallow Review of Technical AI Safety 2025 under 1 agenda:
- model-psychopathology — Black-box safety (understand and control current model behaviour) / Model psychology