A Three-Layer Model of LLM Psychology
Jan_Kulveit — 2024-12-26
Source
- Link: https://www.alignmentforum.org/posts/zuXo9imNKYspu9HGv/a-three-layer-model-of-llm-psychology
- Listed in the Shallow Review of Technical AI Safety 2025 under 1 agenda:
- character-training-and-persona-steering — Black-box safety (understand and control current model behaviour) / Model psychology