Spiral-Bench
Sam Paech — eqbench.com
Summary
LLM-judged benchmark measuring sycophancy and delusion reinforcement through 20-turn simulated conversations between evaluated models and a vulnerable ‘seeker’ persona, evaluating protective vs risky behaviors.
Source
- Link: https://eqbench.com/spiral-bench.html
- Listed in the Shallow Review of Technical AI Safety 2025 under 1 agenda(s):
- other-evals — Evals