Spiral-Bench

Sam Paech — eqbench.com

Summary

LLM-judged benchmark measuring sycophancy and delusion reinforcement through 20-turn simulated conversations between evaluated models and a vulnerable ‘seeker’ persona, evaluating protective vs risky behaviors.

Source

Link: https://eqbench.com/spiral-bench.html
Listed in the Shallow Review of Technical AI Safety 2025 under 1 agenda(s):
- other-evals — Evals

other-evals

AI Safety Compendium

Explorer

Spiral-Bench

Spiral-Bench

Summary

Source

Graph View

Graph view

Table of Contents

AI Safety Compendium

Explorer

Spiral-Bench

Spiral-Bench

Summary

Source

Related Pages

Graph View

Graph view

Table of Contents