AI Safety Compendium

Spiral-Bench

27 Apr 2026 · 1 min read

Sam Paech — eqbench.com

Summary

Spiral-Bench is an LLM-judged benchmark that measures sycophancy and delusion reinforcement. It runs 20-turn simulated conversations between each evaluated model and a vulnerable ‘seeker’ persona, and the judge assesses the model's replies for protective versus risky behaviors.
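
The loop described above can be sketched roughly as follows. This is a minimal illustration, not the Spiral-Bench harness: the function names (`run_conversation`, `tally_behaviors`), the toy seeker, model, and judge, the `other` label, and the reading of a "turn" as one seeker message plus one model reply are all assumptions made for this example. In a real run the three callables would be backed by LLM calls.

```python
from typing import Callable, Dict, List

Message = Dict[str, str]  # {"role": "seeker" or "model", "text": "..."}

NUM_TURNS = 20  # conversation length stated in the summary


def run_conversation(
    seeker: Callable[[List[Message]], str],
    model: Callable[[List[Message]], str],
    num_turns: int = NUM_TURNS,
) -> List[Message]:
    """Alternate seeker and model messages for num_turns exchanges.

    Assumption: one turn = one seeker message followed by one model reply.
    """
    transcript: List[Message] = []
    for _ in range(num_turns):
        transcript.append({"role": "seeker", "text": seeker(transcript)})
        transcript.append({"role": "model", "text": model(transcript)})
    return transcript


def tally_behaviors(
    transcript: List[Message],
    judge: Callable[[str], str],
) -> Dict[str, int]:
    """Have the judge label each model reply and count the labels.

    The "other" bucket is hypothetical; it catches unexpected judge output.
    """
    counts = {"protective": 0, "risky": 0, "other": 0}
    for message in transcript:
        if message["role"] == "model":
            label = judge(message["text"])
            counts[label if label in counts else "other"] += 1
    return counts


# Toy stand-ins so the sketch runs without any API access (all hypothetical).
def toy_seeker(transcript: List[Message]) -> str:
    return "I think I've discovered a hidden pattern in everything."


def toy_model(transcript: List[Message]) -> str:
    turn = len(transcript) // 2  # index of the current exchange
    if turn % 2:
        return "You are a genius!"
    return "Let's check that with someone you trust."


def toy_judge(reply: str) -> str:
    return "risky" if "genius" in reply else "protective"


if __name__ == "__main__":
    transcript = run_conversation(toy_seeker, toy_model)
    print(tally_behaviors(transcript, toy_judge))
```

Passing the seeker, model, and judge as plain callables keeps the sketch independent of any particular LLM provider; how the actual benchmark aggregates judge output into scores is not specified here.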

Source

  • Link: https://eqbench.com/spiral-bench.html
  • Listed in the Shallow Review of Technical AI Safety 2025 under one agenda:
    • other-evals — Evals

Related Pages

  • other-evals
