New website analyzing AI companies’ model evals
Zach Stein-Perlman — 2025-05-26 — LessWrong
Summary
Announces a website analyzing AI companies’ dangerous capability evaluations and provides a technical critique of recent model cards from OpenAI, Anthropic, DeepMind, Meta, and xAI, identifying methodological problems in elicitation, interpretation of results, and threshold-setting.
Source
- Link: https://lesswrong.com/posts/nmaKpoHxmzjT8yXTk/new-website-analyzing-ai-companies-model-evals
- Listed in the Shallow Review of Technical AI Safety 2025 under one agenda:
- capability-evals — Evals