Open-sourcing circuit tracing tools
Michael Hanna, Mateusz Piotrowski, Emmanuel Ameisen, Jack Lindsey, Johnny Lin, Curt Tigges — 2025-05-29 — Anthropic, Decode Research — Anthropic Blog
Summary
Announces the open-source release of circuit-tracing tools that generate attribution graphs, which reveal the internal computational steps a language model takes. The release includes a library that supports popular open-weights models and an interactive interface for visualizing the graphs.
Source
- Link: https://anthropic.com/research/open-source-circuit-tracing
- Listed in the Shallow Review of Technical AI Safety 2025 under 1 agenda:
- anthropic — Labs (giant companies)