AI Safety Compendium

Home

❯

summaries

❯

Automatically Jailbreaking Frontier Language Models with Investigator Agents

27 Apr 20261 min read

Automatically Jailbreaking Frontier Language Models with Investigator Agents

Source

Link: https://transluce.org/jailbreaking-frontier-models
Listed in the Shallow Review of Technical AI Safety 2025 under 1 agenda(s):
- ai-explanations-of-ais — Make AI solve it

Related Pages

ai-explanations-of-ais

Graph View

Graph view

The interactive citation graph is desktop-only. Visit this page on a larger screen to explore how concepts, agendas, papers, and organisations link together.

Automatically Jailbreaking Frontier Language Models with Investigator Agents
Source
Related Pages

Suggest a source
Connect
Overview
About (proof of concept)
Email feedback
Made by IT for Humanity

AI Safety Compendium

Explorer

Automatically Jailbreaking Frontier Language Models with Investigator Agents

Automatically Jailbreaking Frontier Language Models with Investigator Agents

Source

Graph View

Graph view

Table of Contents