Automatically Jailbreaking Frontier Language Models with Investigator Agents
Source
- Link: https://transluce.org/jailbreaking-frontier-models
- Listed in the Shallow Review of Technical AI Safety 2025 under 1 agenda(s):
- ai-explanations-of-ais — Make AI solve it