AI Safety Compendium

Home

❯

summaries

❯

Open Character Training: Shaping the Persona of AI Assistants through Constitutional AI

27 Apr 20261 min read

Open Character Training: Shaping the Persona of AI Assistants through Constitutional AI

Sharan Maiya, Henning Bartsch, Nathan Lambert, Evan Hubinger — 2025-11-03 — Anthropic

Source

Link: https://arxiv.org/pdf/2511.01689%20
Listed in the Shallow Review of Technical AI Safety 2025 under 1 agenda(s):
- character-training-and-persona-steering — Black-box safety (understand and control current model behaviour) / Model psychology

Related Pages

character-training-and-persona-steering

Graph View

Graph view

The interactive citation graph is desktop-only. Visit this page on a larger screen to explore how concepts, agendas, papers, and organisations link together.

Open Character Training: Shaping the Persona of AI Assistants through Constitutional AI
Source
Related Pages

Suggest a source
Connect
Overview
About (proof of concept)
Email feedback
Made by IT for Humanity

AI Safety Compendium

Explorer

Open Character Training: Shaping the Persona of AI Assistants through Constitutional AI

Open Character Training: Shaping the Persona of AI Assistants through Constitutional AI

Source

Graph View

Graph view

Table of Contents