The “Neglected Approaches” Approach — SR2025 Agenda Snapshot

One-sentence summary: Agenda-agnostic approaches to identifying good but overlooked empirical alignment ideas, working with theorists who could use engineers, and prototyping them.

Theory of Change

Empirical search for “negative alignment taxes” (prioritizing methods that simultaneously enhance alignment and capabilities)

Broad Approach

engineering

Target Case

average

Orthodox Problems Addressed

Someone else will deploy unsafe superintelligence first

Key People

AE Studio, Gunnar Zarncke, Cameron Berg, Michael Vaiana, Judd Rosenblatt, Diogo Schwerz de Lucena

Funding

AE Studio

Estimated FTEs: 15

Critiques

The ‘Alignment Bonus’ is a Dangerous Mirage

See Also

Iterative alignment, automated alignment research, Beijing Key Laboratory of Safe AI and Superalignment, Aligned AI

Outputs in 2025

3 item(s) in the review. See the wiki/summaries/ entries with frontmatter agenda: the-neglected-approaches-approach (these were generated alongside this file from the same export).

Source

Sources cited

Primary URLs harvested from this page’s summary references. Auto-generated by scripts/backfill_citations.py; edit by re-running, not by hand.