Open Technical Problems in Open-Weight AI Model Risk Management

Stephen Casper, Kyle O’Brien, Shayne Longpre, Elizabeth Seger, Kevin Klyman, Rishi Bommasani, … (+16 more) — 2025-10-26 — Massachusetts Institute of Technology, ERA Fellowship, Apple, Centre for the Governance of AI, Stanford University, Google DeepMind, Vector Institute for Artificial Intelligence, FAR.AI, Hugging Face, Center for AI Safety, Princeton University, Carnegie Mellon University, UK AI Security Institute, University of Oxford, University of Montreal — SSRN

Summary

Identifies 16 open technical challenges in open-weight AI model safety, spanning training data, training algorithms, evaluations, deployment, and ecosystem monitoring, and addresses the distinct risk management problems posed by openly available model weights.

Source

Link: https://papers.ssrn.com/sol3/papers.cfm?abstract_id=5705186
Listed in the Shallow Review of Technical AI Safety 2025 under one agenda:
- harm-reduction-for-open-weights — Black-box safety (understand and control current model behaviour) / Goal robustness
