Open Technical Problems in Open-Weight AI Model Risk Management
Stephen Casper, Kyle O’Brien, Shayne Longpre, Elizabeth Seger, Kevin Klyman, Rishi Bommasani, … (+16 more) — 2025-10-26 — Massachusetts Institute of Technology, ERA Fellowship, Apple, Centre for the Governance of AI, Stanford University, Google DeepMind, Vector Institute for Artificial Intelligence, FAR.AI, Hugging Face, Center for AI Safety, Princeton University, Carnegie Mellon University, UK AI Security Institute, University of Oxford, University of Montreal — SSRN
Summary
Identifies 16 open technical challenges for open-weight AI model safety across training data, training algorithms, evaluations, deployment, and ecosystem monitoring, addressing the unique risk management challenges posed by openly available model weights.
Source
- Link: https://papers.ssrn.com/sol3/papers.cfm?abstract_id=5705186
- Listed in the Shallow Review of Technical AI Safety 2025 under 1 agenda(s):
- harm-reduction-for-open-weights — Black-box safety (understand and control current model behaviour) / Goal robustness