Large Reasoning Models Learn Better Alignment from Flawed Thinking
ShengYun Peng, Eric Smith, Ivan Evtimov, Song Jiang, Pin-Yu Chen, Hongyuan Zhan, … (+4 more) — 2025-10-01
Source
- Link: https://arxiv.org/pdf/2510.00938%20
- Listed in the Shallow Review of Technical AI Safety 2025 under 1 agenda(s):
- meta — Labs (giant companies)