Dodging systematic human errors in scalable oversight
Geoffrey Irving — 2025-05-14 — UK AISI
Source
- Link: https://www.alignmentforum.org/posts/EgRJtwQurNzz8CEfJ/dodging-systematic-human-errors-in-scalable-oversight
- Listed in the Shallow Review of Technical AI Safety 2025 under 1 agenda(s):
- supervising-ais-improving-ais — Make AI solve it