Beliefs about formal methods and AI safety

Quinn Dougherty — 2025-10-23 — LessWrong

Summary

Argues against trying to formally verify neural networks themselves, instead advocating for formal methods in AI safety through three approaches: infrastructure hardening, defense-in-depth (swiss cheese model), and formal verification of AI-system interfaces where AI must prove actions satisfy specifications.

Source