Here’s 18 Applications of Deception Probes

Cleo Nardo, Avi Parrack, jordine — 2025-08-28 — LessWrong

Summary

Systematically enumerates 18 applications of deception probes for AI safety, analyzing required properties and desiderata for each use case, from monitoring current models to eliciting latent knowledge to augmenting scalable oversight.

Source