Bridging the human–AI knowledge gap through concept discovery and transfer in AlphaZero
Source
- Link: https://www.pnas.org/doi/10.1073/pnas.2406675122
- Listed in the Shallow Review of Technical AI Safety 2025 under 1 agenda(s):
- reverse-engineering — White-box safety (i.e. Interpretability)