AI Safety Atlas Ch.1 — Introduction
Source: Capabilities — Introduction | ai-safety-atlas.com/chapters/v1/capabilities/introduction/
The introduction to Chapter 1 of the AI Safety Atlas establishes the foundational context for understanding AI capabilities and frames the chapter’s three core questions.
The Three Inquiries
The chapter is structured around three primary questions that together determine which AI safety strategies are viable:
-
Defining General Intelligence — Rather than treating AGI as a binary category, the authors propose measuring “capability and generality as continuous axes.” This produces precise statements like “our AI system outperforms 85% of humans in 30% of cognitive domains.” See atlas-ch1-capabilities-01-defining-and-measuring-agi.
-
Explaining Recent Progress — Attributes rapid advancement to the “bitter-lesson” principle: computational scale consistently outperforms human-engineered solutions. scaling-laws demonstrate predictable improvement with increased compute and data. The chapter debates whether scale alone suffices for transformative-ai. See atlas-ch1-capabilities-03-leveraging-scale.
-
Forecasting Future Development — Timeline predictions for cognitive labor automation determine which safety strategies are viable. Various “takeoff” scenarios — gradual vs. explosive — inform preparedness approaches. See atlas-ch1-capabilities-05-forecasting-timelines and atlas-ch1-capabilities-06-takeoff.
Method and Stance
The chapter acknowledges that specific benchmarks become outdated quickly, but underlying patterns around scaling laws and capability emergence remain stable and learnable. Content progresses from current capabilities through foundation-models to forecasting methodologies — capability surveys feed into mechanism understanding, which feeds into projection.
Why This Chapter Comes First
In the textbook’s overall logic, capabilities precedes risks (Ch.2) because what AI can do bounds what risk it poses. You cannot reason about misuse risk, misalignment risk, or systemic risk without first establishing what current and near-future systems are actually capable of.