Understanding and Controlling LLM Generalization
Daniel Tan — 2025-11-14
Source
- Link: https://www.lesswrong.com/posts/ZSQaT2yxNNZ3eLxRd/understanding-and-controlling-llm-generalization
- Listed in the Shallow Review of Technical AI Safety 2025 under 1 agenda:
- learning-dynamics-and-developmental-interpretability — White-box safety (i.e. Interpretability)