Tag
generalization
Generalization is the ability of a model to keep working on unseen data, shifted distributions, or longer reasoning paths. Here it connects training stability, Hessian-spectrum sharpness, and LLM failures on new maps or longer sequence lengths.
1 articles
