Tag

generalization

Generalization is a model's ability to keep working on unseen data, under shifted distributions, or along longer reasoning paths. Under this tag it connects training stability, Hessian-spectrum sharpness, and LLM failures on new maps or at longer sequence lengths.

1 article