Tag
1 article
This paper shows that fine-tuning LLMs with evolution strategies can cause drift, and that anchored weight decay can curb it.