Tag
1 articles
GTI grounds new vocabulary tokens before fine-tuning, aiming to preserve distinctions that mean initialization tends to collapse.