Tag
1 article
Parallel-SFT uses equivalent code across languages to improve zero-shot transfer in code RL, especially when moving to lower-resource programming languages.