Back to home

Tag

LLM reasoning

LLM reasoning covers how models plan, verify, and correct multi-step solutions in math, physics, and other structured tasks. Recent work spans reinforcement learning in pre-train space, synthetic simulator data, and zero-shot gains on benchmark problems beyond web QA.

2 articles