Back to home

Tag

reasoning models

Reasoning models are built to handle multi-step inference, verification, and agentic tasks such as math, coding, and interactive problem solving. This tag covers training methods, cold-start behavior, RLVR, loss design, and the cost-performance tradeoffs that shape deployment.

2 articles