Tag
token efficiency
3 articles

Model Releases/May 14
Why Xiaomi’s MiMo-V2.5-Pro Changes Coding Agents More Than Chatbots
MiMo-V2.5-Pro matters because it is built for long, tool-heavy coding work, not chat.

Research/May 5
Why Latent Agents Proves Multi-Agent Debate Should Be Internalized
Latent Agents shows multi-agent debate works best when a single model internalizes it.

Research/Apr 29
Recursive Multi-Agent Systems Could Cut Token Use
RecursiveMAS treats a whole multi-agent setup as one recursive latent computation, reporting 8.3% accuracy gains and big token savings.