Tag
routing
3 articles

Industry News/May 9
Why routing is the real bottleneck in model serving
Routing, not model execution, is the main constraint in modern model serving.

Industry News/May 4
Why routing belongs at the center of model serving
Routing should be the single entry point for model serving because it speeds iteration and unlocks new ML products.

Research/Apr 10
Why multimodal MoE models get distracted
A study of multimodal MoE models finds visual inputs can derail routing to reasoning experts, and a routing-guided fix improves results.