Tag
Model Serving
3 articles

Industry News/May 9
Why routing is the real bottleneck in model serving
Routing, not model execution, is the main constraint in modern model serving.

Industry News/May 6
Why a Single Routing API Wins Model Serving
A single routing API is the right default for model serving platforms.

Industry News/May 4
Why routing belongs at the center of model serving
Routing should be the single entry point for model serving because it speeds iteration and unlocks new ML products.