Running
2
vLLM Semantic Router
ðŸ§
Classify and detect PII in text
A Mixture-of-Models(MoM) Router that understands the request intent.
One fabric. Many minds. We're introducing MoM (Mixture of Models)—a family of specialized routing models that power vLLM-SR's intelligent decision-making.
vLLM-SR solves a critical problem: how to route LLM requests to the right model at the right time. Not every query needs the same resources—"What's the weather?" shouldn't cost as much as "Analyze this legal contract."