SGLang
Structured generation language and serving runtime: RadixAttention prefix caching, constrained decoding, and efficient multi-turn batching for demanding LLM workloads.
Why it is included
An active research-to-production project that competes with vLLM on latency and structured output.
Best for
Labs pushing structured LLM programs and high-QPS chat on GPUs.
Strengths
- Structured generation (JSON schema and regex constraints)
- Strong performance focus (RadixAttention prefix caching, continuous batching)
- OpenAI-compatible endpoints
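Because SGLang serves an OpenAI-compatible API, structured output can be requested over plain HTTP. A minimal sketch of such a request body, assuming a local server on SGLang's default port 30000; the model name and schema here are illustrative placeholders:

```python
import json

# Assumed local endpoint (SGLang's default port is 30000):
# POST http://localhost:30000/v1/chat/completions
payload = {
    # Placeholder model name; use whatever model the server was launched with.
    "model": "meta-llama/Llama-3.1-8B-Instruct",
    "messages": [
        {"role": "user", "content": "Name a city and its country."}
    ],
    # Constrained decoding: ask the server to emit JSON matching this schema.
    "response_format": {
        "type": "json_schema",
        "json_schema": {
            "name": "city_answer",
            "schema": {
                "type": "object",
                "properties": {
                    "city": {"type": "string"},
                    "country": {"type": "string"},
                },
                "required": ["city", "country"],
            },
        },
    },
}
body = json.dumps(payload)  # send as the request body with any HTTP client
```

With the grammar constraint in place, the decoder can only produce tokens that keep the output valid against the schema, so the response parses without retry loops.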
Limitations
- Ecosystem is newer than vLLM's, which may mean fewer integrations for some operators
Good alternatives
vLLM · TensorRT-LLM
Related tools
AI & Machine Learning
vLLM
High-throughput LLM serving with PagedAttention, continuous batching, and OpenAI-compatible APIs for GPU clusters.
PyTorch
Deep learning framework with strong research-to-production paths.
rtp-llm
Alibaba’s high-performance LLM inference engine (CUDA-focused) for production serving of diverse decoder architectures.
TensorRT-LLM
NVIDIA TensorRT–based library for optimized LLM inference on GPUs with multi-GPU and speculative decoding features.
NVIDIA Triton Inference Server
Multi-framework inference server for TensorRT, ONNX, PyTorch, Python backends—dynamic batching, ensembles, and GPU sharing.
Ollama
Local LLM runner and model library with simple CLI and API for workstation inference.
