TensorRT-LLM
NVIDIA's TensorRT-based library for optimized LLM inference on GPUs, with multi-GPU execution and speculative decoding support.
Why it is included
An open-source (Apache-2.0) serving path when you standardize on NVIDIA datacenter GPUs.
Best for
Production LLM serving on NVIDIA hardware with maximum kernel optimization.
Strengths
- Hand-tuned NVIDIA GPU kernels
- Multi-GPU execution (tensor and pipeline parallelism)
- Broad library of model recipes
Limitations
- Tied to NVIDIA hardware; engine builds add deployment complexity
Good alternatives
vLLM · SGLang
Related tools
AI & Machine Learning
vLLM
High-throughput LLM serving with PagedAttention, continuous batching, and OpenAI-compatible APIs for GPU clusters.
PyTorch
Deep learning framework with strong research-to-production paths.
SGLang
Structured generation language for fast serving: RadixAttention, constrained decoding, and multi-turn batching for frontier-class workloads.
NVIDIA Triton Inference Server
Multi-framework inference server for TensorRT, ONNX, PyTorch, Python backends—dynamic batching, ensembles, and GPU sharing.
rtp-llm
Alibaba’s high-performance LLM inference engine (CUDA-focused) for production serving of diverse decoder architectures.
Ollama
Local LLM runner and model library with simple CLI and API for workstation inference.
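Several of the serving stacks above (vLLM, SGLang, Ollama, and TensorRT-LLM behind an OpenAI-compatible frontend) accept the same chat-completions request format, which makes it easy to swap backends. A minimal sketch of building that request body, assuming a hypothetical local endpoint and model name (neither is a real deployment):

```python
import json

# Assumed placeholder values -- substitute your own deployment's endpoint and model.
BASE_URL = "http://localhost:8000/v1"

def chat_payload(model, user_message, max_tokens=128, temperature=0.7):
    """Build an OpenAI-compatible /v1/chat/completions request body."""
    return {
        "model": model,
        "messages": [{"role": "user", "content": user_message}],
        "max_tokens": max_tokens,
        "temperature": temperature,
    }

# What an HTTP client would POST to BASE_URL + "/chat/completions".
payload = chat_payload("my-llm", "Summarize PagedAttention in one sentence.")
print(json.dumps(payload))
```

Because the request shape is shared, benchmarking one engine against another often reduces to pointing the same client at a different port.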
