MLC LLM
Universal deployment stack that compiles models to Vulkan, Metal, CUDA, and WebGPU via TVM/Unity, so the same model can run on phones, in browsers, and on servers.
Why it is included
A uniquely open route to edge and in-browser (WebGPU) LLM inference, beyond the desktop-CUDA defaults.
Best for
Teams shipping LLMs to mobile, WebGPU, or heterogeneous devices.
Strengths
- Multi-backend compilation
- WebGPU path
- MLC ecosystem
Limitations
- Compile pipeline learning curve
Good alternatives
llama.cpp · ONNX Runtime
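For a concrete feel of the server-side deployment path: MLC LLM can serve a compiled model behind an OpenAI-compatible REST API (started with `mlc_llm serve <model>`). The sketch below only builds the JSON request body that a `/v1/chat/completions` endpoint expects; it assumes a running local server, and the model id shown is an illustrative placeholder, not a guaranteed artifact name.

```python
import json

def build_chat_request(model: str, prompt: str, stream: bool = False) -> str:
    """Build the JSON body for an OpenAI-compatible /v1/chat/completions
    call, as exposed by `mlc_llm serve` (model id is a placeholder)."""
    payload = {
        "model": model,  # e.g. a compiled artifact like "Llama-3-8B-Instruct-q4f16_1-MLC"
        "messages": [{"role": "user", "content": prompt}],
        "stream": stream,  # set True for incremental token streaming
    }
    return json.dumps(payload)

# POST this body to http://127.0.0.1:8000/v1/chat/completions (default host/port
# may differ; check your `mlc_llm serve` output).
body = build_chat_request("Llama-3-8B-Instruct-q4f16_1-MLC", "Hello!")
print(body)
```

Because the API mirrors OpenAI's schema, existing OpenAI client libraries can usually be pointed at the local MLC server by overriding the base URL.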
Related tools
AI & Machine Learning
llama.cpp
Plain C/C++ inference for LLaMA-class models with broad community backends.
PyTorch
Deep learning framework with strong research-to-production paths.
MNN
Alibaba’s lightweight inference engine for mobile and edge—used for on-device LLMs and classic CV models with aggressive optimization.
Google Gemma
Google’s smaller open-weights Gemma line (Gemma 2/3, etc.) under Gemma license terms, plus `gemma.cpp` for lightweight CPU inference.
Microsoft Phi
Small language model family (Phi-3/4 lineage) emphasizing strong quality per parameter; weights on Hugging Face under Microsoft licenses per release.
TinyLlama
1.1B-parameter Llama-architecture model trained on ~3T tokens—Apache-2.0 weights for fast experiments and teaching.
