vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

[view on github]last commit: May 24, 2026

stars

80,847

7d

+386

30d

+2,698

90d

+10,045

## star history

## found in

Awesome Open Source AI/⚡ 3. Inference Engines & Serving