vLLMvLLM/Recipes
DocsGitHub
Providers
Arcee AI
Ernie (Baidu)
Seed (ByteDance)
DeepSeek
Google
inclusionAI
InternLM
Jina AI
Meituan LongCat
Meta
Microsoft
MiniMax
Mistral AI
Moonshot AI
NVIDIA
OpenAI
InternVL (OpenGVLab)
PaddlePaddle
Qwen
Stability AI
StepFun
Tencent Hunyuan
Wan AI
Xiaomi MiMo
GLM (Z-AI)

Moonshot AI

moonshotai·5 recipesHuggingFace

Multimodal

2
Kimi-K2.6
1T / 32B
moe
INT4714G
v0.19.1+→
Kimi-K2.5
1T / 32B
moe
INT4714GNVFP4600G
v0.19.1+→

Text

3
Kimi-K2-Thinking
1T / 32B
moe
INT4600GNVFP4600G
v0.12.0+→
Kimi-Linear-48B-A3B-Instruct
48B / 3B
moe
BF16115G
v0.11.2+→
Kimi-K2-Instruct
1T / 32B
moe
FP81200G
v0.12.0+→
GitHubRequest a recipeDocumentationSupported Models & HardwareInstall vLLMJSON API