vLLM Recipes

Meta

meta-llama · 3 recipes · HuggingFace

Text (3)

Model                            Parameters    Type    Precision (size)                  vLLM version
Llama-4-Scout-17B-16E-Instruct   109B / 17B    MoE     BF16 262G, FP8 131G, NVFP4 65G    v0.12.0+
Llama-3.3-70B-Instruct           70B           dense   BF16 170G, FP8 84G, NVFP4 42G     v0.12.0+
Llama-3.1-8B-Instruct            8B            dense   BF16 20G, NVFP4 5G, FP8 10G       v0.6.0+
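
The per-precision sizes listed above can be used for a rough fit check against available GPU memory. The following is a minimal sketch, not part of the recipes themselves: it assumes "G" means gigabytes, the helper name `precisions_that_fit` and its parameters are hypothetical, and it compares only the listed size against total memory, ignoring KV cache, activations, and runtime overhead.

```python
# Figures copied from the listing above; "G" is assumed to mean gigabytes.
LISTED_SIZES_G = {
    "Llama-4-Scout-17B-16E-Instruct": {"BF16": 262, "FP8": 131, "NVFP4": 65},
    "Llama-3.3-70B-Instruct": {"BF16": 170, "FP8": 84, "NVFP4": 42},
    "Llama-3.1-8B-Instruct": {"BF16": 20, "FP8": 10, "NVFP4": 5},
}


def precisions_that_fit(model: str, gpu_count: int, gpu_memory_g: float) -> list[str]:
    """Return the listed precisions whose size fits in the combined GPU memory.

    Hypothetical helper: it only compares the listed size with the total
    memory budget and leaves no headroom for KV cache or activations.
    """
    budget = gpu_count * gpu_memory_g
    sizes = LISTED_SIZES_G[model]
    # Order results from the largest listed size to the smallest.
    ordered = sorted(sizes.items(), key=lambda item: -item[1])
    return [precision for precision, size in ordered if size <= budget]


if __name__ == "__main__":
    # Example: two GPUs with 80G each (an assumed configuration).
    print(precisions_that_fit("Llama-3.3-70B-Instruct", gpu_count=2, gpu_memory_g=80))
```

In practice a real deployment needs memory beyond the listed size, so a precision that barely fits this check may still fail to load or serve with a useful context length.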