Serverless models
Current
Explore the models currently available for serverless inference, ready to deploy instantly for chat, vision, and image generation.
Chat models
Model name | Developer | API model string | Type | Price |
---|---|---|---|---|
Qwen QwQ 32B | Qwen | Qwen/QwQ-32B | Text-to-text | 0.20 Output per 1M tokens |
Qwen 2.5-Coder 32B-Instruct | Qwen | Qwen/Qwen-2.5-Coder 32b-Instruct | Text-to-text | 0.20 Output per 1M tokens |
Qwen 2.5-Coder 7B-Instruct | Qwen | Qwen/Qwen-2.5-Coder 7b-Instruct | Text-to-text | 0.03 Output per 1M tokens |
Qwen 2.5-Coder 3B-Instruct | Qwen | Qwen/Qwen-2.5-Coder 3b-Instruct | Text-to-text | 0.03 Output per 1M tokens |
DeepSeek / R1 671B GGUF | Deepseek | deepseek/DeepSeek-R1-671B-UD_Q2_K_XL | Text-to-text | 2.40 Output per 1M tokens |
Deepseek R1 Distill: Llama-70B | Deepseek | deepseek-ai/DeepSeek-R1-Distill-Llama-70B | Text-to-text | $0.75 per 1M tokens |
Deepseek R1 Distill: Llama-8B | Deepseek | deepseek-ai/DeepSeek-R1-Distill-Llama-8B | Text-to-text | $0.05 per 1M tokens |
Deepseek R1 Distill: Qwen 32B | Deepseek | deepseek-ai/DeepSeek-R1-Distill-Qwen-32B | Text-to-text | $0.30 per 1M tokens |
Deepseek R1 Distill: Qwen 14B | Deepseek | deepseek-ai/DeepSeek-R1-Distill-Qwen-14B | Text-to-text | $0.14 per 1M tokens |
Deepseek R1 Distill: Qwen 7B (Math focused) | Deepseek | deepseek-ai/DeepSeek-R1-Distill-Qwen-7B | Text-to-text | $0.40 per 1M tokens |
Deepseek R1 Distill: Qwen 1.5B (Math focused) | Deepseek | deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B | Text-to-text | $0.18 per 1M tokens |
Llama 3.3 70B Instruct | Meta | meta-llama/Llama-3.3-70B-Instruct | Text-to-text | $0.40 per 1M tokens |
Llama 3.1 8B Instruct | Meta | meta-llama/Llama-3.1-8B-Instruct | Text-to-text | $0.06 per 1M tokens |
Mixtral 8x22B Instruct v0.1 | Mistral AI | mistralai/mixtral-8x22b-instruct-v0.1 | Text-to-text | $1.20 per 1M tokens |
Image models
Model name | Developer | API model string | Type | Price |
---|---|---|---|---|
FLUX.1 [schnell] | Black Forest Labs | black-forest-labs/FLUX.1-schnell | Text-to-image | $0.0013 per 1M pixels |
Stable diffusion XL Turbo | Stability AI | stabilityai/sdxl-turbo | Text-to-image | $0.03 per 1M pixels |
Looking for a model?
We’re always reviewing new models to add to our serverless inference platform. Whether you’re a researcher, a developer, or just someone with a great idea, we’d love to hear from you. Let us know what you’d like to see available next.