Model Catalog
12 items
| Model | Task | Engine | Engine description | Quantization | Hardware | Price |
| --- | --- | --- | --- | --- | --- | --- |
| deepseek-ai/DeepSeek-R1-Distill-Qwen-32B | Text Generation | TGI | Accelerated Text Generation Inference | | GPU 4x Nvidia L4 | $3.8 |
| Qwen/Qwen2.5-14B-Instruct | Text Generation | TGI | Accelerated Text Generation Inference | | GPU 2x Nvidia A100 | $8 |
| Qwen/Qwen2.5-14B-Instruct-1M | Text Generation | TGI | Accelerated Text Generation Inference | | GPU 2x Nvidia A100 | $8 |
| Qwen/Qwen2.5-72B-Instruct | Text Generation | TGI | Accelerated Text Generation Inference | | GPU 4x Nvidia A100 | $16 |
| Qwen/Qwen2.5-7B-Instruct | Text Generation | TGI | Accelerated Text Generation Inference | | GPU 1x Nvidia L40S | $1.8 |
| Qwen/Qwen2.5-Coder-14B-Instruct | Text Generation | TGI | Accelerated Text Generation Inference | | GPU 2x Nvidia A100 | $8 |
| Qwen/Qwen2.5-Coder-32B-Instruct | Text Generation | TGI | Accelerated Text Generation Inference | | GPU 2x Nvidia A100 | $8 |
| bartowski/Qwen2.5-Coder-32B-Instruct-GGUF | Text Generation | Llama.cpp | Accelerated llama.cpp | Q4_K_M | GPU 1x Nvidia L4 | $0.8 |
| Qwen/Qwen2.5-Coder-7B-Instruct | Text Generation | TGI | Accelerated Text Generation Inference | | GPU 1x Nvidia L40S | $1.8 |
| Qwen/Qwen2.5-Math-72B-Instruct | Text Generation | TGI | Accelerated Text Generation Inference | | GPU 4x Nvidia A100 | $16 |
| Qwen/Qwen2.5-Math-7B-Instruct | Text Generation | TGI | Accelerated Text Generation Inference | | GPU 1x Nvidia L40S | $1.8 |
| Qwen/QwQ-32B | Text Generation | TGI | Accelerated Text Generation Inference | | GPU 4x Nvidia L4 | $3.8 |