Model Catalog
12 items
TGI: Accelerated Text Generation Inference

Model                                     Task                Engine  GPU             Price
deepseek-ai/DeepSeek-R1-Distill-Qwen-32B  Text Generation     TGI     4x Nvidia L4    $3.8
Qwen/Qwen2.5-14B-Instruct                 Text Generation     TGI     2x Nvidia A100  $8
Qwen/Qwen2.5-14B-Instruct-1M              Text Generation     TGI     2x Nvidia A100  $8
Qwen/Qwen2.5-72B-Instruct                 Text Generation     TGI     4x Nvidia A100  $16
Qwen/Qwen2.5-7B-Instruct                  Text Generation     TGI     1x Nvidia L40S  $1.8
Qwen/Qwen2.5-Coder-14B-Instruct           Text Generation     TGI     2x Nvidia A100  $8
Qwen/Qwen2.5-Coder-32B-Instruct           Text Generation     TGI     2x Nvidia A100  $8
Qwen/Qwen2.5-Coder-7B-Instruct            Text Generation     TGI     1x Nvidia L40S  $1.8
Qwen/Qwen2.5-Math-72B-Instruct            Text Generation     TGI     4x Nvidia A100  $16
Qwen/Qwen2.5-Math-7B-Instruct             Text Generation     TGI     1x Nvidia L40S  $1.8
Qwen/Qwen2.5-VL-7B-Instruct               Image-Text-to-Text  TGI     1x Nvidia L40S  $1.8
Qwen/QwQ-32B                              Text Generation     TGI     4x Nvidia L4    $3.8