Model Catalog
7 items
Qwen/Qwen3-Embedding-4B
Deployed 1 time
Feature Extraction
vLLM
Accelerated vLLM
INF2 2x Cores
$1.95
Qwen/Qwen3-Embedding-0.6B
Feature Extraction
vLLM
Accelerated vLLM
INF2 2x Cores
$1.95
meta-llama/Llama-3.1-70B-Instruct
Deployed 94 times
Text Generation
vLLM
Accelerated vLLM
INF2 24x Cores
$12
meta-llama/Meta-Llama-3-8B-Instruct
Deployed 95 times
Text Generation
vLLM
Accelerated vLLM
INF2 2x Cores
$1.95
meta-llama/Llama-3.1-8B-Instruct
Deployed 479 times
Text Generation
vLLM
Accelerated vLLM
INF2 2x Cores
$1.95
meta-llama/Llama-3.2-1B-Instruct
Deployed 58 times
Text Generation
vLLM
Accelerated vLLM
INF2 2x Cores
$1.95
meta-llama/Llama-3.2-3B-Instruct
Deployed 132 times
Text Generation
vLLM
Accelerated vLLM
INF2 2x Cores
$1.95