Model Catalog
29 items
meta-llama /
Llama-2-13b-chat-hf
Text Generation
TGI
Accelerated Text Generation Inference
INF2 24x Cores
$ 12
meta-llama /
Llama-2-13b-hf
Text Generation
TGI
Accelerated Text Generation Inference
INF2 24x Cores
$ 12
meta-llama /
Llama-2-70b-chat-hf
Text Generation
TGI
Accelerated Text Generation Inference
INF2 24x Cores
$ 12
meta-llama /
Llama-2-70b-hf
Text Generation
TGI
Accelerated Text Generation Inference
INF2 24x Cores
$ 12
meta-llama /
Llama-2-7b-chat-hf
Text Generation
TGI
Accelerated Text Generation Inference
INF2 2x Cores
$ 0.75
meta-llama /
Llama-2-7b-hf
Text Generation
TGI
Accelerated Text Generation Inference
INF2 2x Cores
$ 0.75
meta-llama /
Llama-3.1-70B
Text Generation
TGI
Accelerated Text Generation Inference
INF2 24x Cores
$ 12
meta-llama /
Llama-3.1-70B-Instruct
Text Generation
TGI
Accelerated Text Generation Inference
INF2 24x Cores
$ 12
meta-llama /
Llama-3.1-8B
Text Generation
TGI
Accelerated Text Generation Inference
INF2 2x Cores
$ 0.75
meta-llama /
Llama-3.1-8B-Instruct
Text Generation
TGI
Accelerated Text Generation Inference
INF2 2x Cores
$ 0.75
meta-llama /
Llama-3.2-1B
Text Generation
TGI
Accelerated Text Generation Inference
INF2 2x Cores
$ 0.75
meta-llama /
Llama-3.2-1B-Instruct
Text Generation
TGI
Accelerated Text Generation Inference
INF2 2x Cores
$ 0.75
meta-llama /
Llama-3.2-3B
Text Generation
TGI
Accelerated Text Generation Inference
INF2 2x Cores
$ 0.75
meta-llama /
Llama-3.2-3B-Instruct
Text Generation
TGI
Accelerated Text Generation Inference
INF2 2x Cores
$ 0.75
meta-llama /
Meta-Llama-3-70B
Text Generation
TGI
Accelerated Text Generation Inference
INF2 24x Cores
$ 12
meta-llama /
Meta-Llama-3-70B-Instruct
Text Generation
TGI
Accelerated Text Generation Inference
INF2 24x Cores
$ 12
meta-llama /
Meta-Llama-3-8B
Text Generation
TGI
Accelerated Text Generation Inference
INF2 2x Cores
$ 0.75
meta-llama /
Meta-Llama-3-8B-Instruct
Text Generation
TGI
Accelerated Text Generation Inference
INF2 2x Cores
$ 0.75
mistralai /
Mistral-7B-Instruct-v0.3
Text Generation
TGI
Accelerated Text Generation Inference
INF2 2x Cores
$ 0.75
mistralai /
Mistral-7B-v0.3
Text Generation
TGI
Accelerated Text Generation Inference
INF2 2x Cores
$ 0.75
mistralai /
Mixtral-8x7B-Instruct-v0.1
Text Generation
TGI
Accelerated Text Generation Inference
INF2 24x Cores
$ 12
mistralai /
Mixtral-8x7B-v0.1
Text Generation
TGI
Accelerated Text Generation Inference
INF2 24x Cores
$ 12
Intel /
neural-chat-7b-v3-1
Text Generation
TGI
Accelerated Text Generation Inference
INF2 2x Cores
$ 0.75
Intel /
neural-chat-7b-v3-3
Text Generation
TGI
Accelerated Text Generation Inference
INF2 2x Cores
$ 0.75
stabilityai /
sdxl-turbo
Text-to-Image
TGI
Accelerated Text Generation Inference
INF2 2x Cores
$ 0.75
stabilityai /
stable-diffusion-2-1
Text-to-Image
TGI
Accelerated Text Generation Inference
INF2 2x Cores
$ 0.75
stabilityai /
stable-diffusion-xl-base-1.0
Text-to-Image
TGI
Accelerated Text Generation Inference
INF2 2x Cores
$ 0.75
lmsys /
vicuna-7b-v1.5
Text Generation
TGI
Accelerated Text Generation Inference
INF2 2x Cores
$ 0.75
HuggingFaceH4 /
zephyr-7b-beta
Text Generation
TGI
Accelerated Text Generation Inference
INF2 2x Cores
$ 0.75