14 items
Text Generation
TGI
Accelerated Text Generation Inference
GPU 4x Nvidia L4
$ 3.8
/ hour
Sentence Embeddings
Quantization: Q4_K_M
CPU 8x Intel Sapphire Rapids
$ 0.268
/ hour
Text Generation
TGI
Accelerated Text Generation Inference
GPU 2x Nvidia A100
$ 8
/ hour
Text Generation
TGI
Accelerated Text Generation Inference
GPU 2x Nvidia A100
$ 8
/ hour
Text Generation
TGI
Accelerated Text Generation Inference
GPU 4x Nvidia A100
$ 16
/ hour
Text Generation
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia L40S
$ 1.8
/ hour
Text Generation
TGI
Accelerated Text Generation Inference
GPU 2x Nvidia A100
$ 8
/ hour
Text Generation
TGI
Accelerated Text Generation Inference
GPU 2x Nvidia A100
$ 8
/ hour
Text Generation
Quantization: Q4_K_M
GPU 1x Nvidia L4
$ 0.8
/ hour
Text Generation
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia L40S
$ 1.8
/ hour
Text Generation
TGI
Accelerated Text Generation Inference
GPU 4x Nvidia A100
$ 16
/ hour
Text Generation
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia L40S
$ 1.8
/ hour
Image-Text-to-Text
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia L40S
$ 1.8
/ hour
Text Generation
TGI
Accelerated Text Generation Inference
GPU 4x Nvidia L4
$ 3.8
/ hour