57 items
Text Generation
TGI
Accelerated Text Generation Inference
GPU 4x Nvidia L40S
$ 8.3
/ hour
Text Generation
TGI
Accelerated Text Generation Inference
GPU 4x Nvidia L4
$ 3.8
/ hour
Text Generation
Quantization: IQ1_S
GPU 4x Nvidia L40S
$ 8.3
/ hour
Text Generation
Quantization: IQ2_XXS
GPU 8x Nvidia L40S
$ 23.5
/ hour
Text Generation
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia L40S
$ 1.8
/ hour
Text Generation
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia L40S
$ 1.8
/ hour
Text Generation
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia L40S
$ 1.8
/ hour
Text Generation
TGI
Accelerated Text Generation Inference
GPU 4x Nvidia L40S
$ 8.3
/ hour
Text Generation
TGI
Accelerated Text Generation Inference
GPU 4x Nvidia L40S
$ 8.3
/ hour
Text Generation
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia L40S
$ 1.8
/ hour
Text Generation
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia L40S
$ 1.8
/ hour
Text Generation
TGI
Accelerated Text Generation Inference
GPU 4x Nvidia L40S
$ 8.3
/ hour
Text Generation
TGI
Accelerated Text Generation Inference
GPU 4x Nvidia L40S
$ 8.3
/ hour
Text Generation
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia L4
$ 0.8
/ hour
Text Generation
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia L4
$ 0.8
/ hour
Text Generation
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia L4
$ 0.8
/ hour
Text Generation
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia L4
$ 0.8
/ hour
Text Generation
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia L4
$ 0.8
/ hour
Text Generation
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia L4
$ 0.8
/ hour
Text Generation
Quantization: Q8_0
GPU 4x Nvidia L4
$ 3.8
/ hour
Text Generation
TGI
Accelerated Text Generation Inference
GPU 4x Nvidia A100
$ 16
/ hour
Text Generation
TGI
Accelerated Text Generation Inference
GPU 4x Nvidia A100
$ 16
/ hour
Text Generation
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia L4
$ 0.8
/ hour
Text Generation
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia L4
$ 0.8
/ hour
Text Generation
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia L4
$ 0.8
/ hour
Text Generation
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia L4
$ 0.8
/ hour
Text Generation
TGI
Accelerated Text Generation Inference
GPU 4x Nvidia L4
$ 3.8
/ hour
Text Generation
TGI
Accelerated Text Generation Inference
GPU 4x Nvidia L4
$ 3.8
/ hour
Text Generation
TGI
Accelerated Text Generation Inference
GPU 8x Nvidia L40S
$ 23.5
/ hour
Text Generation
TGI
Accelerated Text Generation Inference
GPU 2x Nvidia A100
$ 8
/ hour
Text Generation
TGI
Accelerated Text Generation Inference
GPU 2x Nvidia A100
$ 8
/ hour
Text Generation
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia L4
$ 0.8
/ hour
Text Generation
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia L4
$ 0.8
/ hour
Text Generation
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia L4
$ 0.8
/ hour
Text Generation
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia L4
$ 0.8
/ hour
Text Generation
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia L4
$ 0.8
/ hour
Text Generation
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia L40S
$ 1.8
/ hour
Text Generation
TGI
Accelerated Text Generation Inference
GPU 2x Nvidia A100
$ 8
/ hour
Text Generation
TGI
Accelerated Text Generation Inference
GPU 2x Nvidia A100
$ 8
/ hour
Text Generation
TGI
Accelerated Text Generation Inference
GPU 4x Nvidia A100
$ 16
/ hour
Text Generation
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia L40S
$ 1.8
/ hour
Text Generation
TGI
Accelerated Text Generation Inference
GPU 2x Nvidia A100
$ 8
/ hour
Text Generation
TGI
Accelerated Text Generation Inference
GPU 2x Nvidia A100
$ 8
/ hour
Text Generation
Quantization: Q4_K_M
GPU 1x Nvidia L4
$ 0.8
/ hour
Text Generation
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia L40S
$ 1.8
/ hour
Text Generation
TGI
Accelerated Text Generation Inference
GPU 4x Nvidia A100
$ 16
/ hour
Text Generation
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia L40S
$ 1.8
/ hour
Text Generation
TGI
Accelerated Text Generation Inference
GPU 4x Nvidia L4
$ 3.8
/ hour
Text Generation
Quantization: Q8_0
GPU 1x Nvidia L40S
$ 1.8
/ hour
Text Generation
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia L40S
$ 1.8
/ hour
Text Generation
TGI
Accelerated Text Generation Inference
GPU 4x Nvidia L4
$ 3.8
/ hour
Text Generation
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia L4
$ 0.8
/ hour
Text Generation
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia L4
$ 0.8
/ hour
Text Generation
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia L4
$ 0.8
/ hour
Text Generation
TGI
Accelerated Text Generation Inference
GPU 4x Nvidia L4
$ 3.8
/ hour
Text Generation
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia L40S
$ 1.8
/ hour
Text Generation
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia L4
$ 0.8
/ hour