Model Catalog
15 items
Applied Filters
allenai /
olmOCR-2-7B-1025-FP8
51
Deployed 51 times
Image-Text-to-Text
vLLM
Accelerated vLLM
GPU 1x Nvidia L40S
$ 1.8
Qwen /
Qwen3-VL-30B-A3B-Thinking
39
Deployed 39 times
Image-Text-to-Text
vLLM
Accelerated vLLM
GPU 2x Nvidia A100
$ 5
Qwen /
Qwen3-VL-8B-Instruct
111
Deployed 111 times
Image-Text-to-Text
vLLM
Accelerated vLLM
GPU 1x Nvidia A100
$ 2.5
lightonai /
LightOnOCR-1B-1025
39
Deployed 39 times
Image-Text-to-Text
vLLM
Accelerated vLLM
GPU 1x Nvidia L4
$ 0.8
deepseek-ai /
DeepSeek-OCR
183
Deployed 183 times
Image-Text-to-Text
vLLM
Accelerated vLLM
GPU 1x Nvidia L4
$ 0.8
zai-org /
GLM-4.1V-9B-Thinking
3
Deployed 3 times
Image-Text-to-Text
vLLM
Accelerated vLLM
GPU 1x Nvidia L40S
$ 1.8
ServiceNow-AI /
Apriel-1.5-15b-Thinker
4
Deployed 4 times
Image-Text-to-Text
vLLM
Accelerated vLLM
GPU 2x Nvidia A100
$ 5
rednote-hilab /
dots.ocr
59
Deployed 59 times
Image-Text-to-Text
vLLM
Accelerated vLLM
GPU 1x Nvidia L4
$ 0.8
nanonets /
Nanonets-OCR-s
22
Deployed 22 times
Image-Text-to-Text
vLLM
Accelerated vLLM
GPU 1x Nvidia L40S
$ 1.8
Qwen /
Qwen2.5-VL-7B-Instruct
241
Deployed 241 times
Image-Text-to-Text
vLLM
Accelerated vLLM
GPU 1x Nvidia A100
$ 2.5
google /
paligemma2-10b-mix-448
2
Deployed 2 times
Image-Text-to-Text
vLLM
Accelerated vLLM
GPU 1x Nvidia L40S
$ 1.8
google /
paligemma2-10b-mix-224
3
Deployed 3 times
Image-Text-to-Text
vLLM
Accelerated vLLM
GPU 1x Nvidia L40S
$ 1.8
google /
paligemma2-3b-mix-448
19
Deployed 19 times
Image-Text-to-Text
vLLM
Accelerated vLLM
GPU 1x Nvidia L4
$ 0.8
google /
gemma-3-12b-it
222
Deployed 222 times
Image-Text-to-Text
vLLM
Accelerated vLLM
GPU 1x Nvidia L40S
$ 1.8
google /
gemma-3-27b-it
275
Deployed 275 times
Image-Text-to-Text
vLLM
Accelerated vLLM
GPU 1x Nvidia A100
$ 2.5