Model Catalog
14 items
Applied Filters
sentence-transformers /
all-MiniLM-L6-v2
Sentence Embeddings
TEI
Accelerated Text Embeddings Inference
CPU 1x Intel Sapphire Rapids
$ 0.033
sentence-transformers /
all-mpnet-base-v2
Sentence Embeddings
TEI
Accelerated Text Embeddings Inference
CPU 2x Intel Sapphire Rapids
$ 0.067
BAAI /
bge-base-en-v1.5
Sentence Embeddings
TEI
Accelerated Text Embeddings Inference
CPU 4x Intel Sapphire Rapids
$ 0.134
BAAI /
bge-large-en-v1.5
Sentence Embeddings
TEI
Accelerated Text Embeddings Inference
GPU 1x Nvidia L4
$ 0.8
BAAI /
bge-multilingual-gemma2
Sentence Embeddings
GPU 1x Nvidia L40S
$ 1.8
llmrails /
ember-v1
Sentence Embeddings
TEI
Accelerated Text Embeddings Inference
CPU 4x Intel Sapphire Rapids
$ 0.134
thenlper /
gte-large
Sentence Embeddings
TEI
Accelerated Text Embeddings Inference
CPU 2x Intel Sapphire Rapids
$ 0.067
thenlper /
gte-large-zh
Sentence Embeddings
TEI
Accelerated Text Embeddings Inference
CPU 2x Intel Sapphire Rapids
$ 0.067
beethogedeon /
gte-Qwen2-7B-instruct-Q4_K_M-GGUF
Sentence Embeddings
Llama.cpp
Accelerated llama.cpp
Q4_K_M
CPU 8x Intel Sapphire Rapids
$ 0.268
intfloat /
multilingual-e5-large
Sentence Embeddings
TEI
Accelerated Text Embeddings Inference
GPU 1x Nvidia T4
$ 0.5
intfloat /
multilingual-e5-large-instruct
Sentence Embeddings
TEI
Accelerated Text Embeddings Inference
GPU 1x Nvidia T4
$ 0.5
mixedbread-ai /
mxbai-embed-large-v1
Sentence Embeddings
TEI
Accelerated Text Embeddings Inference
GPU 1x Nvidia L4
$ 0.8
sentence-transformers /
paraphrase-multilingual-MiniLM-L12-v2
Sentence Embeddings
TEI
Accelerated Text Embeddings Inference
GPU 1x Nvidia L4
$ 0.8
sentence-transformers /
sentence-t5-xxl
Sentence Embeddings
GPU 1x Nvidia L4
$ 0.8