Model Catalog
11 items
Qwen /
Qwen3-Embedding-0.6B-GGUF
8
Deployed 8 times
Feature Extraction
Llama.cpp
Accelerated llama.cpp
F16
CPU 2x Intel Sapphire Rapids
$ 0.067
google /
embeddinggemma-300m
47
Deployed 47 times
Sentence Similarity
TEI
Accelerated Text Embeddings Inference
CPU 2x Intel Sapphire Rapids
$ 0.067
onnx-community /
embeddinggemma-300m-ONNX
12
Deployed 12 times
Sentence Similarity
TEI
Accelerated Text Embeddings Inference
CPU 2x Intel Sapphire Rapids
$ 0.067
nomic-ai /
nomic-embed-text-v1.5
33
Deployed 33 times
Sentence Similarity
TEI
Accelerated Text Embeddings Inference
CPU 8x Intel Sapphire Rapids
$ 0.268
ggml-org /
jina-reranker-v1-turbo-en-GGUF
7
Deployed 7 times
Text Ranking
Llama.cpp
Accelerated llama.cpp
F16
CPU 8x Intel Sapphire Rapids
$ 0.268
cross-encoder /
ms-marco-MiniLM-L12-v2
24
Deployed 24 times
Text Ranking
CPU 1x Intel Sapphire Rapids
$ 0.033
thenlper /
gte-large
16
Deployed 16 times
Sentence Embeddings
TEI
Accelerated Text Embeddings Inference
CPU 4x Intel Sapphire Rapids
$ 0.134
sentence-transformers /
all-mpnet-base-v2
72
Deployed 72 times
Sentence Embeddings
TEI
Accelerated Text Embeddings Inference
CPU 2x Intel Sapphire Rapids
$ 0.067
sentence-transformers /
all-MiniLM-L6-v2
328
Deployed 328 times
Sentence Embeddings
TEI
Accelerated Text Embeddings Inference
CPU 1x Intel Sapphire Rapids
$ 0.033
beethogedeon /
gte-Qwen2-7B-instruct-Q4_K_M-GGUF
22
Deployed 22 times
Sentence Embeddings
Llama.cpp
Accelerated llama.cpp
Q4_K_M
CPU 8x Intel Sapphire Rapids
$ 0.268
BAAI /
bge-base-en-v1.5
66
Deployed 66 times
Sentence Embeddings
TEI
Accelerated Text Embeddings Inference
CPU 4x Intel Sapphire Rapids
$ 0.134