Model Catalog
4 items
beethogedeon/gte-Qwen2-7B-instruct-Q4_K_M-GGUF
Task: Sentence Embeddings
Engine: Llama.cpp (Accelerated llama.cpp), Q4_K_M
Hardware: CPU 8x Intel Sapphire Rapids
Price: $0.268

Qwen/Qwen3-Embedding-0.6B
Task: Sentence Embeddings
Hardware: GPU 1x Nvidia T4
Price: $0.5

Qwen/Qwen3-Embedding-4B
Task: Sentence Embeddings
Hardware: GPU 1x Nvidia L4
Price: $0.8

Qwen/Qwen3-Embedding-8B
Task: Sentence Embeddings
Hardware: GPU 1x Nvidia L4
Price: $0.8