Inference Endpoints
  • Catalog

Model Catalog

Inference Task
All Available Tasks Text Generation Text-to-Image Image-Text-to-Text Sentence Embeddings Sentence Similarity Sentence Ranking Zero Shot Classification Automatic Speech Recognition Summarization Feature Extraction
Price $ 0 - 50 / hour
  • 0
  • 0.1
  • 0.5
  • 1
  • 5
  • 50
Inference Server
All Llama.cpp TEI TGI vLLM
Hardware Accelerator
ALL CPU GPU INF2
License
Hub Models
Browse All Models
11 items
Applied Filters
GPU Sentence Embeddings Clear All
Author avatar
sentence-transformers /

all-MiniLM-L6-v2

Sentence Embeddings
TEI
Accelerated Text Embeddings Inference
GPU 1x Nvidia T4
$ 0.5
Author avatar
sentence-transformers /

all-mpnet-base-v2

Sentence Embeddings
TEI
Accelerated Text Embeddings Inference
GPU 1x Nvidia T4
$ 0.5
Author avatar
BAAI /

bge-base-en-v1.5

Sentence Embeddings
TEI
Accelerated Text Embeddings Inference
GPU 1x Nvidia T4
$ 0.5
Author avatar
BAAI /

bge-large-en-v1.5

Sentence Embeddings
TEI
Accelerated Text Embeddings Inference
GPU 1x Nvidia L4
$ 0.8
Author avatar
BAAI /

bge-multilingual-gemma2

Sentence Embeddings
GPU 1x Nvidia L40S
$ 1.8
Author avatar
thenlper /

gte-large

Sentence Embeddings
TEI
Accelerated Text Embeddings Inference
GPU 1x Nvidia T4
$ 0.5
Author avatar
beethogedeon /

gte-Qwen2-7B-instruct-Q4_K_M-GGUF

Sentence Embeddings
Llama.cpp
Accelerated llama.cpp
Q4_K_M
GPU 1x Nvidia T4
$ 0.5
Author avatar
intfloat /

multilingual-e5-large

Sentence Embeddings
TEI
Accelerated Text Embeddings Inference
GPU 1x Nvidia T4
$ 0.5
Author avatar
intfloat /

multilingual-e5-large-instruct

Sentence Embeddings
TEI
Accelerated Text Embeddings Inference
GPU 1x Nvidia T4
$ 0.5
Author avatar
mixedbread-ai /

mxbai-embed-large-v1

Sentence Embeddings
TEI
Accelerated Text Embeddings Inference
GPU 1x Nvidia L4
$ 0.8
Author avatar
sentence-transformers /

paraphrase-multilingual-MiniLM-L12-v2

Sentence Embeddings
TEI
Accelerated Text Embeddings Inference
GPU 1x Nvidia L4
$ 0.8