Inference Endpoints

Model Catalog

10 items
Applied Filters: GPU, TEI, Sentence Embeddings
All listed models are Sentence Embeddings models served with TEI (Accelerated Text Embeddings Inference). Prices are per hour.

  • sentence-transformers/all-MiniLM-L6-v2, GPU 1x Nvidia T4, $0.5 / hour
  • sentence-transformers/all-mpnet-base-v2, GPU 1x Nvidia T4, $0.5 / hour
  • BAAI/bge-base-en-v1.5, GPU 1x Nvidia T4, $0.5 / hour
  • BAAI/bge-large-en-v1.5, GPU 1x Nvidia L4, $0.8 / hour
  • thenlper/gte-large, GPU 1x Nvidia T4, $0.5 / hour
  • thenlper/gte-large-zh, GPU 1x Nvidia T4, $0.5 / hour
  • intfloat/multilingual-e5-large, GPU 1x Nvidia T4, $0.5 / hour
  • intfloat/multilingual-e5-large-instruct, GPU 1x Nvidia T4, $0.5 / hour
  • mixedbread-ai/mxbai-embed-large-v1, GPU 1x Nvidia L4, $0.8 / hour
  • sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2, GPU 1x Nvidia L4, $0.8 / hour