Inference
Endpoints
Catalog
Log In
Inference Task
All Available Tasks
Text Generation
Text-to-Image
Image-Text-to-Text
Sentence Embeddings
Sentence Similarity
Text Ranking
Automatic Speech Recognition
Feature Extraction
Price
$ 0 - 50 / hour
0
0.1
0.5
1
5
50
Inference Engine
All
Llama.cpp
TEI
vLLM
SGLang
Hardware Accelerator
ALL
CPU
GPU
INF2
License
License:
All
Hub Models
Browse All Models
Model Catalog
2 items
Deploy from Hugging Face
Applied Filters
NEURON
Feature Extraction
Clear All
Order by:
Most Recent
Qwen /
Qwen3-Embedding-4B
1
Deployed 1 time
Feature Extraction
vLLM
Accelerated vLLM
INF2
2x Cores
$
1.95
Qwen /
Qwen3-Embedding-0.6B
Feature Extraction
vLLM
Accelerated vLLM
INF2
2x Cores
$
1.95