Inference
Endpoints
Catalog
Docs
Support
Log In
Search
Inference Task
All Available Tasks
Text Generation
Text-to-Image
Image-Text-to-Text
Sentence Embeddings
Sentence Ranking
Zero Shot Classification
Automatic Speech Recognition
Summarization
Feature Extraction
Price
$ 0 - 50 / hour
0
0.1
0.5
1
5
50
Inference Server
All
Llama.cpp
TEI
TGI
Hardware Accelerator
ALL
CPU
GPU
License
License:
All
Hub Models
Browse All Models
Model Catalog
1 items
Deploy from Hugging Face
Applied Filters
GPU
LLAMACPP
Sentence Ranking
Clear All
Order by:
Name
ggml-org /
jina-reranker-v1-turbo-en-GGUF
Sentence Ranking
Llama.cpp
Accelerated llama.cpp
F16
GPU
1x Nvidia T4
$
0.5