Inference
Endpoints
Catalog
Docs
Support
Log In
Search
Inference Task
All Available Tasks
Text Generation
Text-to-Image
Image-Text-to-Text
Sentence Embeddings
Sentence Ranking
Zero Shot Classification
Automatic Speech Recognition
Summarization
Feature Extraction
Price
$ 0 - 50 / hour
0
0.1
0.5
1
5
50
Inference Server
All
Llama.cpp
TEI
TGI
Hardware Accelerator
ALL
CPU
GPU
License
License:
All
Hub Models
Browse All Models
Model Catalog
1 items
Deploy from Hugging Face
Applied Filters
GPU
LLAMACPP
Sentence Embeddings
Clear All
Order by:
Name
beethogedeon /
gte-Qwen2-7B-instruct-Q4_K_M-GGUF
Sentence Embeddings
Llama.cpp
Accelerated llama.cpp
Q4_K_M
GPU
1x Nvidia T4
$
0.5