Back to Catalog
Create Endpoint
Qwen

Qwen3-VL-30B-A3B-Thinking

Catalog model officially supported by Inference Endpoints.

This model is from our Model Catalog, and comes with an optimized configuration. Deployment has been verified by Hugging Face.

new-account
/
$5 / h
per running replica

Contact us if you'd like to request a custom solution or instance type.

Nvidia A100
2x GPUs · 160 GB 22x vCPUs · 290 GB
$5 / h
suggested
Best performance/price ratio for selected model and accelerator.
  • Only you can access your endpoint, using a Hugging Face Token generated from your personal account.