Deploy Llama-3.1-70B-Instruct

Catalog model officially supported by Inference Endpoints.

This model is from our Inference Catalog and comes with an optimized configuration.

More options
Commit Revision
Specify a revision commit hash for the Hugging Face repository
optional

Contact us if you'd like to request a custom solution or instance type.

N. Virginia us-east-1
  • The Endpoint is available from the Internet, and secured with TLS/SSL.
  • Only you can access it, using a Hugging Face Token generated from your personal account.
$8.30 / h
per running replica