Deploy LLaMA2-7B-ZH-Chat-52k
Explore our Inference Catalog to deploy popular models on optimized configuration.
- The Endpoint is available from the Internet, and secured with TLS/SSL.
- Only you can access it, using a Hugging Face Token generated from your personal account.
Number of replicas
Automatically scale the number of replicas within Min and Max based on compute usage. Min is always 0 if Scale-To-Zero is active.
More options
Autoscaling Strategy
Control what type of trigger will cause your Endpoint to scale up.
Default Env
Environment variables that will be provided to your container during deployment.
Key
Value
Secret Env
Same as Default, but people with access to this endpoint will not be able to read these values after creation.
Key
Value
Commit Revision optional
Specify a revision commit hash for the Hugging Face repository
Container Arguments optional
Arguments passed to the container entrypoint.
Container Command optional
Command executed in the container.