Configuring a Model
The LLM Engine provides multiple common models for users to select from based on the actual service and budget.
Prerequisites
You have contacted the system administrator to enable the LLM Engine feature for the tenant.
Procedure
- Sign in to the AICC as a tenant space administrator, choose , and bind a model.
- On the Large Model tab, click
. The Adding a Model page is displayed.
Figure 1 Adding a modelParameter
Description
Name
This parameter is user-defined. It is recommended that this parameter be related to the model type for future use.
Type
Select a model based on the actual service.
Version
Large language model version which is set as required.
Using azure_openai as an example
api_base
URL for requesting a large model.
api_key
- Function: api_key is a key used for identity authentication. It needs to be provided each time a request is sent to the OpenAI API. This is the key to ensuring that only authorized users can access the API service.
- Obtaining method: To obtain the value of api_key, you need to register with the OpenAI official website and create an account. After you create an account and log in to the system, you can find your API key on the OpenAI user console.
- Security: api_key is sensitive information and must be kept properly. Do not expose your API key in public code libraries or client codes to prevent unnecessary fees and security risks.
api_version
- Function: api_version allows users to specify the version to be used through an API request. This is especially important when the API is updated, because a new version may introduce new functions or change existing functions. If you want to use an API of a specific version or ensure backward compatibility, set api_version.
- Obtaining method: The API version information can be found in the official documents or release notes of OpenAI. The API version may be displayed in the URL of the API or the specific version information may be provided in the API document.
- Setting method: In some API client libraries or invocations, you may need to specify the API version in the request configuration. For example, you may set the version number in the request URL or request header, or set api_version in the configuration of some libraries.
deployment_id
- Function: deployment_id is used to specify a specific model deployment instance. This parameter is used to select a specific deployment for API invocation when multiple models or versions are deployed.
- Model management: During model management and O&M, deployment_id allows users to control and reference different model deployments. This is very important for managing multiple model versions or performing A/B testing of models in the PR environment.
- Click Save. The large model is created successfully.
Feedback
Was this page helpful?
Provide feedbackThank you very much for your feedback. We will continue working to improve the documentation.See the reply and handling status in My Cloud VOC.
For any further questions, feel free to contact us through the chatbot.
Chatbot