Updated on 2025-11-04 GMT+08:00

Managing Third-Party Model Deployment Tasks

Updating a Model

After completing the operations in Creating a Third-Party Model Deployment Task, you can replace the deployed model and upgrade the configuration. The procedure is as follows:

  1. Log in to ModelArts Studio Large Model Deveopment Platform. In the My Spaces area, click the required workspace.
    Figure 1 My Spaces
  2. In the navigation pane, choose Model Development > Model Deployment. Click the model name to go to the model details page.
  3. Click Update Model in the upper right corner.
  4. In the Modifiable Configuration > Deployed Model area, you can select a model to replace the deployed model.
  5. In the Upgrade Configuration area, select either of the following upgrade modes:
    • Full upgrade: Services of the old and new versions run at the same time until the new version completely replaces the old version. The old version can still be used before the new version is deployed. Twice the number of resources required by the service will be used for a one-time full upgrade.
    • Rolling upgrade: Some instance resources are reserved for rolling upgrade. Instances of the old version are stopped one by one or in batches and the new version is started. During the rolling upgrade, you can change the number of instances. If you choose to reduce the number of instances during the upgrade, the system deletes the old version first and then performs the upgrade. The old version is unavailable during the upgrade.

    After the upgrade configuration is complete, you need to restart the deployment task. The upgrade mode is the restart mode.

Modifying Deployment Configurations

After completing the operations in Creating a Third-Party Model Deployment Task, you can modify the description of the deployed model and upgrade the configuration, but cannot replace the model. The procedure is as follows:

  1. Log in to ModelArts Studio Large Model Deveopment Platform. In the My Spaces area, click the required workspace.
    Figure 2 My Spaces
  2. In the navigation pane, choose Model Development > Model Deployment. Click the model name to go to the model details page.
  3. Click Modify Deployment in the upper right corner.
  4. In the Modifiable Configuration area, modify the description of a deployed model if required.
  5. In the Upgrade Configuration area, select either of the following upgrade modes:
    • Full upgrade: Services of the old and new versions run at the same time until the new version completely replaces the old version. The old version can still be used before the new version is deployed. Twice the number of resources required by the service will be used for a one-time full upgrade.
    • Rolling upgrade: Some instance resources are reserved for rolling upgrade. Instances of the old version are stopped one by one or in batches and the new version is started. During the rolling upgrade, you can change the number of instances. If you choose to reduce the number of instances during the upgrade, the system deletes the old version first and then performs the upgrade. The old version is unavailable during the upgrade.

    After the upgrade configuration is complete, you need to restart the deployment task. The upgrade mode is the restart mode.

Scaling Model Deployment Instances

After the operations in Creating a Third-Party Model Deployment Task are complete, you can scale in or out the instances where the model has been deployed. The procedure is as follows:

  1. Log in to ModelArts Studio Large Model Deveopment Platform. In the My Spaces area, click the required workspace.
    Figure 3 My Spaces
  2. In the navigation pane, choose Model Development > Model Deployment. Click the model name to go to the model details page.
  3. Click Scale in the upper right corner. On the page that is displayed, change the number of instances and click OK.

Managing Model Deployment Tasks

In the navigation pane, choose Model Development > Model Deployment. In the model service list, you can start, stop, call, and delete a service.

  1. Log in to ModelArts Studio Large Model Deveopment Platform. In the My Spaces area, click the required workspace.
    Figure 4 My Spaces
  2. In the navigation pane, choose Model Development > Model Deployment. Perform the following operations:
    • Start: Click Start in the Operation column to start a stopped model service. After the model service is started, it can be called for inference.
    • Stop: Click Stop in the Operation column to stop a model service in the Running state.
    • Call Statistics: Choose More > Call Statistics in the Operation column to view the call statistics of the model, including the number of calls and token statistics.
    • Delete: Click More > Delete in the Operation column to delete unnecessary model deployment tasks.

      Deleting a training job is a high-risk operation. Before deleting a job, ensure that it is no longer needed.