Updated on 2024-08-14 GMT+08:00

Parameter Overview

You can use ServiceStep to create a service deployment phase. The following tables describe the parameters of ServiceStep and of the objects it uses.

Table 1 ServiceStep

| Parameter | Description | Mandatory | Data Type |
|---|---|---|---|
| name | Name of a service deployment phase. The name contains a maximum of 64 characters, including only letters, digits, underscores (_), and hyphens (-). It must start with a letter and must be unique in a workflow. | Yes | str |
| inputs | Inputs of the service deployment phase | No | ServiceInput or ServiceInput list |
| outputs | Outputs of the service deployment phase | Yes | ServiceOutput or ServiceOutput list |
| title | Title for frontend display | No | str |
| description | Description of the service deployment phase | No | str |
| policy | Phase execution policy | No | StepPolicy |
| depend_steps | Dependency phases | No | Step or step list |
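
The naming rule given for name in Table 1 can be checked locally before a workflow is defined. The following is a minimal sketch in plain Python. It is not part of the ModelArts SDK: the helper is_valid_phase_name is hypothetical, and it assumes that "letters" means ASCII letters.

```python
import re

# Hypothetical helper, not part of the ModelArts SDK.
# Assumption: "letters" means ASCII letters (A-Z, a-z).
# One leading letter plus up to 63 further allowed characters
# gives a maximum length of 64 characters.
_PHASE_NAME = re.compile(r"[A-Za-z][A-Za-z0-9_-]{0,63}")


def is_valid_phase_name(name: str) -> bool:
    """Return True if name satisfies the rule described for the name parameter."""
    return _PHASE_NAME.fullmatch(name) is not None
```

The same pattern would also fit the name parameters of ServiceInput and ServiceOutput, which state the same rule.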

Table 2 ServiceInput

| Parameter | Description | Mandatory | Data Type |
|---|---|---|---|
| name | Input name of the service deployment phase. The name can contain a maximum of 64 characters, including only letters, digits, underscores (_), and hyphens (-), and must start with a letter. The input name of a step must be unique. | Yes | str |
| data | Input data object of the service deployment phase | Yes | Model list or service object. Currently, only ServiceInputPlaceholder, ServiceData, and ServiceUpdatePlaceholder are supported. |

Table 3 ServiceOutput

| Parameter | Description | Mandatory | Data Type |
|---|---|---|---|
| name | Output name of the service deployment phase. The name can contain a maximum of 64 characters, including only letters, digits, underscores (_), and hyphens (-), and must start with a letter. The output name of a step must be unique. | Yes | str |
| service_config | Configurations for service deployment | Yes | ServiceConfig |
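
Tables 2 and 3 require the input names and the output names of a step to be unique. A minimal sketch of such a check follows. It works on plain strings rather than on SDK objects, and the helper find_duplicate_names is hypothetical.

```python
from collections import Counter


def find_duplicate_names(names):
    """Return the names that occur more than once, in first-seen order."""
    counts = Counter(names)  # Counter keeps insertion order of first occurrence
    return [name for name in counts if counts[name] > 1]
```

An empty result means that every name in the list is unique.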

Table 4 ServiceConfig

| Parameter | Description | Mandatory | Data Type |
|---|---|---|---|
| infer_type | Inference mode. The value can be real-time, batch, or edge. The default value is real-time. real-time: real-time service; the model is deployed as a web service. batch: batch service; a batch service can perform inference on batch data and automatically stops after data processing is completed. edge: edge service; a model is deployed as a web service on an edge node through IEF. Create an edge node on IEF beforehand. | Yes | str |
| service_name | Service name. Enter 1 to 64 characters. Only letters, digits, hyphens (-), and underscores (_) are allowed. NOTE: If you do not specify this parameter, the default service name is generated automatically. | No | str, Placeholder |
| description | Service description, which contains a maximum of 100 characters. By default, this parameter is left blank. | No | str |
| vpc_id | ID of the VPC to which a real-time service instance is deployed. By default, this parameter is left blank. In this case, ModelArts allocates a dedicated VPC to each user, and users are isolated from each other. To access other service components in the VPC of the service instance, set this parameter to the ID of the corresponding VPC. Once a VPC is configured, it cannot be modified. If both vpc_id and cluster_id are configured, only the dedicated resource pool takes effect. | No | str |
| subnet_network_id | ID of a subnet. By default, this parameter is left blank. This parameter is mandatory when vpc_id is configured. Enter the network ID displayed in the subnet details on the VPC management console. A subnet provides dedicated network resources that are isolated from other networks. | No | str |
| security_group_id | Security group. By default, this parameter is left blank. This parameter is mandatory when vpc_id is configured. A security group is a virtual firewall that provides secure network access control policies for service instances. A security group must contain at least one inbound rule to permit the requests whose protocol is TCP, source address is 0.0.0.0/0, and port number is 8080. | No | str |
| cluster_id | ID of a dedicated resource pool. By default, this parameter is left blank, indicating that no dedicated resource pool is used. When using a dedicated resource pool to deploy services, ensure that the cluster is running properly. After this parameter is configured, the network configuration of the cluster is used, and the vpc_id parameter does not take effect. If both this parameter and cluster_id in real-time config are configured, cluster_id in real-time config is preferentially used. | No | str |
| additional_properties | Additional configurations | No | dict |
| apps | Whether to enable application authentication for service deployment. Multiple application names can be entered. | No | str, Placeholder, list |
| envs | Environment variables | No | dict |
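
Several parameters in Table 4 depend on each other, for example vpc_id with subnet_network_id and security_group_id. The following is a minimal sketch of a local pre-check that mirrors only the constraints stated in the table. It is plain Python, not the ModelArts SDK: the function check_service_config is hypothetical, it takes plain values instead of a ServiceConfig object, and it assumes that "letters" means ASCII letters.

```python
import re
from typing import List, Optional

# Hypothetical checker, not part of the ModelArts SDK.
_INFER_TYPES = {"real-time", "batch", "edge"}
# Assumption: "letters" means ASCII letters; 1 to 64 characters in total.
_SERVICE_NAME = re.compile(r"[A-Za-z0-9_-]{1,64}")


def check_service_config(
    infer_type: str = "real-time",
    service_name: Optional[str] = None,
    description: str = "",
    vpc_id: Optional[str] = None,
    subnet_network_id: Optional[str] = None,
    security_group_id: Optional[str] = None,
    cluster_id: Optional[str] = None,
) -> List[str]:
    """Return a list of findings; an empty list means nothing was flagged."""
    findings = []
    if infer_type not in _INFER_TYPES:
        findings.append("infer_type must be real-time, batch, or edge")
    # service_name is optional; a default name is generated when it is omitted.
    if service_name is not None and not _SERVICE_NAME.fullmatch(service_name):
        findings.append(
            "service_name must be 1 to 64 letters, digits, hyphens, or underscores"
        )
    if len(description) > 100:
        findings.append("description must not exceed 100 characters")
    if vpc_id and not (subnet_network_id and security_group_id):
        findings.append(
            "subnet_network_id and security_group_id are mandatory when vpc_id is set"
        )
    if vpc_id and cluster_id:
        findings.append("vpc_id does not take effect because cluster_id is set")
    return findings
```

For instance, check_service_config(vpc_id="vpc-1") reports the missing subnet and security group, while a call with no arguments returns an empty list.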

Example:

example = ServiceConfig()
# This object is used in the output of the service deployment phase.

Unless you have special requirements, use the default values.