Subscribing to a Built-in Commercial Service in ModelArts Studio (MaaS)

MaaS offers commercial services that give businesses high-performance, reliable inference APIs. Billing is based on token usage. Commercial services fit situations needing high stability, frequent calls, and expert support.

Operation Scenarios

Enterprise intelligent customer service: Enterprises hope to optimize their customer service systems by using inference APIs to achieve intelligent Q&A and intent recognition, thereby improving customer service efficiency and customer satisfaction.
Content creation assistance: Media and advertising companies leverage inference APIs for copywriting and creative generation, enhancing the efficiency and quality of content production.
Intelligent data analysis: Financial and e-commerce enterprises use inference APIs to conduct in-depth analysis of massive amounts of data, uncovering data value and aiding in decision-making.

Constraints

This function is available only in the CN-Hong Kong region.
Subscribing to a commercial service will subscribe all versions of the service. You cannot subscribe to just one version alone.
Currently, commercial services cannot be disabled.

Billing

Model inference services process input by converting it into identifiable tokens. Using a built-in MaaS service charges you based on token count. For details, see Model Inference Billing Items.

Coupon Instructions

When there is a discount offer, a relevant prompt will appear on the commercial service tab of the built-in service page. When subscribing to a commercial service, available coupons will be claimed by default. During the billing process, coupons will be applied first.
The scope of application and eligibility criteria for different coupon activities vary. Whether you can successfully claim a coupon depends on the actual activity rules.
For information on the issuance and usage of discount coupons for model services, go to Billing Center > Promotions > Coupons.

Service Calls

The platform can send requests to different regions as needed.

Subscribing to a Commercial Service

Log in to the ModelArts Studio (MaaS) console and select the target region on the top navigation bar.
In the navigation pane on the left, choose Real-Time Inference.
Choose Built-in Services > Commercial Services. In the Operation column of the target service, click Subscribe.

In the displayed dialog box, select commercial services as required, select I have read and agree to this statement and ModelArts Studio Service Agreement, and click Subscribe.

Selecting a commercial service will subscribe all model versions of the service.

The table below lists commercial service list parameters.

**Table 1** Commercial service list parameters
Parameter	Description
Name	Name of the commercial service. To view the service version and model parameter, click on the left of the service name to view the service version. The name displayed in the model Parameter column can be used for model calls. Figure 1 Viewing the service version
Subscription Status	Subscribed: The commercial service is enabled. Not Subscribed: The commercial service is not enabled.
Type	Type of the commercial service.
Billing Mode	Billing mode of the commercial service. The billing mode varies depending on the model. For details, see Billing Item (ModelArts Studio).
Infer Price	Inference pricing of the commercial service. The inference pricing varies depending on the model. For details, see Billing Item (ModelArts Studio). In the upper right corner of the page, enable or disable Million tokens unit display. Figure 2 Million tokens unit display If Million tokens unit display is disabled, the price is displayed by thousand tokens. Figure 3 K tokens If Million tokens unit display is enabled, the price is displayed by million tokens. Figure 4 Million tokens
Discounts	Discounts available for the commercial service. -- indicates that there is no discount.
Model flow limiting	Total quota for accessing all services under the same model within the current account. TPM: The number of tokens processed per minute, including both input and output. RPM: The number of requests processed per minute.
Call Statistics	Click to go to the Service Call Details page and view the call data and monitoring metric details of the commercial service in a specified period. For more information, see Viewing the Call Data and Monitoring Metrics of Real-Time Inference on ModelArts Studio (MaaS).
Operation	Operations supported by the commercial service. Unsubscribe: If this button is unavailable, the service cannot be disabled. No fee is generated when the service is not used. View Call Description: Click View Call Description, select a service version, and view the information and procedure for calling the commercial service. For more information, see Calling a Model Service in ModelArts Studio (MaaS).

Traffic Control Rules

To ensure fairness in users' calls to the model, MaaS has set up basic traffic limiting. If the limit is exceeded, API requests will fail, and you will need to wait until the traffic limiting conditions are lifted before making another call.

Tokens Per Minute (TPM): The number of tokens (input + output) processed per minute.
Requests Per Minute (RPM): The number of requests processed per minute.

If the model service has an RPM of 300, it means that up to 10 requests can be processed per second (300/30 = 10). If a user sends 300 requests in 1 second, it will far exceed the service's processing capacity, leading to request failures.

It is recommended to send API requests evenly to avoid sending a large number of requests in a short period. According to the API gateway's rate limiting mechanism, if the number of requests in one second exceeds RPM/30 x 1, the excess requests may trigger the API gateway's rate limiting, resulting in request failures and the return of error code 429 (Too Many Requests).

Notes:

Even if users send requests at a rate of RPM/30, network latency and the uncertainty of request arrival times may still result in a small number of failed requests. The actual time of request arrival at the server is the determining factor, not the time when the request was sent.

About Arrears

If you are in arrears when using a model service, the resource is frozen and enters the grace period. The resource status changes to Frozen. At this point, the frozen resources cannot be used, and new models that have not been subscribed to will not be supported for subscription.

Both the grace and retention periods are 15 days. After the retention period expires, if you still have not paid the account arrears, the related resources and order records under your account for this model will be cleared, and the data cannot be recovered. To use the model service again, pay off the outstanding amount and subscribe to the model service again.

Check your account balance and top up your account in time to ensure stable online services.

Figure 5 Periods of pay-per-use resources
Click to enlarge

FAQs

Is there any billing example?
For details about billing items and billing examples, see MaaS Model Inference Billing Items.
Can a paid service be disabled after subscription?
Paid services cannot be disabled. No expenditures are generated for unused services.
What should I do if the subscription status is frozen when I use a commercial service?
At this point, the frozen resources cannot be used, and new models that have not been subscribed to will not be supported for subscription. You can unfreeze the resources by recharging your account. The frozen resource instances will be restored for use, and models that have not been subscribed to will support subscription. For details, see Top-Up and Payment.