Subscribing to a Built-in Commercial Service in ModelArts Studio (MaaS)
MaaS offers commercial services that give businesses high-performance, reliable inference APIs. Billing is based on token usage. Commercial services fit situations needing high stability, frequent calls, and expert support.
Operation Scenarios
- Enterprise intelligent customer service: Enterprises use inference APIs for intelligent Q&A and intent recognition in their customer service systems, improving service efficiency and customer satisfaction.
- Content creation assistance: Media and advertising companies leverage inference APIs for copywriting and creative generation, enhancing the efficiency and quality of content production.
- Intelligent data analysis: Financial and e-commerce enterprises use inference APIs to conduct in-depth analysis of massive amounts of data, uncovering data value and aiding in decision-making.
Constraints
- This function is available only in the CN-Hong Kong region.
- Subscribing to a commercial service subscribes you to all versions of that service. You cannot subscribe to a single version.
- Currently, commercial services cannot be disabled.
Billing
Model inference services process input by converting it into tokens. You are charged for a built-in MaaS service based on the number of tokens used. For details, see Model Inference Billing Items.
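As a rough illustration of token-based billing, the sketch below estimates the cost of one call from its token counts. It assumes that input and output tokens are priced separately per 1,000 tokens; that pricing structure, the function name `estimate_cost`, and the placeholder prices in the usage comment are hypothetical and are not taken from this page. The actual billing items and rates are listed in Model Inference Billing Items, and coupons are not modeled here.

```python
from decimal import Decimal


def estimate_cost(
    input_tokens: int,
    output_tokens: int,
    input_price_per_1k: Decimal,
    output_price_per_1k: Decimal,
) -> Decimal:
    """Estimate the cost of one call from its token counts.

    Assumption (not from the source): prices are quoted per 1,000 tokens,
    with separate rates for input and output tokens.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    thousand = Decimal(1000)
    input_cost = Decimal(input_tokens) / thousand * input_price_per_1k
    output_cost = Decimal(output_tokens) / thousand * output_price_per_1k
    return input_cost + output_cost


# Usage with placeholder prices (hypothetical values, for illustration only):
# estimate_cost(1000, 500, Decimal("0.002"), Decimal("0.006"))
```

Decimal is used instead of float so that small per-token amounts are not distorted by binary rounding when many calls are summed.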
Coupon Instructions
- When a discount offer is available, a prompt appears on the commercial service tab of the built-in service page. When you subscribe to a commercial service, available coupons are claimed by default. During billing, coupons are applied first.
- The scope of application and eligibility criteria for different coupon activities vary. Whether you can successfully claim a coupon depends on the actual activity rules.
- For information on the issuance and usage of discount coupons for model services, go to Billing Center > Promotions > Coupons.
Service Calls
The platform can send requests to different regions as needed.
Subscribing to a Commercial Service
- Log in to the ModelArts Studio (MaaS) console and select the target region on the top navigation bar.
- In the navigation pane on the left, choose Real-Time Inference.
- Choose Built-in Services > Commercial Services. In the Operation column of the target service, click Subscribe.
- In the displayed dialog box, select commercial services as required (all services are selected by default), select I have read and agree to this statement and ModelArts Studio Service Agreement, and click Subscribe.
Selecting a commercial service subscribes you to all model versions of the service.
The table below lists commercial service list parameters.
Table 1 Commercial service list parameters

| Parameter | Description |
| --- | --- |
| Name | Name of the commercial service. Click the icon to the left of the service name to view the service version. The name displayed in the model Parameter column can be used for model calls. See Figure 1 Viewing the service version. |
| Subscription Status | Subscribed: The commercial service is enabled. Not Subscribed: The commercial service is not enabled. |
| Type | Type of the commercial service. |
| Billing Mode | Billing mode of the commercial service. The billing mode varies depending on the model. For details, see Billing Item (ModelArts Studio). |
| Infer Price | Inference pricing of the commercial service. The inference pricing varies depending on the model. For details, see Billing Item (ModelArts Studio). |
| Discounts | Discounts available for the commercial service. -- indicates that there is no discount. |
| Model flow limiting | Total quota for accessing all services under the same model within the current account. TPM: The number of tokens processed per minute, including both input and output. RPM: The number of requests processed per minute. |
| Call Statistics | Click the icon to go to the Service Call Details page and view the call data and monitoring metric details of the commercial service in a specified period. For more information, see Viewing the Call Data and Monitoring Metrics of Real-Time Inference on ModelArts Studio (MaaS). |
| Operation | Operations supported by the commercial service. Unsubscribe: If this button is unavailable, the service cannot be disabled. No fee is generated when the service is not used. View Call Description: Click View Call Description, select a service version, and view the information and procedure for calling the commercial service. For more information, see Calling a Model Service in ModelArts Studio (MaaS). |
Traffic Control Rules
To keep access fair across users, MaaS applies basic traffic limiting. If a limit is exceeded, API requests fail, and you must wait until the limit no longer applies before making another call.
- Tokens Per Minute (TPM): The number of tokens (input + output) processed per minute.
- Requests Per Minute (RPM): The number of requests processed per minute.
For example, if a model service has an RPM of 300, it can process up to 5 requests per second on average (300/60 = 5). If a user sends 300 requests in 1 second, the burst far exceeds the service's processing capacity and requests fail.
Send API requests evenly and avoid sending a large number of requests in a short period. Under the API gateway's rate limiting mechanism, if the number of requests in one second exceeds RPM/60, the excess requests may be rate limited and fail with error code 429 (Too Many Requests).
Notes:
Even if you send requests at a rate of RPM/60 per second, network latency and variation in request arrival times may still cause a small number of requests to fail. What counts is the time a request arrives at the server, not the time it was sent.
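The pacing advice above can be sketched on the client side as follows. Because a minute has 60 seconds, an RPM limit corresponds to at most RPM/60 requests per second on average, or one request every 60/RPM seconds. This is a minimal sketch, not a MaaS SDK: the names `min_interval_seconds`, `Pacer`, and `call_with_retry` are hypothetical, `send` stands for any caller-supplied function that performs one request and returns its HTTP status code, and the exponential backoff schedule is an assumption rather than a documented MaaS recommendation.

```python
import time
from typing import Callable, Optional


def min_interval_seconds(rpm: int) -> float:
    """Gap between requests that spreads `rpm` requests evenly over a minute."""
    if rpm <= 0:
        raise ValueError("rpm must be positive")
    return 60.0 / rpm


class Pacer:
    """Blocks so that successive calls to wait() are at least `interval` apart."""

    def __init__(
        self,
        interval: float,
        clock: Callable[[], float] = time.monotonic,
        sleep: Callable[[float], None] = time.sleep,
    ) -> None:
        self.interval = interval
        self._clock = clock
        self._sleep = sleep
        self._next_slot: Optional[float] = None  # earliest time of the next send

    def wait(self) -> None:
        now = self._clock()
        if self._next_slot is not None and now < self._next_slot:
            self._sleep(self._next_slot - now)
            now = self._next_slot
        self._next_slot = now + self.interval


def call_with_retry(
    send: Callable[[], int],
    max_retries: int = 5,
    base_delay: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> int:
    """Call send() and retry with exponential backoff while it returns 429.

    `send` is a hypothetical zero-argument callable that returns an HTTP
    status code. The last status is returned once retries are exhausted.
    """
    status = send()
    for attempt in range(max_retries):
        if status != 429:
            break
        sleep(base_delay * (2 ** attempt))  # 1x, 2x, 4x, ... base_delay
        status = send()
    return status


# Usage sketch: pace calls for a service with an RPM of 300.
# pacer = Pacer(min_interval_seconds(300))
# for request in requests_to_send:
#     pacer.wait()
#     call_with_retry(lambda: send_one(request))  # send_one is hypothetical
```

Pacing reduces the chance of hitting the limit, and the retry handles the small number of 429 responses that can still occur because of network latency. Staying slightly below the limit leaves extra headroom.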
About Arrears
If your account falls into arrears while you are using a model service, the resource is frozen and enters the grace period, and the resource status changes to Frozen. Frozen resources cannot be used, and you cannot subscribe to models that you have not already subscribed to.
Both the grace and retention periods are 15 days. After the retention period expires, if you still have not paid the account arrears, the related resources and order records under your account for this model will be cleared, and the data cannot be recovered. To use the model service again, pay off the outstanding amount and subscribe to the model service again.
Check your account balance and top up your account in time to ensure stable online services.
FAQs
- Is there any billing example?
For details about billing items and billing examples, see MaaS Model Inference Billing Items.
- Can a paid service be disabled after subscription?
Paid services cannot be disabled. Unused services do not incur fees.
- What should I do if the model status is frozen when I use a commercial service?
Frozen resources cannot be used, and you cannot subscribe to models that you have not already subscribed to. To unfreeze the resources, top up your account. The frozen resource instances then become usable again, and you can subscribe to new models. For details, see Top-Up and Payment.