Help Center> ModelArts> FAQs> Service Deployment> Service Deployment> Functional Consulting> What Is the Maximum Size of a Prediction Request Body?
Updated on 2023-09-05 GMT+08:00

What Is the Maximum Size of a Prediction Request Body?

After a service is deployed and running, you can send an inference request to the service. The requested content can be text, images, voice, or videos, depending on the model of the service.

If you use the inference request address (URL of HUAWEI CLOUD APIG) displayed on the Usage Guides tab of the service details page for prediction, the maximum size of the request body is 12 MB. If the request body is oversized, the request will be intercepted.

If you perform the prediction on the Prediction tab of the service details page, the size of the request body cannot exceed 8 MB. The size limit varies between the two tab pages because they use different network links.

Ensure that the size of a request body does not exceed the upper limit. If there are high-concurrency and heavy-traffic inference requests, submit a service ticket to professional service support.

Functional Consulting FAQs

more