What Is the Maximum Size of a Prediction Request Body?
After a service is deployed and running, you can send inference requests to it. The request content can be text, images, audio, or video, depending on the model that the service uses.
If you send prediction requests to the inference request address (the APIG URL) displayed on the Usage Guides tab of the service details page, the request body can be at most 12 MB. A request with a larger body is intercepted.
If you run the prediction on the Prediction tab of the service details page, the request body cannot exceed 8 MB. The limit differs between the two tabs because they use different network links.
Ensure that the request body does not exceed the limit that applies to the method you use. If you need to handle high-concurrency, heavy-traffic inference requests, submit a service ticket to contact professional service support.
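The check described above can be done on the client before a request is sent. The following is a minimal sketch, not an official SDK call: the function name body_size_ok and the channel names apig_url and prediction_tab are hypothetical, and treating 1 MB as 1024 * 1024 bytes is an assumption, because the limits above are stated only in MB.

```python
import json

# Limits in MB, taken from the text above.
# ASSUMPTION: 1 MB is treated as 1024 * 1024 bytes; the documentation does
# not say which definition of MB applies.
BYTES_PER_MB = 1024 * 1024
LIMITS_MB = {
    "apig_url": 12,        # inference request address on the Usage Guides tab
    "prediction_tab": 8,   # Prediction tab of the service details page
}


def body_size_ok(body: bytes, channel: str = "apig_url") -> bool:
    """Return True if the encoded body fits within the limit for the channel.

    Both the function and the channel names are hypothetical helpers used
    only for illustration.
    """
    return len(body) <= LIMITS_MB[channel] * BYTES_PER_MB


# Measure the size of the bytes that would actually be sent, not the
# size of the Python object.
payload = json.dumps({"text": "hello"}).encode("utf-8")
if not body_size_ok(payload, "apig_url"):
    raise ValueError("Request body exceeds the 12 MB limit")
```

If the exact byte definition matters for your workload, leaving some headroom below the limit is the safer choice.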