Ray and XDS Billing Items
Fabric implements distinct billing policies based on its Ray and inference service scenarios. For details, see Table 1.
Billing Item |
Billing description |
---|---|
Ray resources |
You are billed based on the specifications and quantity of Ray resources provisioned. Pricing varies by Data Processing Unit (DPU) or AI Compute Unit (ACU) specifications. Both yearly/monthly and pay-per-use billing modes are available. |
Model compute unit hours |
Billing is based on the compute unit hours consumed by model instances deployed on inference endpoints. This item supports pay-per-use billing. The cost is calculated as: (Number of model instances under an inference endpoint) × (Number of compute units) × (Usage duration reported in seconds). Refer to Common Models for specific compute unit requirements of different base models. |
Feedback
Was this page helpful?
Provide feedbackThank you very much for your feedback. We will continue working to improve the documentation.See the reply and handling status in My Cloud VOC.
For any further questions, feel free to contact us through the chatbot.
Chatbot