Updated on 2024-04-19 GMT+08:00

Billing Overview

DLI billing includes multiple billed items, each with its own billing mode.
Figure 1 DLI's billing structure

Billed Items

DLI bills for compute resources, storage resources, and scanned data volume. DLI also provides a price calculator that you can use to quickly estimate the reference price of the resources you plan to purchase.

For more information about DLI billing, see DLI Pricing Details.

Table 1 DLI billed items

Billed Item

Description

Billing for compute resources

You can be billed for DLI compute resources in either of the following ways:
  • By the amount of compute resources used, in CU-hours (CUH)
  • By package

Billing for storage resources

  • Storage usage is billed based on the amount of table data stored in DLI, in GB.
  • When estimating storage costs, note that DLI typically compresses data to about 1/5 of the original file size. DLI storage is billed based on the compressed size.
  • When data is stored on OBS, storage charges are billed by OBS, not DLI.

Billing for scanned data

  • You are billed based on the scanned data volume in each job, in GB.
  • Currently, only jobs submitted in the default queue are billed based on the scanned data volume.
  • CU is the pricing unit for elastic resource pools. One CU consists of 1 vCPU and 4 GB of memory. Queues with more CUs provide higher compute performance.
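The CU definition and the storage compression note above can be turned into a quick estimate. The following is a minimal sketch, not an official DLI tool: the function names are hypothetical, and the compression factor of 5 is only the typical value quoted above, so actual billed storage may differ.

```python
def billed_storage_gb(original_gb: float, compression_factor: float = 5.0) -> float:
    """Estimate the storage volume DLI bills for.

    Assumes the typical compression to 1/5 of the original file size;
    real compression depends on the data.
    """
    return original_gb / compression_factor


def pool_capacity(cus: int) -> tuple[int, int]:
    """Return (vCPUs, memory in GB) for a number of CUs (1 CU = 1 vCPU + 4 GB)."""
    return cus * 1, cus * 4


def cu_hours(cus: int, hours: float) -> float:
    """Return the CU-hours (CUH) consumed by running `cus` CUs for `hours` hours."""
    return cus * hours


if __name__ == "__main__":
    print(billed_storage_gb(500))  # estimated billed GB for 500 GB of raw files
    print(pool_capacity(16))       # vCPUs and memory of a 16-CU queue
    print(cu_hours(16, 2.5))       # CUH used by 16 CUs over 2.5 hours
```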

Billing Modes

DLI billing modes vary depending on billed items. See Table 2.

Table 2 DLI billing modes

Billed Item

Billing Mode

Description

Compute resources

Pay-per-use

Pay-per-use queues or elastic resource pools are billed by CUH.

Computing fee = Unit price x Number of CUs x Number of hours

CUH package

DLI allows you to purchase elastic resource pools in the form of CUH packages.

The included quota will reset based on the subscription period.

Data stored in DLI tables

Pay-per-use

Storage usage is billed based on the amount of data stored in DLI, in GB.

Storage fee = Unit price x Data volume (in GB) x Number of hours

Storage package

If you purchase a storage package, the quota included in the package is deducted first when you use resources. Usage that exceeds the package quota is billed on a pay-per-use basis. The included quota resets every hour.

Scanned data volume

Pay-per-use

Fees for scanned data volume = Unit price x Scanned data volume (in GB)

Scanned data volume package

If you purchase a scanned data volume package, the quota included in the package is deducted first when you use resources. Usage that exceeds the package quota is billed on a pay-per-use basis. The quota included in the scanned data volume package resets based on the subscription period.
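The fee formulas and the package deduction rule in Table 2 can be sketched as follows. This is an illustrative sketch only: the function names are hypothetical, the unit prices in the usage example are made-up placeholders rather than real DLI prices, and `Decimal` is used simply to avoid floating-point rounding in money calculations.

```python
from decimal import Decimal


def compute_fee(unit_price: Decimal, cus: int, hours: Decimal) -> Decimal:
    """Computing fee = Unit price x Number of CUs x Number of hours."""
    return unit_price * cus * hours


def storage_fee(unit_price: Decimal, data_gb: Decimal, hours: Decimal) -> Decimal:
    """Storage fee = Unit price x Data volume (in GB) x Number of hours."""
    return unit_price * data_gb * hours


def scan_fee(unit_price: Decimal, scanned_gb: Decimal) -> Decimal:
    """Scanned data fee = Unit price x Scanned data volume (in GB)."""
    return unit_price * scanned_gb


def apply_package(usage: Decimal, quota_left: Decimal) -> tuple[Decimal, Decimal]:
    """Deduct usage from the package quota first.

    Returns (usage billed pay-per-use, quota remaining in the package).
    """
    covered = min(usage, quota_left)
    return usage - covered, quota_left - covered


if __name__ == "__main__":
    # Placeholder price: 0.05 per GB scanned (hypothetical, not a DLI price).
    overage_gb, quota_left = apply_package(Decimal("120"), Decimal("100"))
    print("Pay-per-use GB:", overage_gb, "Quota left:", quota_left)
    print("Overage fee:", scan_fee(Decimal("0.05"), overage_gb))
```

In this sketch, only the part of the usage that the package does not cover is passed to the pay-per-use formula, which mirrors the "package first, then pay-per-use" rule described in the table.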

  • Reset by subscription period: For example, if the reset period is monthly and you purchase a package on January 5, the included free resources will reset on the fifth day of each month.
  • Reset by calendar period: For example, if you purchase a package and the reset period is monthly, the included free resources will reset at 00:00:00 on the first day of each calendar month.
  • Storage packages reset every hour based on the calendar period, while other packages reset based on the subscription period.
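The two monthly reset rules above can be illustrated with a short date calculation. This is a sketch under stated assumptions, not DLI's actual implementation: the function names are hypothetical, and clamping to the last day of a shorter month (for example, a package purchased on the 31st) is an assumption that the text above does not specify.

```python
import calendar
from datetime import date, datetime


def _next_month(year: int, month: int) -> tuple[int, int]:
    """Return the (year, month) that follows the given month."""
    return (year + 1, 1) if month == 12 else (year, month + 1)


def next_calendar_reset(now: datetime) -> datetime:
    """Reset by calendar period: 00:00:00 on the first day of the next calendar month."""
    year, month = _next_month(now.year, now.month)
    return datetime(year, month, 1)


def next_subscription_reset(purchased: date, today: date) -> date:
    """Reset by subscription period: the purchase day-of-month, each month.

    Returns the first reset date strictly after both `today` and `purchased`.
    Assumption: if the month is shorter than the purchase day, the reset
    falls on the last day of that month.
    """
    year, month = today.year, today.month
    while True:
        last_day = calendar.monthrange(year, month)[1]
        candidate = date(year, month, min(purchased.day, last_day))
        if candidate > today and candidate > purchased:
            return candidate
        year, month = _next_month(year, month)


if __name__ == "__main__":
    # Package purchased on January 5 with a monthly reset period.
    print(next_subscription_reset(date(2024, 1, 5), date(2024, 1, 20)))
    print(next_calendar_reset(datetime(2024, 1, 20, 10, 30)))
```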

Application Scenarios

In general, you are advised to create separate projects for workloads with different service attributes.
  • Development project: This project is mainly used by engineers for development and debugging. The workload is unpredictable and the data volume is small. In this case, you are advised to use the pay-per-use (CUH) billing mode, which helps you control costs and keep resource consumption within a bounded range. If you have a short-term need for exclusive resources, you can also select the dedicated resource mode when purchasing a pay-per-use queue.
  • Production project: The workload is relatively stable because jobs are developed and debugged before going online. You are advised to use the package billing mode, which is more cost-effective. In addition, the compute resources of a pay-per-use queue are automatically released after the queue has been idle for 1 hour (that is, no jobs are running on it). When the queue is used again, compute resources must be reallocated, which may take 5–10 minutes. The package billing mode avoids this waiting time.
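The idle-release behavior of pay-per-use queues described above can be modeled with a simple check. This is a hypothetical sketch for planning purposes only: the function and constant names are invented here, the check does not call any DLI API, and treating exactly 1 hour of idleness as "released" is an assumption.

```python
from datetime import datetime, timedelta

# Values taken from the description above.
IDLE_RELEASE_AFTER = timedelta(hours=1)
REALLOCATION_WAIT = (timedelta(minutes=5), timedelta(minutes=10))


def resources_released(last_job_end: datetime, now: datetime, running_jobs: int) -> bool:
    """Return True if a pay-per-use queue's compute resources are expected to be released.

    Assumption: the idle period starts when the last job finished.
    """
    if running_jobs > 0:
        return False
    return now - last_job_end >= IDLE_RELEASE_AFTER


if __name__ == "__main__":
    last_end = datetime(2024, 4, 19, 9, 0)
    if resources_released(last_end, datetime(2024, 4, 19, 10, 30), running_jobs=0):
        low, high = REALLOCATION_WAIT
        print(f"Expect a reallocation wait of roughly {low} to {high}.")
```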