Help Center> Data Lake Insight> Billing> Billing for Scanned Data
Updated on 2024-01-23 GMT+08:00

Billing for Scanned Data

Billing by Scanned Data Volume

Billing by scanned data volume: Billing is determined by the amount of data scanned during query job execution. If the scanned byte volume is below 10 MB, it will be rounded up to 10 MB for calculation purposes.

You will not be billed for DDL statements (such as CREATE, ALTER, DROP TABLE), partition management statements, or failed queries.

Canceled queries are charged based on the total amount of data scanned at the time of cancellation.

Billing Modes

DLI's default queue is billed by the scanned data volume. The scanned data volume can be billed using the following modes:

  • Pay-per-use: You are billed based on the scanned data volume when you submit jobs to the default queue. This billing mode is based on usage.

    Billing is calculated per hour from the time of creation, rounded up to the nearest hour.

    Fees for scanned data volume = Unit price x Scanned data volume (in GB)

  • Package: DLI offers package billing for scanned data volume. If you purchase a scanned data volume package, the quota included in the package is preferentially deducted when you use resources. If you exceed the package limit, you will be billed on a pay-per-use basis. The included quota will reset every month.

    See Package Billing for more information on package billing.

This section describes the pay-per-use billing rules for scanned data volume.

Application Scenarios

DLI provides a preconfigured queue named default for you to try out the service. This queue can be used for job execution when you are unsure of the required queue capacity or do not have space to create a new queue. Queue resources are allocated on demand and billed based on the scanned data volume. This is suitable for testing scenarios where resource consumption is low and pay-per-use billing is more cost-effective, without the need for any upfront payment.

Constraints and Limitations

  • Only the default queue is billed based on the scanned data volume.
  • If the scanned byte volume is below 10 MB, it will be rounded up to 10 MB for calculation purposes.
  • You will not be billed for DDL statements (such as CREATE, ALTER, DROP TABLE), partition management statements, or failed queries.
  • Canceled queries are charged based on the total amount of data scanned at the time of cancellation.
  • If a computing task times out or fails, no fee is charged for the task.
  • The way data is stored, including its format and whether it is compressed, directly affects the cost calculation.

    For example, if a Parquet file is one-fourth the size of the original CSV file, the data scan cost will be one-fourth of the cost of scanning the original CSV file.

Billing Rules

Table 1 Billing rules for the scanned data volume

Type

Resource

Description

Data scan

Scanned data volume

Only the default queue supports billing by the scanned data volume. (This queue is preset for you to try out the service.)

Fees for scanned data volume = Unit price x Scanned data volume (in GB)

If the scanned byte volume is below 10 MB, it will be rounded up to 10 MB for calculation purposes.

You can check the scanned data volume on the DLI console.

The following uses a SQL job as an example:

  1. In the navigation pane on the left, choose Job Management > SQL Jobs.
  2. On the SQL Jobs page displayed, filter jobs that use the default queue.
  3. Click to expand the job details and view its scanned data volume.
    Figure 1 Example price

Billing Examples

The prices are just examples. The actual prices are those displayed on DLI Pricing Details.

For details, see the billing example for the scanned data volume on the DLI pricing details page.

Impact of Arrears

Figure 2 shows the statuses a pay-per-use DLI queue can have throughout its lifecycle. After a DLI queue is purchased, it enters the validity period and operates properly during this period. If your account goes into arrears or exceeds the expenditure quota, the queue enters a grace period and then a retention period.

Figure 2 Lifecycle of a pay-per-use DLI resource

Arrears Reminder

The system deducts fees from your account balance for pay-per-use resources at the end of each billing cycle. You will be notified by email, SMS, or in-app messages when your account falls into arrears or exceeds the expenditure quota.

Impact of Arrears

Your resources enter the grace period and you cannot submit jobs in DLI, including SQL, Spark, and Flink jobs. You will need to pay for the fees incurred during the grace period, which you can see on the Billing & Costs > Billing Center > Overview page of the Huawei Cloud console.

If you do not bring your account balance current before the grace period expires, your resources will enter the retention period and become frozen. You cannot perform any operations on the pay-per-use resources during this period.

If you do not bring your account balance current before the retention period ends, your resource will be released and the data cannot be restored.

For details about how to top up your account, see Topping Up an Account.

For details about expenditure quotas, see Expenditure Quota.

Combination of Pay-per-Use and Package

DLI allows you to use package and pay-per-use together.

If you purchase a scanned data volume package, the quota included in the package is preferentially deducted when you use resources. If you exceed the package limit, you will be billed on a pay-per-use basis. The included quota will reset every month.

For details about how to use a package, see Package Billing.