Updated on 2025-02-22 GMT+08:00

ARCHIVELOG

Function

Archives instants on the Timeline based on configurations and deletes archived instants from the Timeline to reduce the operation pressure on the Timeline.

Syntax

RUN ARCHIVELOG ON tableIdentifier;

RUN ARCHIVELOG ON tablelocation;

Parameter Description

Table 1 Parameter descriptions

Parameter

Description

tableIdentifier

Name of the Hudi table

tablelocation

Storage path of the Hudi table

Example

run archivelog on h1;
run archivelog on "obs://bucket/path/h1";

Caveats

  • First, execute the clean command. After the clean command has cleaned up the historical data files, only the corresponding instants on the timeline can be archived.
  • No matter whether the compaction operation is performed, at least x (x indicates the value of hoodie.compact.inline.max.delta.commits) instants are retained and not archived to ensure that there are enough instants to trigger the compaction schedule.
  • When using the metadata service provided by DLI, this command does not support OBS paths.

System Response

You can check if the job status is successful, view the job result, and review the job logs to confirm if there are any exceptions.