Updated on 2024-12-30 GMT+08:00

Creating a Migration Task Group

Scenario

If there is more than 3 TB of data or more than 5 million objects to migrate in a source bucket, you can create a migration task group to migrate the source data quickly. The system will group the source objects into multiple sub-tasks for concurrent migrations.

  • By default, if a task stays in a waiting state for more than 30 days, OMS automatically puts the task into a Failed state and clears the AK/SK pair used by the task. To prevent task failures caused by blocking, do not create tasks too frequently.
  • If an object is larger than 500 GB, migrating it takes a long time. You can submit a service ticket to whitelist the object for migration in segments.
  • During the migration, the system automatically creates a temporary folder named OMS in the target bucket. Do not perform any operations on this folder, including but not limited to modifying, deleting, or adding data in the folder. Otherwise, the migration will be interrupted or fail.
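The size and object-count thresholds in this scenario can be expressed as a quick check. The following is a minimal sketch, not part of OMS: the function name is hypothetical, and it assumes that "TB" means 2**40 bytes.

```python
# Assumption: "TB" in the scenario is read as 2**40 bytes.
TIB = 1024 ** 4


def needs_task_group(total_bytes: int, object_count: int) -> bool:
    """Return True if the bucket exceeds either threshold from the Scenario section."""
    return total_bytes > 3 * TIB or object_count > 5_000_000


print(needs_task_group(4 * TIB, 10_000))     # True: more than 3 TB
print(needs_task_group(1 * TIB, 6_000_000))  # True: more than 5 million objects
print(needs_task_group(1 * TIB, 10_000))     # False: a single task is enough
```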

Background

OMS supports the following migration sources:

  • Huawei Cloud
  • AWS
  • Alibaba Cloud
  • Microsoft Azure
  • Tencent Cloud
  • Google Cloud
  • Qiniu Cloud
  • QingCloud
  • Baidu Cloud
  • Kingsoft Cloud
  • UCloud
  • HTTP/HTTPS data sources

This section uses Huawei Cloud as the migration source. For tutorials for other cloud platforms, see Migration from Other Clouds to Huawei Cloud.

OMS makes API calls to the object storage services on source platforms during the migration. These API calls may incur fees on those platforms, and you are responsible for paying them according to each platform's billing rules.

Prerequisites

  • You have signed up for a HUAWEI ID and enabled Huawei Cloud services.
  • You have obtained the AK/SK pairs and required permissions for the source and destination platform accounts.
    • The source platform account needs the permissions for:
      • Listing buckets
      • Obtaining bucket locations
      • Listing objects
      • Obtaining object metadata
      • Obtaining object content

      To learn how to obtain these permissions, see Source Permissions.

    • The destination platform account needs the permissions for:
      • Listing all buckets
      • Obtaining bucket locations
      • Listing objects
      • Obtaining object metadata
      • Obtaining object content
      • Listing uploaded parts
      • Restoring archive objects

      To learn how to obtain these permissions, see Destination Permissions.

  • You have created at least one bucket on OBS.
  • You have created fewer than 300 migration task groups within the past 24 hours.
  • You have created fewer than 1,000,000 migration tasks within the past 24 hours.
  • You have fewer than 1,000,000 migration tasks in Waiting state.
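If you track your own task counts, the three quota prerequisites can be checked before you create a task group. This is an illustrative sketch only: the AccountUsage class and its fields are hypothetical, and the counts must come from your own records.

```python
from dataclasses import dataclass

# Limits taken from the prerequisites above.
MAX_GROUPS_24H = 300
MAX_TASKS_24H = 1_000_000
MAX_WAITING_TASKS = 1_000_000


@dataclass
class AccountUsage:  # hypothetical container for counts you collect yourself
    groups_created_24h: int
    tasks_created_24h: int
    tasks_waiting: int


def quota_violations(usage: AccountUsage) -> list:
    """Return a list of prerequisite violations; an empty list means none."""
    problems = []
    if usage.groups_created_24h >= MAX_GROUPS_24H:
        problems.append("too many task groups created in the past 24 hours")
    if usage.tasks_created_24h >= MAX_TASKS_24H:
        problems.append("too many tasks created in the past 24 hours")
    if usage.tasks_waiting >= MAX_WAITING_TASKS:
        problems.append("too many tasks in Waiting state")
    return problems


print(quota_violations(AccountUsage(12, 40_000, 500)))  # []
```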

Procedure

  1. Log in to the management console.
  2. In the upper left corner, click the region icon and select the desired region.
  3. Click Service List, and choose Migration > Object Storage Migration Service.

    The Object Storage Migration Service page is displayed.

  4. In the navigation pane on the left, choose Migration Task Groups.
  5. Click Create Migration Task Group in the upper right corner.

    Read the service disclaimer, select I have read and agree to the service disclaimer, and click OK.

    The page for creating a migration task group is displayed.

  6. On the Select Source/Destination page, perform the following operations:

    1. Configure source parameters by referring to Table 1.
      Table 1 Source parameters

      Parameter

      Description

      Data Source

      Select the source cloud service provider.

      (Optional) APPID

      Enter the identifier of the account for logging in to the Tencent Cloud console. This parameter is only available if the source cloud service provider is Tencent Cloud.

      Access Key ID

      Enter the AK for accessing the source platform. It can contain a maximum of 100 characters.

      Secret Access Key

      Enter the SK for accessing the source platform. The SK and the AK form a pair. The SK can contain a maximum of 100 characters.

      Specify Source Bucket

      Enable this option if your source account does not have the permissions required to list buckets but has the permissions to list objects in buckets, obtain object metadata, and obtain objects. Then specify the bucket or path to be migrated.

      For details, see Table 2.

      Table 2 Description of Specify Source Bucket

      If You...

      Then...

      Enable Specify Source Bucket

      1. Select the region where the source bucket is located from the Region drop-down list.
      2. In the Bucket text box, enter the bucket or the path to be migrated. The path must have the bucket name included.
      3. Click Connect.
        NOTE:

        OMS will check whether the source platform can be connected using the information you entered. If the connection succeeds, the system displays a message to confirm it.

      Disable Specify Source Bucket. (This feature is disabled by default.)

      1. Click List Buckets.

        OMS will check whether the source platform can be connected using the information you entered. After the source platform is connected, all buckets in your account will be listed in the Bucket drop-down list, provided that your account has the permissions needed to list buckets.

      2. In the Bucket drop-down list, select the bucket you want to migrate.
    2. Set the destination parameters listed in Table 3.
      Table 3 Destination parameters

      Parameter

      Description

      Access Key ID

      Enter the AK for accessing Huawei Cloud. It can contain a maximum of 100 characters.

      Secret Access Key

      Enter the SK for accessing Huawei Cloud. The SK and the AK form a pair. The SK can contain a maximum of 100 characters.

      Then click List Buckets.

      Bucket

      Select a destination OBS bucket for storing the migrated data.

  7. Click Next. The Set Task Group Parameters page is displayed.
  8. Set parameters in the Source Configuration area.

    1. Select a migration method.

      Migration Method

      Description

      Object name prefix

      Enter the names or prefixes of objects to be migrated and click Add. The system will automatically list the matched objects. To remove a prefix, click the delete icon next to it. The system will then not migrate the objects that match the removed prefix.

      NOTE:
      • If the objects you want to migrate are stored in the root directory of the source bucket, add their prefixes directly. If these objects are stored in non-root directories, add their directories and prefixes in the format of Bucket name/Folder name/Prefix.
      • If no object names or prefixes are specified, all objects in the source bucket will be migrated.
      • The objects to be migrated are recorded in .txt files, and each migration task has a list file. The storage path of the files is oms/taskgroup/ in the destination bucket. These list files will be retained after the migration, and you can manually delete them.

      Object list folder

      Generally, this method is used for incremental migration. You need to list the names of source objects to be migrated in TXT files and store the files in an OBS bucket on Huawei Cloud. OMS will migrate all objects specified in the files.

      1. Select the bucket where the object lists are stored.
        NOTE:

        The listed buckets are obtained using the destination AK/SK pair you entered in substep 2 of step 6.

        The bucket used for storing object lists must be in the same region as the destination bucket, or you cannot select these object lists.

      2. Click Select next to Object List Folder, select the object list files, and click OK.
      NOTE:
      • An object list file cannot exceed 1,024 MB.
      • A maximum of 2,000 object list files can be stored in the object list folder.
      • An object list file must be a .txt file, and the Content-Type metadata of the file must be text/plain.
      • An object list file must be in UTF-8 without BOM.
      • Each line in an object list file can contain only one object name, and the object name must be URL encoded if it contains special characters or spaces.
      • No line in an object list file may contain unencoded spaces. Spaces may cause migration failures because they may be mistakenly identified as object names.
      • Each line in an object list file cannot be longer than 65,535 characters, or the migration will fail.
      • The Content-Encoding metadata of each object list file must be left empty, or the migration will fail.
      • Archived data can only be accessed and obtained by OMS after it is restored. If the object list files are archived, manually restore them before migration.
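The formatting rules for object list files can be applied when you generate the file. The sketch below is one possible way to do it, under stated assumptions: the function name is hypothetical, lines are separated with "\n", and "/" is left unencoded in object names. Uploading the file to OBS and setting its Content-Type metadata to text/plain are not shown.

```python
from urllib.parse import quote

MAX_LINE_CHARS = 65_535
MAX_FILE_BYTES = 1024 * 1024 * 1024  # 1,024 MB


def build_object_list(object_names):
    """Return the bytes of an object list file: one URL-encoded name per line."""
    lines = []
    for name in object_names:
        # Assumption: "/" stays as is; spaces and special characters are encoded.
        encoded = quote(name, safe="/")
        if len(encoded) > MAX_LINE_CHARS:
            raise ValueError("line longer than 65,535 characters: " + name)
        lines.append(encoded)
    # The "utf-8" codec does not write a BOM (unlike "utf-8-sig").
    data = "\n".join(lines).encode("utf-8")
    if len(data) > MAX_FILE_BYTES:
        raise ValueError("object list file exceeds 1,024 MB")
    return data


content = build_object_list(["the file.txt", "doc/a&b.txt"])
print(content.decode("utf-8"))
# the%20file.txt
# doc/a%26b.txt
```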

      URL list folder

      Generally, this method is used to migrate HTTP/HTTPS data. You need to write the URLs of the files to be migrated and their destination object names into one or more .txt files, and save the files to a folder in an OBS bucket on Huawei Cloud. You can store up to 2,000 list files in that folder. Each list file cannot exceed 1 GB. OMS migrates all objects specified in the URL list files.

      1. Select the bucket where the list files are stored.
        NOTE:
        • The listed buckets are obtained using the destination AK/SK pair you entered in substep 2 of step 6.
        • The bucket used for storing the list files must be located in the same region as the destination bucket, or you cannot select these files.
        • For security purposes, if the migration source supports both HTTP and HTTPS, you are advised to use HTTPS for the migration. If HTTP is used, data may be eavesdropped on, tampered with, or vulnerable to man-in-the-middle or replay attacks.
        • If the data to be migrated is not public and you do not want to make the data public, you are advised to use other migration methods.
        • URL redirection is not supported.
      2. Click Select next to URL List Folder, select the URL list files, and click OK.
        NOTE:
        • A URL list file cannot exceed 1,024 MB.
        • A maximum of 2,000 URL list files can be stored in the URL list folder.
        • A URL list file must be a .txt file, and the Content-Type metadata of the file must be text/plain.
        • A URL list file must be in UTF-8 without BOM.
        • Each line in a URL list file can contain only one URL and one destination object name.
        • Each line in a URL list file cannot be longer than 65,535 characters, or the migration will fail.
        • The Content-Encoding metadata of each URL list file must be left empty, or the migration will fail.
        • In the URL list file, you must use a tab character (\t) to separate the URL and destination object name in each line. Do not use spaces to separate them. The format is [URL][Tab character][Destination object name]. Chinese and special characters in the source and destination object names must be URL encoded.

          Please note that:

          URL encoding applies only to the part of the URL after the slash that follows the domain name. Do not encode the protocol header, the domain name, or the slashes before or after the domain name. Otherwise, the format verification will fail.

          For example (the URL and the destination object name in each line are separated by a tab character):
          http://xxx.xxx.xxx.xxx.com/doc/thefile2.txt	doc/thefile2.txt
          http://xxx.xxx.xxx.xxx.com/the%20file.txt	the%20file.txt
          http://xxx.xxx.xxx.xxx.com/the%20file2.txt	the+file2.txt
          http://xxx.xxx.xxx.xxx.com/doc/thefile.txt	doc/thefile.txt

          In the preceding examples, after the files represented by the URLs are copied to the destination bucket, the objects are named doc/thefile2.txt, the file.txt, the file2.txt, and doc/thefile.txt.

        • No line in the URL list file may contain spaces. Spaces may cause migration failures because they may be mistakenly identified as object names.
        • Each URL in the list file must be accessible using the HEAD and GET methods.
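A line of a URL list file can be assembled programmatically to avoid stray spaces and encoding mistakes. The following is a sketch under stated assumptions, not an OMS tool: the function name is hypothetical, example.com is a placeholder domain, and the inputs are expected to be unencoded (an already encoded input would be encoded twice).

```python
from urllib.parse import quote, urlsplit, urlunsplit

MAX_LINE_CHARS = 65_535


def url_list_line(url, dest_name):
    """Return one list-file line: encoded URL, a tab character, encoded object name."""
    parts = urlsplit(url)
    # Encode only the path; the protocol header and domain name stay untouched.
    encoded_path = quote(parts.path, safe="/")
    encoded_url = urlunsplit(
        (parts.scheme, parts.netloc, encoded_path, parts.query, parts.fragment)
    )
    encoded_dest = quote(dest_name, safe="/")
    line = encoded_url + "\t" + encoded_dest
    if len(line) > MAX_LINE_CHARS:
        raise ValueError("line longer than 65,535 characters")
    return line


print(url_list_line("http://example.com/the file.txt", "the file.txt"))
# http://example.com/the%20file.txt<TAB>the%20file.txt
```

In the printed line, <TAB> stands for the literal tab character that separates the two fields.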
    2. Decide whether to migrate metadata.
      • If you select Do not migrate, OMS will only migrate the Content-Type metadata.
      • If you select Migrate, OMS will migrate supported metadata.

        Only metadata that consists of Chinese characters, English letters, digits, and hyphens (-) can be migrated. Metadata that contains other characters cannot be migrated.

    3. Set Selective Migration.
      • If you do not need to filter source data by time, set this option to No.
      • If you need to filter source data by time, set this option to Yes. Then only source data modified after the specified time will be migrated.
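The effect of Selective Migration can be pictured as a filter on the last modification time. This is only a conceptual sketch: the function name and the sample objects are hypothetical, and it assumes that an object modified exactly at the specified time is not selected.

```python
from datetime import datetime, timezone


def modified_after(objects, cutoff):
    """Return the names of objects last modified after the cutoff time."""
    return [name for name, last_modified in objects if last_modified > cutoff]


objects = [
    ("old.log", datetime(2024, 1, 1, tzinfo=timezone.utc)),
    ("new.log", datetime(2024, 6, 1, tzinfo=timezone.utc)),
]
cutoff = datetime(2024, 3, 1, tzinfo=timezone.utc)
print(modified_after(objects, cutoff))  # ['new.log']
```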

  9. Configure parameters in the Destination Configuration area.

    1. Data Encryption: If the destination is a parallel file system (PFS), data encryption is not supported.
      • If Enable is not selected, objects will not be encrypted after being migrated.
      • If Enable is selected, all migrated objects will be encrypted before they are stored into the destination bucket.
        • Enabling data encryption may decrease the migration speed by about 10%.
        • Data encryption is only available if the encryption service is supported in the region you are migrating to.
    2. Specify Prefix
      • If this option is set to No, the names of objects remain unchanged before and after the migration.
      • If this option is set to Yes, you can specify a prefix to rename or redefine the paths for objects migrated to the destination bucket.
        Figure 1 Specifying a prefix

        To learn how to use Specify Prefix, see Adding a Name Prefix or Path Prefix to Migrated Objects.

    3. Object Storage Class

      Choose the storage class that your data will be migrated to. For details about storage classes, see How Do I Choose Storage Classes?
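For the Specify Prefix option in this step, the resulting object names can be previewed with a small helper. This sketch assumes that the prefix is simply prepended to the source object name, so that a prefix ending in a slash acts as a path prefix and any other prefix acts as a name prefix. The function name is hypothetical. See Adding a Name Prefix or Path Prefix to Migrated Objects for the authoritative behavior.

```python
def destination_name(source_name, prefix=""):
    """Return the assumed destination object name for a given prefix."""
    return prefix + source_name


print(destination_name("photos/cat.jpg"))             # photos/cat.jpg
print(destination_name("photos/cat.jpg", "backup/"))  # backup/photos/cat.jpg
print(destination_name("cat.jpg", "2024-"))           # 2024-cat.jpg
```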

  10. (Optional) Click Show next to Advanced Settings to expand the advanced options. Configure advanced options by referring to Table 4.

    Table 4 Advanced settings

    Parameter

    Description

    Failed Object Record

    Decide whether to record the objects that fail to be migrated.

    • If this option is set to No, the system will not record the objects that fail to be migrated. If the migration fails and needs to be retried, all objects in the source bucket need to be migrated again.
    • If this option is set to Yes, the system will record the objects that fail to be migrated. If the migration fails and needs to be retried, you need to migrate only these failed objects again.
    NOTE:
    • For a migration task, regardless of whether it is part of a task group, the objects that fail to be migrated are recorded in a list named after the task name, and the list is uploaded to the oms/failed_object_lists/ directory of the destination bucket.
    • A maximum of 100,000 failed objects can be recorded in a failed object list file.

      If more than 100,000 objects fail to be migrated in a migration task, you are advised to rectify the fault based on the existing failed object list and perform the migration again.

    • This file contains information about the failed migration task, including the failure cause and time (in local UTC), URL-encoded names of failed objects, non-URL-encoded names of failed objects, and error codes returned by source SDKs.
    • This file is retained in the destination bucket even after the objects in the file are successfully re-transferred. If you do not need the file, delete it manually.

    Archive Data Restoration

    Archived data must be restored before it can be migrated. You can choose to enable this option if your source cloud service provider supports automatic restoration of archived data.

    • If you enable this option, the system automatically restores and migrates archived objects in the migration task. If an archived object fails to be restored, the system records it in the list of objects that fail to be migrated and continues to migrate other objects in the migration task.
      CAUTION:
      • The following cloud service providers can automatically restore archived objects: Huawei Cloud, Alibaba Cloud, Kingsoft Cloud, and Tencent Cloud.
      • The system will restore archived data before migrating it, and you pay the source cloud platform for the API requests and storage space generated accordingly.
      • The copies generated for restoring archived data are stored for three days by default. If a source bucket contains more than 3 TB of data or more than 3 million objects, create a migration task group so that the data is migrated before the restored copies are archived again, which would cause the migration to fail.
      • Pausing or restarting a migration task causes all archived data to be restored again, regardless of whether it has already been restored.
      • After objects are migrated from the source platform to Huawei Cloud, they are in the same storage class as the destination bucket.
    • If you do not enable this option, the system directly records archive objects in the list of objects that failed to be migrated, skips them, and continues to migrate other objects in the migration task.

    SMN Notification

    Decide whether to use SMN to get notifications about migration results.
    • If you do not enable this option, no SMN message will be sent after the migration is complete.
    • If you enable this option, you will receive an SMN message informing you of the migration results.

      Perform the following steps:

      1. Click Add next to the SMN Topic box. The Select SMN Topic dialog box is displayed.
      2. Select a topic for publishing SMN messages and click OK.

        If OMS is not authorized to publish messages to the selected SMN topic, a Warning dialog box will appear after you click OK. If you need to complete the authorization, click OK to authorize OMS. Otherwise, click Cancel to return to the Select SMN Topic page and select another SMN topic.

      3. Configure Trigger Condition. You can select one of the following options or both:
      • Migration succeeded: An SMN message will be sent if the migration task succeeds.
      • Migration failed: An SMN message will be sent if the migration task fails.
      NOTE:

      If you experience trouble in using SMN, see What Do I Do If SMN Does Not Work Properly?

    Traffic Limiting

    Set the maximum bandwidth for the migration task during different periods.

    • If you do not need to limit the migration traffic, select No.
    • If you need to limit the migration traffic, select Yes.

      Perform the following steps:

      1. Click Add Rule.

        An empty traffic limiting rule is displayed.

      2. Specify Start Time, End Time, and Maximum Bandwidth.

        The maximum bandwidth ranges from 1 MB/s to 200 MB/s.

        NOTICE:

        For example, if you set Start Time to 08:00, End Time to 12:00, and Maximum Bandwidth to 20 MB/s, the maximum migration speed is limited to 20 MB/s when the migration task runs in the period from 08:00 to 12:00. The migration speed is not limited beyond this period.

        The time is the local standard time of the region you are migrating to.

      3. Repeat steps 1 and 2 to add another traffic limiting rule.

        To delete a rule, click the delete icon in the Operation column.

        NOTE:

        A maximum of five rules can be added for a migration task group.

        You can set traffic limiting rules in the task details even after the task is created.
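The way traffic limiting rules apply over a day can be modeled as follows. This is a sketch for reasoning about a rule set, not OMS code. It assumes that a rule's period does not cross midnight, that End Time is exclusive, and that rules do not overlap. The function names are hypothetical.

```python
from datetime import time

MIN_MBPS = 1
MAX_MBPS = 200
MAX_RULES = 5


def validate_rules(rules):
    """Raise ValueError if the rules break the limits described above."""
    if len(rules) > MAX_RULES:
        raise ValueError("a task group can have at most five rules")
    for start, end, mbps in rules:
        if start >= end:  # assumption: periods do not cross midnight
            raise ValueError("Start Time must be earlier than End Time")
        if not MIN_MBPS <= mbps <= MAX_MBPS:
            raise ValueError("Maximum Bandwidth must be 1 MB/s to 200 MB/s")


def bandwidth_limit(rules, now):
    """Return the limit in MB/s that applies at the given time, or None if unlimited."""
    for start, end, mbps in rules:
        if start <= now < end:
            return mbps
    return None


rules = [(time(8, 0), time(12, 0), 20)]
validate_rules(rules)
print(bandwidth_limit(rules, time(9, 30)))  # 20
print(bandwidth_limit(rules, time(13, 0)))  # None
```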

    Object Overwrite

    • Overwrite if source newer or different size: If a source object differs in size from, or was last modified more recently than, its paired destination object, the source object will overwrite the destination object. Otherwise, the source object will be skipped.
    • Never overwrite: The system will always skip source objects and keep their paired destination objects.
    • Overwrite if different CRC64 checksum: If a source object has a CRC64 checksum different from the paired destination object, the source object will overwrite the destination object. Otherwise, the source object will be skipped. If either of them does not have a CRC64 checksum, their sizes and last modification times are checked.
      NOTE:
      • This option is only available for migrations within Huawei Cloud or from Alibaba Cloud or Tencent Cloud.
      • This option is applied only when the source object has the same encryption attribute as the destination object.
    • Always overwrite: The system will always allow source objects to overwrite their paired destination objects.
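The four Object Overwrite options can be summarized as a decision function. The sketch below is an interpretation of the descriptions above, not the OMS implementation: the policy strings and the ObjectInfo class are hypothetical, "different size" is read as any size difference, and the encryption attribute condition for the CRC64 option is not modeled.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import Optional


@dataclass
class ObjectInfo:  # hypothetical summary of an object's attributes
    size: int
    last_modified: datetime
    crc64: Optional[int] = None


def newer_or_different_size(src, dst):
    return src.size != dst.size or src.last_modified > dst.last_modified


def should_overwrite(policy, src, dst):
    """Return True if the source object should be written to the destination."""
    if dst is None:
        return True  # no paired destination object, so nothing is overwritten
    if policy == "never":
        return False
    if policy == "always":
        return True
    if policy == "crc64":
        if src.crc64 is not None and dst.crc64 is not None:
            return src.crc64 != dst.crc64
        # Fall back to size and time when either checksum is missing.
        return newer_or_different_size(src, dst)
    if policy == "newer_or_different_size":
        return newer_or_different_size(src, dst)
    raise ValueError("unknown policy: " + policy)


src = ObjectInfo(100, datetime(2024, 6, 1), crc64=1)
dst = ObjectInfo(100, datetime(2024, 1, 1), crc64=1)
print(should_overwrite("newer_or_different_size", src, dst))  # True (source is newer)
print(should_overwrite("crc64", src, dst))                    # False (same checksum)
print(should_overwrite("never", src, dst))                    # False
```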

    Consistency Check

    • By size and last modification time: With this default setting, the system will check object consistency with object size and last modification time.
    • By CRC64 checksum: The system will check object consistency with CRC64 checksums. If a source object and the paired destination object have CRC64 checksums, the checksums are checked. Otherwise, their sizes and last modification times are checked.

    Priority

    If there are a large number of tasks, you can set task priorities to control which tasks take precedence over other tasks.

    Description

    Enter a brief description of the migration task.

    A maximum of 255 characters can be entered.

  11. Click Next to enter the confirmation page.
  12. Confirm the migration settings and start the migration.

    1. Check that the migration settings are correct.

      If the settings are incorrect, click Previous to return to the previous page for modification.

    2. Click Migrate Now. A message is displayed, indicating that the migration task is created successfully.

      The task group is displayed in the migration task group list.