How Migration Jobs Work
Data Migration Model
Figure 1 shows the simplified migration model used by CDM.
- When data migration jobs are submitted, CDM splits each job into multiple tasks based on the Concurrent Extractors parameter in the job configuration.
Jobs for different data sources may be split based on different dimensions. Some jobs may not be split based on the Concurrent Extractors parameter.
- CDM submits the tasks to the running pool in sequence. Tasks (defined by Maximum Concurrent Extractors) run concurrently. Excess tasks are queued.
Factors Affecting Migration Performance
According to the migration model, the migration speed is affected by factors such as the source read speed, network bandwidth, destination write performance, and CDM cluster and job configuration.
Factor |
Description |
|
---|---|---|
Service-related factors |
Concurrent extractors of a job |
The number of concurrent extractors can be set for a CDM job during the job creation. Setting a proper value for this parameter can effectively improve the migration speed. If the value is too small, migration will be too slow. If the value is too large, the migration job is overloaded and may fail.
|
Maximum concurrent extractors of a cluster |
Setting a proper value for this parameter can effectively improve the migration speed. If the value is too small, migration will be too slow. If the value is too large, the source is overloaded and the system may be unstable. The maximum concurrent extractors vary depending on the CDM cluster flavor. The upper limit is twice the number of vCPUs. The following are the maximum concurrent extractors of some flavors:
|
|
Service model |
If the number of CDM jobs that run concurrently exceeds the maximum concurrent extractors for the CDM cluster, some jobs will be queued, and the migration will be prolonged. Avoid running too many jobs simultaneously, which may cause slow migration due to insufficient resources. |
|
Data model |
The migration speed is also affected by the data structure. The following are some examples:
|
|
Source read speed |
It depends on the performance of the data source at the source. For details about how to increase the read speed, see the documents of data sources at the source. |
|
Network bandwidth |
The CDM cluster can communicate with the data source through an intranet, public network VPN, NAT, or Direct Connect.
|
|
Destination write performance |
It depends on the performance of the data source at the destination. For details about how to improve the performance, see the documents of data sources at the destination. |
Feedback
Was this page helpful?
Provide feedbackThank you very much for your feedback. We will continue working to improve the documentation.See the reply and handling status in My Cloud VOC.
For any further questions, feel free to contact us through the chatbot.
Chatbot