Storage Types
Storage options vary based on performance, ease of use, and cost. No single storage type fits every need. Explore cloud storage scenarios to improve your usage.
| Storage Type | Application | Advantage | Disadvantage |
|---|---|---|---|
| EVS | It is suitable for data and algorithm exploration in the development environment and offers good performance. | Block storage SSDs feature better overall I/O performance than NFS. The storage capacity can be dynamically expanded to up to 4,096 GB. As persistent storage, EVS disks are mounted to /home/ma-user/work. The data in this directory is retained after the instance is stopped. The storage capacity can be expanded online based on demand. | This type of storage can only be used in a single development environment. |
| Object Storage Service – Parallel File System | NOTE:
Parallel file systems mounted as persistent storage for AI development and exploration.
| PFS is an optimized high-performance object storage file system with low storage costs and large throughput. It can quickly process high-performance computing (HPC) workloads. PFS mounting is recommended if OBS is used. NOTE: Package or split the data to be uploaded by 128 MB or 64 MB. Download and decompress the data in local storage for better I/O and throughput performance. | The performance of frequent read and write operations on small files is poor, which may cause notebook instance freezing. Exercise caution when using this storage type for heavy model training and large file decompression. NOTE: Before mounting PFS storage to a notebook instance, grant ModelArts with full read and write permissions on the PFS bucket. The policy will be retained even after the notebook instance is deleted. |
| Object Storage Service – Bucket | NOTE:
When uploading or downloading a large amount of data in the development environment, you can use OBS buckets to transfer data. | Low storage cost and high throughput, but average performance in reading and writing small files. It is good practice to package or split the file by 128 MB or 64 MB. In this way, you can download the packages, decompress them, and use them locally. | The object storage semantics is different from the Posix semantics and needs to be further understood. The performance of frequent read and write operations on small files is poor, which may cause notebook instance freezing. Exercise caution when using this storage type in scenarios such as model training and large file decompression. |
| Scalable File Service | Available only in dedicated resource pools. Use SFS storage in informal production scenarios such as exploration and experiments. Development and training environments can mount the same SFS storage simultaneously, eliminating the need to download data for every training job. Generally, this setup is not suitable for large-scale training exceeding 32 PUs or for models with heavy I/O read/write requirements. | SFS uses NFS and can be shared among multiple development and training environments. It is ideal for light-duty distributed training jobs that do not need extra data downloads at the start. | The performance is lower than that of EVS block storage. |
| Local storage | First choice for heavy-duty training jobs. | High-performance SSDs for the used VM or BMS, featuring high file I/O throughput. For heavy-duty training jobs, store data in the target directory and then start training. By default, the storage is mounted to the /cache directory. For details about the available space of the /cache directory, see What Are Sizes of the /cache Directories for Different Notebook Specifications in DevEnviron? | The storage lifecycle is associated with the container lifecycle. Data needs to be downloaded each time the training job starts. |
Feedback
Was this page helpful?
Provide feedbackThank you very much for your feedback. We will continue working to improve the documentation.See the reply and handling status in My Cloud VOC.
For any further questions, feel free to contact us through the chatbot.
Chatbot