Volcano Scheduler
Description
Volcano is a batch scheduling platform based on Kubernetes. It provides a series of features required by machine learning, deep learning, bioinformatics, genomics, and other big data applications, as a powerful supplement to Kubernetes capabilities.
Volcano provides general computing capabilities such as high-performance job scheduling, heterogeneous chip management, and job run management. It integrates with computing frameworks used in fields such as AI, big data, genomics, and rendering, and can schedule up to 1,000 pods per second for end users, greatly improving scheduling efficiency and resource utilization.
Volcano provides job scheduling, job management, and queue management for computing applications. Its main features are as follows:
- Diverse computing frameworks: Custom resource definitions (CRDs) provide common APIs for batch computing tasks. With various plug-ins and advanced job lifecycle management, computing frameworks such as TensorFlow, MPI, and Spark can run on Kubernetes in containers.
- Advanced scheduling: Advanced scheduling capabilities are provided for batch computing and high-performance computing scenarios, including group scheduling, priority preemption, packing, resource reservation, and task topology.
- Queue management: Queues can be effectively managed for scheduling jobs. Complex job scheduling can be implemented based on queue priorities or through multi-level queues.
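The features above can be illustrated with a minimal sketch that builds a queue and a gang-scheduled job as Kubernetes manifests. It assumes the upstream Volcano API groups `scheduling.volcano.sh/v1beta1` (Queue) and `batch.volcano.sh/v1alpha1` (Job); check them against the CRDs installed in your cluster. The names `training-queue` and `demo-job`, the `busybox` image, and the helper functions `build_queue` and `build_job` are hypothetical and used only for illustration.

```python
import json


def build_queue(name: str, weight: int) -> dict:
    """Return a Volcano Queue manifest (hypothetical helper).

    `weight` sets the queue's relative share of cluster resources.
    """
    return {
        "apiVersion": "scheduling.volcano.sh/v1beta1",  # assumed upstream API version
        "kind": "Queue",
        "metadata": {"name": name},
        "spec": {"weight": weight},
    }


def build_job(name: str, queue: str, replicas: int, image: str) -> dict:
    """Return a Volcano Job manifest that is scheduled as a group (hypothetical helper).

    Setting `minAvailable` equal to `replicas` asks the scheduler to start
    the job only when all of its pods can be placed together.
    """
    return {
        "apiVersion": "batch.volcano.sh/v1alpha1",  # assumed upstream API version
        "kind": "Job",
        "metadata": {"name": name},
        "spec": {
            "schedulerName": "volcano",
            "queue": queue,
            "minAvailable": replicas,
            "tasks": [
                {
                    "name": "worker",
                    "replicas": replicas,
                    "template": {
                        "spec": {
                            "restartPolicy": "Never",
                            "containers": [
                                {
                                    "name": "worker",
                                    "image": image,
                                    "command": ["sh", "-c", "echo working; sleep 10"],
                                }
                            ],
                        }
                    },
                }
            ],
        },
    }


if __name__ == "__main__":
    manifests = [
        build_queue("training-queue", weight=2),
        build_job("demo-job", "training-queue", replicas=3, image="busybox"),
    ]
    # JSON is a subset of YAML, so each printed document can be saved to a
    # file and submitted with `kubectl apply -f <file>`.
    for manifest in manifests:
        print(json.dumps(manifest, indent=2))
```

Building the manifests as plain dictionaries keeps the sketch free of third-party dependencies. In practice the same fields are usually written directly in a YAML file.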
Volcano is open source and available on GitHub at https://github.com/volcano-sh/volcano.
Constraints
Exercise caution when changing the plug-in version: downgrading from a later version to an earlier version may cause job scheduling failures.
Installing the Plug-in
Certain plug-ins are automatically installed when you enable Lite Cluster resources. For details, see Enabling Lite Cluster Resources.
Components
Component | Description | Resource Type |
---|---|---|
volcano-scheduler | Schedule pods. | Deployment |
volcano-controller | Synchronize CRDs. | Deployment |
volcano-admission | Webhook server, which verifies and modifies resources such as pods and jobs. | Deployment |
Change History
Plug-in Version | New Feature |
---|---|
1.17.11 | |
1.16.8 | |
1.15.8 | Supported Ascend NPU dual-die affinity scheduling. |
1.15.6 | Resources can be oversubscribed based on pod profiling. |
1.13.5 | |
1.12.18 | |
1.12.1 | Optimized application auto scaling performance. |
1.11.9 | |
1.10.10 | Fixed the issue that the local PV plug-in fails to calculate the number of pods pre-bound to the node. |
1.10.7 | Fixed the issue that the local PV plug-in fails to calculate the number of pods pre-bound to the node. |
1.7.1 | Supported v1.25 clusters. |