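Creating a PyTorchJob
Function
This API creates a PyTorchJob.
A PyTorchJob (PyTorch job) is a Kubernetes custom resource type based on the open-source PyTorch framework. It provides several configurable roles and makes it simpler to run PyTorch training.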
URI
POST /apis/kubeflow.org/v1/namespaces/{namespace}/pytorchjobs
Parameter | Mandatory | Description |
---|---|---|
namespace | Yes | Object name and auth scope, such as for teams and projects. |

Parameter | Mandatory | Description |
---|---|---|
pretty | No | If 'true', then the output is pretty printed. |
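The URI and the two parameter tables above can be combined into a request URL as in the following minimal sketch. The function name `pytorchjob_url` and the `api_server` argument are hypothetical names introduced for illustration; the address of the API server depends on your environment and is not given in this section.

```python
from urllib.parse import quote, urlencode


def pytorchjob_url(api_server, namespace, pretty=False):
    """Build the URL for POST .../namespaces/{namespace}/pytorchjobs.

    api_server is an assumed base address such as "https://example.com".
    """
    # namespace is the mandatory path parameter.
    path = (
        "/apis/kubeflow.org/v1/namespaces/"
        + quote(namespace, safe="")
        + "/pytorchjobs"
    )
    # pretty is the optional query parameter.
    query = "?" + urlencode({"pretty": "true"}) if pretty else ""
    return api_server.rstrip("/") + path + query
```

For example, `pytorchjob_url("https://example.com", "kube-test", pretty=True)` yields a URL that ends in `/namespaces/kube-test/pytorchjobs?pretty=true`.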
Request
Request parameters:
For details about the request parameters, see Table 165.
Example request:
{
  "apiVersion": "kubeflow.org/v1",
  "kind": "PyTorchJob",
  "metadata": {
    "name": "pytorch-test"
  },
  "spec": {
    "pytorchReplicaSpecs": {
      "Master": {
        "replicas": 1,
        "restartPolicy": "Never",
        "template": {
          "spec": {
            "containers": [
              {
                "name": "pytorch",
                "image": "*.*.*.215:20202/gcs/pytorch-cpu:v1",
                "command": ["python", "/var/mnist.py"],
                "args": ["--backend", "gloo"],
                "resources": {
                  "limits": { "cpu": 2, "memory": "4Gi" },
                  "requests": { "cpu": 2, "memory": "4Gi" }
                }
              }
            ],
            "imagePullSecrets": [
              { "name": "imagepull-secret" }
            ]
          }
        }
      },
      "Worker": {
        "replicas": 1,
        "restartPolicy": "OnFailure",
        "template": {
          "spec": {
            "containers": [
              {
                "name": "pytorch",
                "image": "*.*.*.215:20202/gcs/pytorch-cpu:v1",
                "command": ["python", "/var/mnist.py"],
                "args": ["--backend", "gloo"],
                "resources": {
                  "limits": { "cpu": 2, "memory": "4Gi" },
                  "requests": { "cpu": 2, "memory": "4Gi" }
                }
              }
            ],
            "imagePullSecrets": [
              { "name": "imagepull-secret" }
            ]
          }
        }
      }
    }
  }
}
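The request body above repeats the same pod template for the Master and Worker roles, so it can be generated instead of written by hand. The following is a minimal sketch using only the Python standard library. The helper names (`replica_spec`, `build_pytorchjob`, `build_create_request`) are hypothetical, and the `Authorization: Bearer` header is an assumption: this section does not state how the API authenticates requests, so substitute whatever your environment requires.

```python
import json
import urllib.request


def replica_spec(image, restart_policy, replicas=1):
    """Build one pytorchReplicaSpecs entry (Master or Worker)."""
    resources = {"cpu": 2, "memory": "4Gi"}
    return {
        "replicas": replicas,
        "restartPolicy": restart_policy,
        "template": {
            "spec": {
                "containers": [
                    {
                        "name": "pytorch",
                        "image": image,
                        "command": ["python", "/var/mnist.py"],
                        "args": ["--backend", "gloo"],
                        "resources": {
                            "limits": dict(resources),
                            "requests": dict(resources),
                        },
                    }
                ],
                "imagePullSecrets": [{"name": "imagepull-secret"}],
            }
        },
    }


def build_pytorchjob(name, image, workers=1):
    """Assemble a PyTorchJob body shaped like the example request."""
    return {
        "apiVersion": "kubeflow.org/v1",
        "kind": "PyTorchJob",
        "metadata": {"name": name},
        "spec": {
            "pytorchReplicaSpecs": {
                "Master": replica_spec(image, "Never"),
                "Worker": replica_spec(image, "OnFailure", replicas=workers),
            }
        },
    }


def build_create_request(url, job, token):
    """Prepare (but do not send) the POST request."""
    return urllib.request.Request(
        url,
        data=json.dumps(job).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            # Assumption: bearer-token authentication.
            "Authorization": f"Bearer {token}",
        },
        method="POST",
    )
```

The prepared request can then be sent with `urllib.request.urlopen(request)`. Building the request separately from sending it keeps the body easy to inspect or log before it reaches the cluster.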
Response
Response parameters:
For details about the response parameters, see Table 165.
Example response:
{
  "apiVersion": "kubeflow.org/v1",
  "kind": "PyTorchJob",
  "metadata": {
    "creationTimestamp": "2019-07-24T10:29:45Z",
    "generation": 1,
    "name": "pytorch-test",
    "namespace": "kube-test",
    "resourceVersion": "72516798",
    "selfLink": "/apis/kubeflow.org/v1/namespaces/kube-test/pytorchjobs/pytorch-test",
    "uid": "f4c79668-adfd-11e9-8041-340a9837e2a7"
  },
  "spec": {
    "pytorchReplicaSpecs": {
      "Master": {
        "replicas": 1,
        "restartPolicy": "Never",
        "template": {
          "spec": {
            "containers": [
              {
                "args": ["--backend", "gloo"],
                "command": ["python", "/var/mnist.py"],
                "image": "*.*.*.215:20202/gcs/pytorch-cpu:v1",
                "name": "pytorch",
                "resources": {
                  "limits": { "cpu": 2, "memory": "4Gi" },
                  "requests": { "cpu": 2, "memory": "4Gi" }
                }
              }
            ],
            "imagePullSecrets": [
              { "name": "imagepull-secret" }
            ]
          }
        }
      },
      "Worker": {
        "replicas": 1,
        "restartPolicy": "OnFailure",
        "template": {
          "spec": {
            "containers": [
              {
                "args": ["--backend", "gloo"],
                "command": ["python", "/var/mnist.py"],
                "image": "*.*.*.215:20202/gcs/pytorch-cpu:v1",
                "name": "pytorch",
                "resources": {
                  "limits": { "cpu": 2, "memory": "4Gi" },
                  "requests": { "cpu": 2, "memory": "4Gi" }
                }
              }
            ],
            "imagePullSecrets": [
              { "name": "imagepull-secret" }
            ]
          }
        }
      }
    }
  },
  "status": { }
}
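A client usually needs only a few fields from a response like the one above, such as the server-assigned `uid` and the replica count per role. The following minimal sketch extracts them. The function name `summarize_created_job` is hypothetical, and the sketch assumes the response body has the shape shown in the example response.

```python
import json


def summarize_created_job(response_text):
    """Pick identifying fields out of a create-PyTorchJob response body."""
    job = json.loads(response_text)
    metadata = job.get("metadata", {})
    replica_specs = job.get("spec", {}).get("pytorchReplicaSpecs", {})
    return {
        "name": metadata.get("name"),
        "namespace": metadata.get("namespace"),
        "uid": metadata.get("uid"),
        # Map each role (for example Master, Worker) to its replica count.
        "replicas": {
            role: spec.get("replicas") for role, spec in replica_specs.items()
        },
    }
```

Using `.get()` throughout means that a missing field results in `None` instead of an exception, which is convenient because the `status` object in the example is empty right after creation.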
Status Codes
Status Code | Description |
---|---|
200 | OK |
201 | Created |
202 | Accepted |
400 | Bad Request |
401 | Unauthorized |
403 | Forbidden |
500 | Internal error |
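A caller can treat the 2xx codes in the status code table as success and turn the others into errors. The following minimal sketch shows one way to do that. The names `STATUS_DESCRIPTIONS` and `check_status` are hypothetical, and raising `RuntimeError` is an illustrative choice, not behavior defined by the API.

```python
# Status codes and descriptions as listed in the status code table.
STATUS_DESCRIPTIONS = {
    200: "OK",
    201: "Created",
    202: "Accepted",
    400: "Bad Request",
    401: "Unauthorized",
    403: "Forbidden",
    500: "Internal error",
}


def check_status(code):
    """Return the description for a success code, raise for anything else."""
    description = STATUS_DESCRIPTIONS.get(code, "Unknown status")
    if code in (200, 201, 202):
        return description
    raise RuntimeError(f"PyTorchJob creation failed: {code} {description}")
```

For example, `check_status(201)` returns `"Created"`, while `check_status(403)` raises an error whose message includes `403 Forbidden`.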