Creating a TFJob
Function
This API is used to create a TFJob.
TensorFlow Training (TFJob) is a TensorFlow-based Kubernetes custom resource that you can use to run standalone or distributed TensorFlow training jobs. For details about the open-source framework TensorFlow, visit https://www.tensorflow.org.
URI
POST /apis/kubeflow.org/v1/namespaces/{namespace}/tfjobs
|
Parameter |
Mandatory |
Description |
|---|---|---|
|
namespace |
Yes |
Object name and auth scope, such as for teams and projects. |
|
Parameter |
Mandatory |
Description |
|---|---|---|
|
pretty |
No |
If 'true', then the output is pretty printed. |
Request
Request parameters
For the description about request parameters, see Table 154.
Example request
{
"apiVersion": "kubeflow.org/v1",
"kind": "TFJob",
"metadata": {
"name": "tfjob-test"
},
"spec": {
"backoffLimit": 6,
"tfReplicaSpecs": {
"Ps": {
"replicas": 1,
"template": {
"spec": {
"containers": [
{
"args": [
"python",
"/opt/tf-benchmarks/scripts/tf_cnn_benchmarks/tf_cnn_benchmarks.py",
"--batch_size=1",
"--model=resnet50",
"--variable_update=parameter_server",
"--flush_stdout=true",
"--num_gpus=1",
"--local_parameter_device=cpu",
"--device=cpu",
"--data_format=NHWC"
],
"image": "*.*.*.215:20202/cci/tf-benchmarks-cpu:v1",
"name": "tensorflow",
"ports": [
{
"containerPort": 2222,
"name": "tfjob-port"
}
],
"resources": {
"limits": {
"cpu": "2",
"memory": "4Gi"
},
"requests": {
"cpu": "2",
"memory": "4Gi"
}
}
}
],
"restartPolicy": "OnFailure",
"imagePullSecrets": [
{
"name": "imagepull-secret"
}
]
}
}
},
"Worker": {
"replicas": 1,
"template": {
"spec": {
"containers": [
{
"args": [
"python",
"/opt/tf-benchmarks/scripts/tf_cnn_benchmarks/tf_cnn_benchmarks.py",
"--batch_size=1",
"--model=resnet50",
"--variable_update=parameter_server",
"--flush_stdout=true",
"--local_parameter_device=cpu",
"--device=cpu",
"--data_format=NHWC"
],
"image": "*.*.*.215:20202/cci/tf-benchmarks-cpu:v1",
"name": "tensorflow",
"ports": [
{
"containerPort": 2222,
"name": "tfjob-port"
}
],
"resources": {
"limits": {
"cpu": "2",
"memory": "4Gi"
},
"requests": {
"cpu": "2",
"memory": "4Gi"
}
}
}
],
"restartPolicy": "OnFailure",
"imagePullSecrets": [
{
"name": "imagepull-secret"
}
]
}
}
}
}
}
}
Response
Response parameters
For the description about response parameters, see Table 154.
Example response
{
"apiVersion": "kubeflow.org/v1",
"kind": "TFJob",
"metadata": {
"creationTimestamp": "2019-07-23T12:39:47Z",
"generation": 1,
"name": "tfjob-test",
"namespace": "kube-test",
"resourceVersion": "72050567",
"selfLink": "/apis/kubeflow.org/v1/namespaces/kube-test/tfjobs/tfjob-test",
"uid": "f461f966-ad46-11e9-aaa4-340a9837e413"
},
"spec": {
"backoffLimit": 6,
"tfReplicaSpecs": {
"Ps": {
"replicas": 1,
"template": {
"spec": {
"containers": [
{
"args": [
"python",
"/opt/tf-benchmarks/scripts/tf_cnn_benchmarks/tf_cnn_benchmarks.py",
"--batch_size=1",
"--model=resnet50",
"--variable_update=parameter_server",
"--flush_stdout=true",
"--num_gpus=1",
"--local_parameter_device=cpu",
"--device=cpu",
"--data_format=NHWC"
],
"image": "*.*.*.215:20202/cci/tf-benchmarks-cpu:v1",
"name": "tensorflow",
"ports": [
{
"containerPort": 2222,
"name": "tfjob-port"
}
],
"resources": {
"limits": {
"cpu": "2",
"memory": "4Gi"
},
"requests": {
"cpu": "2",
"memory": "4Gi"
}
}
}
],
"imagePullSecrets": [
{
"name": "imagepull-secret"
}
],
"restartPolicy": "OnFailure"
}
}
},
"Worker": {
"replicas": 1,
"template": {
"spec": {
"containers": [
{
"args": [
"python",
"/opt/tf-benchmarks/scripts/tf_cnn_benchmarks/tf_cnn_benchmarks.py",
"--batch_size=1",
"--model=resnet50",
"--variable_update=parameter_server",
"--flush_stdout=true",
"--local_parameter_device=cpu",
"--device=cpu",
"--data_format=NHWC"
],
"image": "*.*.*.215:20202/cci/tf-benchmarks-cpu:v1",
"name": "tensorflow",
"ports": [
{
"containerPort": 2222,
"name": "tfjob-port"
}
],
"resources": {
"limits": {
"cpu": "2",
"memory": "4Gi"
},
"requests": {
"cpu": "2",
"memory": "4Gi"
}
}
}
],
"imagePullSecrets": [
{
"name": "imagepull-secret"
}
],
"restartPolicy": "OnFailure"
}
}
}
}
},
"status": {
}
}
Status Codes
|
Status Code |
Description |
|---|---|
|
200 |
OK |
|
201 |
Created |
|
202 |
Accepted |
|
401 |
Unauthorized |
|
400 |
Badrequest |
|
500 |
Internal error |
|
403 |
Forbidden |
Last Article: TFJob
Next Article: Reading a TFJob
Did this article solve your problem?
Thank you for your score!Your feedback would help us improve the website.