Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update tf related resource files #150

Merged
merged 2 commits into from
May 10, 2019
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
8 changes: 6 additions & 2 deletions example/integrations/tensorflow/Dockerfile
Original file line number Diff line number Diff line change
@@ -1,6 +1,10 @@
#NOTE: the build process would change during developing.
# NOTE: the build process would change during developing,
# the commit ID when first creating the image: 62c833f806db621943a6cf8195657b9d0fa67d93 (master)
# original image is: gcr.io/kubeflow/tf-benchmarks-cpu:v20171202-bdab599-dirty-284af3,
# the image needs an update to use the latest tf-benchmark logic
# ref => https://github.com/tensorflow/benchmarks/tree/master/scripts/tf_cnn_benchmarks.
FROM python:2.7
MAINTAINER volcano <maintainer@volcano.sh>
MAINTAINER volcano <volcano-sh@googlegroups.com>
RUN apt-get update --fix-missing \
&& apt-get install -y git \
&& apt-get clean \
Expand Down
7 changes: 4 additions & 3 deletions example/integrations/tensorflow/tf-example.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -27,14 +27,15 @@
# 2019-04-23 11:10:25.552861: I tensorflow/core/distributed_runtime/rpc/grpc_channel.cc:215]
# Initialize GrpcChannelCache for job worker -> {0 -> tensorflow-benchmark-worker-0.tensorflow-benchmark:2222}
#
# **NOTES**: This example may take about an hour to finish.
# **NOTES**: This example may take about an hour to finish. When running multiple jobs, please ensure enough resource
# is guaranteed for each of the worker pods.

apiVersion: batch.volcano.sh/v1alpha1
kind: Job
metadata:
name: tensorflow-benchmark
spec:
minAvailable: 2
minAvailable: 3
schedulerName: kube-batch
plugins:
env: []
Expand Down Expand Up @@ -65,7 +66,7 @@ spec:
resources: {}
workingDir: /opt/tf-benchmarks/scripts/tf_cnn_benchmarks
restartPolicy: OnFailure
- replicas: 1
- replicas: 2
name: worker
policies:
- event: TaskCompleted
Expand Down