OOM issue #22

ecilay · 2018-08-09T02:07:04Z

When I followed the instructions as specified in the Docker setup, it always gives an out-of-memory error, even though I am already using an AWS P3 instance, which has a Tesla V100.
Is this expected, or is something wrong in my setup?

My config:
tensorflow-gpu: 1.10.0
keras: 2.0.9

Error from vgg_normalised.py line 38:

OOM when allocating tensor of shape [3] and type float
[[Node: vgg_encoder/preprocess/Const_1 = Const[dtype=DT_FLOAT, value=Tensor<type: float shape: [3] values: -103.939 -116.779 -123.68>, _device="/job:localhost/replica:0/task:0/device:GPU:0"]()]]

Thanks!

eridgd · 2018-08-09T21:57:19Z

The V100 has 16GB of VRAM, so that should certainly be enough (it runs fine on my 4GB GPU). Does the same thing happen if you try running it outside of docker?

The first thing that comes to mind is that you may be running something else that's holding onto GPU memory; that's a mistake I make all the time. If you run nvidia-smi and look in the "Memory-Usage" column, does it show 16GB total with only a small portion of that used? You can run this within the container as well: nvidia-docker run --rm wct-tf nvidia-smi
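
For anyone who wants to script the nvidia-smi check described above, here is a minimal sketch in Python. It is not part of this repository: the function names parse_gpu_memory and query_gpu_memory are hypothetical, and the sketch assumes nvidia-smi is on the PATH and accepts the --query-gpu and --format=csv,noheader,nounits options.

```python
import subprocess


def parse_gpu_memory(csv_text):
    """Parse nvidia-smi CSV rows of 'used, total' (in MiB) into (used, total) int tuples."""
    rows = []
    for line in csv_text.strip().splitlines():
        used, total = (int(field.strip()) for field in line.split(","))
        rows.append((used, total))
    return rows


def query_gpu_memory():
    """Ask nvidia-smi for per-GPU memory usage; returns one (used, total) tuple per GPU."""
    # Assumption: nvidia-smi is installed and visible on the PATH.
    output = subprocess.check_output(
        [
            "nvidia-smi",
            "--query-gpu=memory.used,memory.total",
            "--format=csv,noheader,nounits",
        ],
        universal_newlines=True,
    )
    return parse_gpu_memory(output)
```

Calling query_gpu_memory() before launching the script should show whether another process is already holding most of the memory. If the "used" value is already close to "total" before anything has started, that points to a leftover process rather than to the model itself.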

ecilay · 2018-08-09T23:50:00Z

Hello, thanks for the reply!
Yes, I have been checking nvidia-smi throughout, and the process occupies the full GPU memory, as in the screenshot below from an AWS P2 instance (12 GB GPU). I didn't use Docker; I ran the Python command directly on the instance.
I used this command: python3 stylize.py --checkpoints models/relu5_1 models/relu4_1 models/relu3_1 models/relu2_1 models/relu1_1 --relu-targets relu5_1 relu4_1 relu3_1 relu2_1 relu1_1 --style-size 512 --alpha 0.8 --out-path static/style.jpg
