-
Notifications
You must be signed in to change notification settings - Fork 904
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conv2DCustomBackpropFilterOp only supports NHWC error #73
Comments
I am also having this issue. @ChenXi1992 Are you also using Windows? Also, (@merkoski too), what versions of python, tensorflow-gpu, etc, are you using, and did you tweak anything? Maybe we can find a common culprit. |
I'm afraid this doesn't help. In my case the GPU does work just fine, throughout the Tacotron modeling; it fails with the "Conv2DCustomBackpropFilterOp only supports NHWC" error during the Wavenet modeling. I am on Windows10, Python 3.6.5, Tensorflow (GPU) 1.8.0. |
GPU seems fine for me, also. Tacotron seems to be using it fine, but it fails at Wavenet. I haven't tried any other Wavenet implementations, yet, as I'm away. I had to set I'm running Python 3.6.4, with Tensorflow gpu 1.8.0. |
Hey @DanRuta I had to do something similar, in my case specifying tacotron_cache_size of 3 in hparams (via command line in my case). I had to do this in order for the data to fit onto the GPU, but I wonder if a non-standard hparams is causing the issue somehow.... |
Has anyone been able to find a working solution to this problem? For a while I've been stuck with the same error now :/ |
Not me, unfortunately. The existing suggestions were not applicable or didn't solve it. |
Does anyone have any new leads on getting Wavenet to work? I looked all over github and I can't seem to find a solution. I tried isolating my GPU for training, changing h-parameters, and changing data_formats to 'NCHW.' However, nothing seems to quite work :(. I'm running Python 3.6.2 and Tensorflow GPU 1.8.0. Here's my nvcc: Here's my output from "tf.__version__sess = tf.Session(config=tf.ConfigProto(log_device_placement=True))":
|
it seems that all have similar errors Have I written custom code |
@v-yunbin Currently waiting for a reply. Perhaps we should upvote it to try to get a higher priority? Feel free to add to the post as well. System information
|
Encountered the same issue. Tried running Wavenet training without changing anything in hparams.py, still is still occurred. |
hi guys, my wavenet train has been training, replacing tensorflow cpu with tensorflow gpu(1.8 version) |
@v-yunbin, that's awesome to hear! Thanks! |
@avivelor |
Had another go at this today. I nuked my python installation, installed and used Python 3.6 with virtualenv, installing only the requirements.txt and tensorflow-gpu 1.9.0, but the issue persists. I ran It trained the tacotron model for a bit over a couple of hours, and crashed with the NHWC error at the "Wavenet Train" stage. *OS Platform and Distribution: Windows 10 64bit I preprocessed my data using the LJSpeech default option I also tried v1.8.0 of tensorflow-gpu, running --model="Wavenet" (and then tacotron_batch_size 8 and 4), to no avail. Finally, I reinstalled CUDA drivers, updating to 9.2 (with batch sizes of 16 or 8), also to no avail. At this point, I'm no longer sure if this has anything to do with dependencies. |
I was not able to reproduce the Error no matter how hard I tried. (I even changed the Conv2Dtranspose to channels_last and couldn't reproduce. My setup:
I would ask from you guys to check that you don't have tensorflow (cpu) version installed in parallel on your machine, and try to build from source with exact same build I have. |
It seems that the problem appear with Windows 10 64bit |
Hey, I got the same issue when I training the wavenet, Did you find the solution?Thanks very much…… |
I fixed it by switching over to Ray's Setup.
My setup:
- Ubuntu 16.04, 64-bit (HWE installed from server)
- Tensorflow-gpu built from source (tried with 1.7 and 1.8)
- Python: 3.6 (anaconda)
- CUDA/CUDNN: 9.0/7.1
- GPU: Nvidia GTX 1080
…On Tue, Oct 9, 2018, 6:57 PM CherryCloris ***@***.***> wrote:
Hey, I got the same issue when I training the wavenet, Did you find the
solution?Thanks very much……
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#73 (comment)>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/AQPq-jgiAKn59icoi_OZje1P8PHDKfh6ks5ujVP2gaJpZM4UsWbX>
.
|
I am meeting this error,because my tensorflow-cpu by deflault. so i use :conda install tensorflow-gpu |
It's resolved .. |
@lucy3589 Why |
It worked !!!! |
I'm a newbie. Can you be more specific?thank you |
can you explain this a bit more? |
the |
@PrasannaKumarARDS @rameshKrSah Do you know if how this could be implemented into a fork of this project? I'm using karamarieliu/gst_tacotron2_wavenet which is this project but with Tacotron 2 GST. Wavenet on the repo is similar to this repo's earlier 2018 modules.py. I tried changing the
but with it as first, I get this
Am I missing something? |
yes!! I find it useful |
Hi guys, Exiting due to exception: Conv2DCustomBackpropInputOp only supports NHWC. Caused by op 'WaveNet_model/optimizer_1/gradients/WaveNet_model/inference/final_convolution_2/final_convolution_2_1/final_convolution_2/conv1d/Conv2D_grad/Conv2DBackpropInput', defined at: ...which was originally created as op 'WaveNet_model/inference/final_convolution_2/final_convolution_2_1/final_convolution_2/conv1d/Conv2D', defined at: |
Hi there! I'm trying to run this project (BTW, I'm looking forward to it!) and when it gets time to start creating a wavenet training model via "python train.py", I experience this error. Any workarounds? I'm using tensorflow-gpu on a Windows machine. Uninstalling tensorflow-gpu and installing tensorflow does not help.
Generated 328 test batches of size 1 in 0.297 sec
2018-06-18 13:30:33.533678: E T:\src\github\tensorflow\tensorflow\core\common_runtime\executor.cc:660] Executor failed to create kernel. Invalid argument: Conv2DCustomBackpropFilterOp only supports NHWC.
[[Node: model/optimizer/gradients/model/inference/conv2d_transpose/conv2d_transpose_grad/Conv2DBackpropFilter = Conv2DBackpropFilter[T=DT_FLOAT, _class=["loc:@model/optimizer/clip_by_global_norm/mul_197"], data_format="NCHW", dilations=[1, 1, 1, 1], padding="SAME", strides=[1, 1, 1, 16], use_cudnn_on_gpu=true, _device="/job:localhost/replica:0/task:0/device:CPU:0"](model/optimizer/gradients/model/inference/conv2d_transpose/BiasAdd_grad/tuple/control_dependency/_2947, model/optimizer/gradients/model/inference/conv2d_transpose/conv2d_transpose_grad/Shape, model/inference/ExpandDims_1/_2949)]]
Exiting due to Exception: Conv2DCustomBackpropFilterOp only supports NHWC.
[[Node: model/optimizer/gradients/model/inference/conv2d_transpose/conv2d_transpose_grad/Conv2DBackpropFilter = Conv2DBackpropFilter[T=DT_FLOAT, _class=["loc:@model/optimizer/clip_by_global_norm/mul_197"], data_format="NCHW", dilations=[1, 1, 1, 1], padding="SAME", strides=[1, 1, 1, 16], use_cudnn_on_gpu=true, _device="/job:localhost/replica:0/task:0/device:CPU:0"](model/optimizer/gradients/model/inference/conv2d_transpose/BiasAdd_grad/tuple/control_dependency/_2947, model/optimizer/gradients/model/inference/conv2d_transpose/conv2d_transpose_grad/Shape, model/inference/ExpandDims_1/_2949)]]
Caused by op 'model/optimizer/gradients/model/inference/conv2d_transpose/conv2d_transpose_grad/Conv2DBackpropFilter', defined at:
File "train.py", line 128, in
main()
File "train.py", line 122, in main
train(args, log_dir, hparams)
File "train.py", line 76, in train
checkpoint = wavenet_train(args, log_dir, hparams, input_path)
File "D:\DL2\Tacotron-2\wavenet_vocoder\train.py", line 244, in wavenet_train
return train(log_dir, args, hparams, input_path)
File "D:\DL2\Tacotron-2\wavenet_vocoder\train.py", line 167, in train
model, stats = model_train_mode(args, feeder, hparams, global_step)
File "D:\DL2\Tacotron-2\wavenet_vocoder\train.py", line 119, in model_train_mode
model.add_optimizer(global_step)
File "D:\DL2\Tacotron-2\wavenet_vocoder\models\wavenet.py", line 365, in add_optimizer
gradients, variables = zip(*optimizer.compute_gradients(self.loss))
File "C:\Users\merko\AppData\Local\conda\conda\envs\DL2\lib\site-packages\tensorflow\python\training\optimizer.py", line 526, in compute_gradients
colocate_gradients_with_ops=colocate_gradients_with_ops)
File "C:\Users\merko\AppData\Local\conda\conda\envs\DL2\lib\site-packages\tensorflow\python\ops\gradients_impl.py", line 494, in gradients
gate_gradients, aggregation_method, stop_gradients)
File "C:\Users\merko\AppData\Local\conda\conda\envs\DL2\lib\site-packages\tensorflow\python\ops\gradients_impl.py", line 636, in _GradientsHelper
lambda: grad_fn(op, *out_grads))
File "C:\Users\merko\AppData\Local\conda\conda\envs\DL2\lib\site-packages\tensorflow\python\ops\gradients_impl.py", line 385, in _MaybeCompile
return grad_fn() # Exit early
File "C:\Users\merko\AppData\Local\conda\conda\envs\DL2\lib\site-packages\tensorflow\python\ops\gradients_impl.py", line 636, in
lambda: grad_fn(op, *out_grads))
File "C:\Users\merko\AppData\Local\conda\conda\envs\DL2\lib\site-packages\tensorflow\python\ops\nn_grad.py", line 54, in _Conv2DBackpropInputGrad
data_format=op.get_attr("data_format")),
File "C:\Users\merko\AppData\Local\conda\conda\envs\DL2\lib\site-packages\tensorflow\python\ops\gen_nn_ops.py", line 1190, in conv2d_backprop_filter
dilations=dilations, name=name)
File "C:\Users\merko\AppData\Local\conda\conda\envs\DL2\lib\site-packages\tensorflow\python\framework\op_def_library.py", line 787, in _apply_op_helper
op_def=op_def)
File "C:\Users\merko\AppData\Local\conda\conda\envs\DL2\lib\site-packages\tensorflow\python\framework\ops.py", line 3392, in create_op
op_def=op_def)
File "C:\Users\merko\AppData\Local\conda\conda\envs\DL2\lib\site-packages\tensorflow\python\framework\ops.py", line 1718, in init
self._traceback = self._graph._extract_stack() # pylint: disable=protected-access
...which was originally created as op 'model/inference/conv2d_transpose/conv2d_transpose', defined at:
File "train.py", line 128, in
main()
[elided 3 identical lines from previous traceback]
File "D:\DL2\Tacotron-2\wavenet_vocoder\train.py", line 167, in train
model, stats = model_train_mode(args, feeder, hparams, global_step)
File "D:\DL2\Tacotron-2\wavenet_vocoder\train.py", line 117, in model_train_mode
feeder.input_lengths, x=feeder.inputs)
File "D:\DL2\Tacotron-2\wavenet_vocoder\models\wavenet.py", line 169, in initialize
y_hat = self.step(x, c, g, softmax=False) #softmax is automatically computed inside softmax_cross_entropy if needed
File "D:\DL2\Tacotron-2\wavenet_vocoder\models\wavenet.py", line 435, in step
c = transposed_conv(c)
File "D:\DL2\Tacotron-2\wavenet_vocoder\models\modules.py", line 333, in call
return self.convt(inputs)
File "C:\Users\merko\AppData\Local\conda\conda\envs\DL2\lib\site-packages\tensorflow\python\layers\base.py", line 717, in call
outputs = self.call(inputs, *args, **kwargs)
File "C:\Users\merko\AppData\Local\conda\conda\envs\DL2\lib\site-packages\tensorflow\python\layers\convolutional.py", line 1667, in call
data_format=utils.convert_data_format(self.data_format, ndim=4))
File "C:\Users\merko\AppData\Local\conda\conda\envs\DL2\lib\site-packages\tensorflow\python\ops\nn_ops.py", line 1254, in conv2d_transpose
name=name)
File "C:\Users\merko\AppData\Local\conda\conda\envs\DL2\lib\site-packages\tensorflow\python\ops\gen_nn_ops.py", line 1340, in conv2d_backprop_input
dilations=dilations, name=name)
File "C:\Users\merko\AppData\Local\conda\conda\envs\DL2\lib\site-packages\tensorflow\python\framework\op_def_library.py", line 787, in _apply_op_helper
op_def=op_def)
File "C:\Users\merko\AppData\Local\conda\conda\envs\DL2\lib\site-packages\tensorflow\python\framework\ops.py", line 3392, in create_op
op_def=op_def)
File "C:\Users\merko\AppData\Local\conda\conda\envs\DL2\lib\site-packages\tensorflow\python\framework\ops.py", line 1718, in init
self._traceback = self._graph._extract_stack() # pylint: disable=protected-access
InvalidArgumentError (see above for traceback): Conv2DCustomBackpropFilterOp only supports NHWC.
[[Node: model/optimizer/gradients/model/inference/conv2d_transpose/conv2d_transpose_grad/Conv2DBackpropFilter = Conv2DBackpropFilter[T=DT_FLOAT, _class=["loc:@model/optimizer/clip_by_global_norm/mul_197"], data_format="NCHW", dilations=[1, 1, 1, 1], padding="SAME", strides=[1, 1, 1, 16], use_cudnn_on_gpu=true, _device="/job:localhost/replica:0/task:0/device:CPU:0"](model/optimizer/gradients/model/inference/conv2d_transpose/BiasAdd_grad/tuple/control_dependency/_2947, model/optimizer/gradients/model/inference/conv2d_transpose/conv2d_transpose_grad/Shape, model/inference/ExpandDims_1/_2949)]]
2018-06-18 13:30:33.575389: W T:\src\github\tensorflow\tensorflow\core\kernels\queue_base.cc:277] _1_datafeeder/eval_queue: Skipping cancelled enqueue attempt with queue not closed
2018-06-18 13:30:33.581458: W T:\src\github\tensorflow\tensorflow\core\kernels\queue_base.cc:277] _0_datafeeder/intput_queue: Skipping cancelled enqueue attempt with queue not closed
Traceback (most recent call last):
File "train.py", line 128, in
main()
File "train.py", line 122, in main
train(args, log_dir, hparams)
File "train.py", line 78, in train
raise ('Error occured while training Wavenet, Exiting!')
TypeError: exceptions must derive from BaseException
The text was updated successfully, but these errors were encountered: