Fix stop_gradient inconsistent API #7416

Merged
merged 1 commit into keras-team:master from angeloskath:stop-gradient-list on Jul 25, 2017

Conversation

angeloskath (Contributor)

The stop_gradient documentation states that the argument should be a list of
variables. The Theano implementation crashes if the argument is a list of
variables, and the CNTK implementation crashes regardless, although it expects a list (since it passes the argument to combine). The TensorFlow implementation expects a single variable, but if a list is passed it does not crash as long as all the shapes are the same.

This commit handles both cases as expected. The following code illustrates the current Keras behaviour.

import numpy as np
# Backend modules of multi-backend Keras: Theano, TensorFlow and CNTK.
from keras.backend import theano_backend as KTH
from keras.backend import tensorflow_backend as KTF
from keras.backend import cntk_backend as KC

val = np.random.rand(10, 3)
backends = [KTH, KTF, KC]
# `as` is a reserved word in Python, so name the lists `xs`/`ys` instead.
xs = [k.variable(val) for k in backends]
ys = [k.square(x) for k, x in zip(backends, xs)]

a, b = xs[1], ys[1]
KTF.stop_gradient(a)  # works
KTF.stop_gradient([a, b])  # works but returns a tensor?

a, b = xs[0], ys[0]
KTH.stop_gradient(a)  # works
KTH.stop_gradient([a, b])  # crashes

a, b = xs[2], ys[2]
KC.stop_gradient(a)  # crashes
KC.stop_gradient([a, b])  # crashes

This PR makes the behaviour consistent: when passed a single tensor, a single tensor is returned; when passed a list or a tuple, a list is returned.
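
With the fix, both call patterns behave the same way on every backend. A short illustration, reusing the a and b tensors from the snippet above:

KTF.stop_gradient(a)         # single tensor in, single tensor out
KTF.stop_gradient([a, b])    # list in, list of tensors out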

Although always expecting and returning a list would be more consistent with the previous documentation, it would not be consistent with the code's existing behaviour, and changing that could break other people's code. This change is backwards compatible, makes the behaviour consistent, and documents it.
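
For reference, a minimal sketch of how the unified behaviour can be expressed at the backend level, using the TensorFlow backend as an example. tf.stop_gradient is the underlying TensorFlow op; the list handling below simply mirrors the rule described above and is illustrative, not necessarily the exact merged code:

import tensorflow as tf

def stop_gradient(variables):
    # A list or tuple in gives a list back; a single tensor in gives a single tensor back.
    if isinstance(variables, (list, tuple)):
        return [tf.stop_gradient(v) for v in variables]
    return tf.stop_gradient(variables)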

The stop_gradient documentation states that the argument should be a list of
variables. The Theano implementation crashes if the argument is a list of
variables and the CNTK implementation crashes if it is not.

This commit handles both cases as expected.
@fchollet fchollet merged commit 0bc856f into keras-team:master Jul 25, 2017
@angeloskath angeloskath deleted the stop-gradient-list branch July 25, 2017 08:15
ahundt added a commit to ahundt/keras that referenced this pull request Jul 26, 2017
* commit '84ceb94055b831c486dbf4955fdf1ba0f63320d1': (42 commits)
  Fix conv reccurent test
  Style fix in conv recurrent tests.
  Support return_state parameter in ConvRecurrent2D (keras-team#7407)
  Small simplification in ResNet50 architecture
  Update FAQ with info about custom object loading.
  add example for passing in custom objects in load_model (keras-team#7420)
  Update applications.md (keras-team#7428)
  Cast constants in optimizer as floatx.
  Fix stop_gradient inconsistent API (keras-team#7416)
  Simplify static shape management in TF backend.
  Fixed warning showing up when channel axis is 1 (keras-team#7392)
  Throw exception in LSTM layer if timesteps=1 and unroll=True (keras-team#7387)
  Style fix
  Passed the scheduling argument through the `*_generator` function. (keras-team#7236)
  Fix typos. (keras-team#7374)
  Fix ImageDataGenerator.standardize to support batches (keras-team#7360)
  Fix learning phase info being left out in multi-input models (keras-team#7135)
  Fix PEP8
  Fix deserialization bug with layer sharing at heterogenous depths
  Bug fix: Support multiple outputs in Lambda layer (keras-team#7222)
  ...