
with_flatten: make the layer input/output types symmetric #821

Merged 6 commits into explosion:master on Jan 10, 2023

Conversation

@danieldk (Contributor) commented on Dec 13, 2022

The idea of the with_flatten layer is that it flattens a nested sequence, passes it to the wrapped layer, and then unflattens the output of the wrapped layer.

However, the layer was asymmetric in that it passed a list to the wrapped layer, but expected back an XP array. This breaks composition with other Thinc layers, such as with_array.

This change makes with_flatten symmetric: the wrapped layer now receives a flat list and returns a flat list, so the input/output types of with_flatten and the wrapped layer match up.

It seems that this layer is not used in Thinc or spaCy, so maybe it never worked correctly? At any rate, I needed to flatten a nested list during distillation with with_flatten(with_array(...)) and found that it doesn't actually work.
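To make the intended behavior concrete, here is a minimal sketch of the pattern in plain Python (an illustration only, not Thinc's actual implementation, which also handles the backward pass): the wrapped function receives a flat list and returns a flat list, and the wrapper restores the nesting from the recorded lengths.

```python
from typing import Callable, List, TypeVar

InT = TypeVar("InT")
OutT = TypeVar("OutT")


def with_flatten_sketch(
    inner: Callable[[List[InT]], List[OutT]]
) -> Callable[[List[List[InT]]], List[List[OutT]]]:
    """Flatten one level of nesting, apply `inner`, then unflatten."""

    def forward(nested: List[List[InT]]) -> List[List[OutT]]:
        lengths = [len(seq) for seq in nested]
        flat_out = inner([item for seq in nested for item in seq])
        # Symmetry: `inner` returns a flat list, so it can be split
        # back into the original nesting using the recorded lengths.
        output: List[List[OutT]] = []
        start = 0
        for length in lengths:
            output.append(flat_out[start : start + length])
            start += length
        return output

    return forward


# Doubling each item preserves the nesting structure:
double = with_flatten_sketch(lambda xs: [x * 2 for x in xs])
assert double([[1, 2], [3]]) == [[2, 4], [6]]
```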

@kadarakos (Contributor) left a comment

Maybe it's just me, but since with_flatten is in a deep learning library, my initial assumption was that it would call xp.flatten on an ArrayXd, making it an Array1d, and then reshape it back. Should it dynamically flatten either a list or a tensor? Should it be able to flatten any nesting level of lists, just like xp.flatten handles tensors of any order? Or do we keep it as is and document that it flattens lists of lists (one level only)? Or is it just me who made that assumption? I don't want to derail the PR, just mentioning that "flatten" evoked something different for me.
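For reference, the array-level flattening described in this comment would look like the following in NumPy (a minimal sketch; xp in Thinc stands for the active array module, numpy or cupy):

```python
import numpy as np

x = np.arange(6).reshape(2, 3)    # an Array2d
flat = x.flatten()                # an Array1d: array([0, 1, 2, 3, 4, 5])
restored = flat.reshape(x.shape)  # back to the original 2x3 shape
assert (restored == x).all()
```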

Review comment on thinc/layers/with_flatten_v2.py (outdated, resolved)
@danieldk (Contributor, Author) commented on Jan 9, 2023

> Maybe it's just me, but since with_flatten is in a deep learning library, my initial assumption was that it would call xp.flatten on an ArrayXd, making it an Array1d, and then reshape it back. Should it dynamically flatten either a list or a tensor? Should it be able to flatten any nesting level of lists, just like xp.flatten handles tensors of any order? Or do we keep it as is and document that it flattens lists of lists (one level only)? Or is it just me who made that assumption? I don't want to derail the PR, just mentioning that "flatten" evoked something different for me.

These are good questions, and I agree that our naming is not really in line with other deep learning libraries, though it is kind of the normal FP definition of flatten.

At any rate, this PR is just for fixing the existing implementation, which only flattens one level of lists. If we want something more advanced, it should probably get a different name to avoid confusion (and a separate PR).
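After the fix, the composition mentioned in the PR description should work along these lines (a sketch only: per the review thread above, the new implementation lives in thinc/layers/with_flatten_v2.py, so this assumes a with_flatten_v2 constructor exported from thinc.api; the Relu sizes are arbitrary):

```python
from thinc.api import Relu, with_array, with_flatten_v2

# with_flatten_v2 flattens List[List[Floats2d]] into List[Floats2d],
# with_array lets Relu process each array in that flat list, and the
# flat list of outputs is nested back into List[List[Floats2d]].
model = with_flatten_v2(with_array(Relu(nO=4, nI=4)))
```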

@adrianeboyd merged commit f7a819f into explosion:master on Jan 10, 2023