Hey guys, I love both Flax and JAX; they feel like a breath of fresh air in the deep learning ecosystem, and I want to congratulate you all on a job well done. I have one major question that I can't find an answer to in either the Flax or JAX docs: how can I compile dynamically sized inputs faster? From the JAX documentation and my own experiments, XLA compiles a function once per array shape, and in my case I can't just give it a more abstract tensor, since the output array size also depends on the input size.
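To illustrate what I mean, here's a minimal sketch (a toy function, not my actual code) of the per-shape compilation I'm running into:

```python
import jax
import jax.numpy as jnp

@jax.jit
def f(x):
    return jnp.sum(x ** 2)

# jax.jit traces and compiles once per input shape under XLA,
# so feeding many distinct lengths means many compilations.
f(jnp.ones((8,)))    # compiles for shape (8,)
f(jnp.ones((8,)))    # cache hit: same shape, no recompile
f(jnp.ones((13,)))   # compiles again for shape (13,)
```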
Replies: 1 comment
Yes, you're right: this is a fundamental design property of XLA, which assumes completely known shapes (and uses that information for compile-time optimizations). The typical pattern we use is dynamic bucketing, where sequences within certain length ranges are padded and batched together, so XLA only compiles once per bucket. For example, here: https://github.com/google/flax/blob/master/examples/wmt/input_pipeline.py#L204 we take advantage of `tf.data.experimental.bucket_by_sequence_length`, which is not ideal. We'd love to find a long-term, flexible solution, but it also needs to be fast, so it's not clear this can be implemented in pure Python while still supporting very large models.
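To make the bucketing pattern concrete, here's a rough sketch; the dataset, bucket boundaries, and batch sizes below are made up purely for illustration:

```python
import tensorflow as tf

# Toy dataset of variable-length integer sequences (hypothetical
# data, just to demonstrate the bucketing transformation).
ds = tf.data.Dataset.from_generator(
    lambda: ([1] * n for n in [3, 7, 12, 25, 31]),
    output_signature=tf.TensorSpec(shape=(None,), dtype=tf.int32),
)

# Group examples into length buckets so that every batch within a
# bucket is padded to the same shape; a jitted step function then
# compiles once per bucket rather than once per distinct length.
ds = ds.apply(
    tf.data.experimental.bucket_by_sequence_length(
        element_length_func=lambda seq: tf.shape(seq)[0],
        bucket_boundaries=[8, 16, 32],    # buckets: [0,8), [8,16), [16,32)
        bucket_batch_sizes=[4, 4, 2, 1],  # one entry per bucket (+ overflow)
        pad_to_bucket_boundary=True,      # fixed padded shape per bucket
    )
)

for batch in ds:
    print(batch.shape)  # e.g. (2, 7), (1, 15), (2, 31): one shape per bucket
```

With `pad_to_bucket_boundary=True`, every sequence is padded to its bucket's boundary, so the number of distinct batch shapes (and therefore XLA compilations) is bounded by the number of buckets; note that all sequences must then be shorter than the largest boundary.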