Why is the default use_reentrant=True when no kwargs were set? #29638

Closed
simon-lund opened this issue Mar 13, 2024 · 2 comments
@simon-lund

```python
if gradient_checkpointing_kwargs is None:
    gradient_checkpointing_kwargs = {"use_reentrant": True}
```

Here, `use_reentrant=True` is set when no kwargs are provided. According to the PyTorch docs, this seems to be the legacy variant.

Does this have any performance or other advantages that I am not aware of?
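For reference, `use_reentrant` is the flag that `torch.utils.checkpoint.checkpoint` itself accepts. A minimal sketch of the two variants in plain PyTorch (the toy layer and shapes are made up for illustration):

```python
import torch
from torch.utils.checkpoint import checkpoint

layer = torch.nn.Linear(16, 16)              # toy block to checkpoint
x = torch.randn(4, 16, requires_grad=True)

# Legacy (reentrant) variant: the current PyTorch default.
y_legacy = checkpoint(layer, x, use_reentrant=True)

# Non-reentrant variant, which the PyTorch docs recommend going forward.
y_new = checkpoint(layer, x, use_reentrant=False)

# Either way, `layer`'s activations are recomputed during backward
# instead of being stored in the forward pass.
y_new.sum().backward()
```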

@amyeroberts
Collaborator

Hi @simon-lund, thanks for opening an issue!

You can check the git blame to find the PRs which added certain lines, which should normally give you the reasons behind the code logic.

In this case, the line was added in #28538. According to the PR, it prepares for an upcoming torch release in which we will need to pass the `use_reentrant` kwarg explicitly.

The value is set to `True` to match the current default PyTorch behaviour; cf. this comment.
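For anyone finding this later: the default can be overridden by passing the kwarg explicitly when enabling gradient checkpointing. A minimal sketch (the checkpoint name is just an example):

```python
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("gpt2")

# Opt in to the non-reentrant implementation instead of relying on
# the use_reentrant=True default discussed above.
model.gradient_checkpointing_enable(
    gradient_checkpointing_kwargs={"use_reentrant": False}
)
```

If you train with the `Trainer`, the same dict can be passed via `TrainingArguments(gradient_checkpointing=True, gradient_checkpointing_kwargs={"use_reentrant": False})`.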

@simon-lund
Author

Ah, I'm sorry. I only searched the issues and found several discussions about this, but I didn't search through the PRs.
Thank you very much for the link, as well as for the tip about checking the PRs (I was not aware of that) 👍
