[gradient_checkpointing
] default to use it for torch 2.3
#28538
Merged
gradient_checkpointing
] default to use it for torch 2.3
#28538