
Support Lightning >=2.0.0 and Pandas >=2.0.0 #301

Merged · 25 commits into master · Apr 27, 2023

Conversation

@SagiPolaczek (Collaborator) commented on Apr 13, 2023

Ready for CR 🕺🏼

Warning - Lightning >=2.0.0 breaks backward compatibility

Support Lightning 2.0.0:

References:

https://lightning.ai/pages/releases/2.0.0/ (recommended!)
https://lightning.ai/docs/pytorch/latest/upgrade/migration_guide.html (thanks to @smartdanny)
*_epoch_end() migration guide

Changes in Fuse:

  1. Trainer(strategy=None) is no longer supported; we should now use strategy="auto" (see the release notes linked above for more info).
  2. Trainer(auto_select_gpus=...) was also removed.
  3. Tuner and Trainer broke up 💔 -> relevant for KNIGHT’s fuse baseline.
  4. Migrate *_epoch_end() to on_*_epoch_end(); see the changes in run_mnist_custom_pl_imp.py, the migration guide referenced above, and the sketch below for more info.
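
A minimal sketch of the hook migration and the Trainer changes under Lightning >=2.0.0 (the module, loss, and Trainer arguments here are illustrative placeholders, not Fuse's actual code):

```python
import torch
import pytorch_lightning as pl


class LitExample(pl.LightningModule):
    """Illustrative module (not Fuse's code) showing the *_epoch_end() -> on_*_epoch_end() migration."""

    def __init__(self) -> None:
        super().__init__()
        self._val_losses: list = []  # Lightning >=2.0.0: we accumulate step outputs ourselves

    def validation_step(self, batch, batch_idx):
        loss = self._compute_loss(batch)
        self._val_losses.append(loss.detach())
        return loss

    # Lightning <2.0.0 (removed in 2.0.0): the hook received the collected step outputs.
    # def validation_epoch_end(self, outputs):
    #     self.log("val_loss", torch.stack(outputs).mean())

    # Lightning >=2.0.0: no `outputs` argument is passed to the hook.
    def on_validation_epoch_end(self) -> None:
        self.log("val_loss", torch.stack(self._val_losses).mean())
        self._val_losses.clear()

    def _compute_loss(self, batch) -> torch.Tensor:
        return torch.tensor(0.0)  # dummy value, just to keep the sketch self-contained


# Trainer changes: strategy=None -> strategy="auto"; auto_select_gpus=... was removed.
trainer = pl.Trainer(max_epochs=1, accelerator="auto", devices="auto", strategy="auto")
```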

Remarks:

Support pandas 2.0.0:

References:

https://pandas.pydata.org/docs/whatsnew/v2.0.0.html#deprecations

Changes in Fuse:

  1. df.append() was removed (see the pandas 2.0.0 deprecations linked above). Relevant for an EHR transformer dataset; had to switch to pd.concat(), as in the sketch below.
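
A minimal sketch of the replacement with made-up DataFrames (not the actual EHR dataset code):

```python
import pandas as pd

df_a = pd.DataFrame({"RecordID": [1, 2], "In-hospital_death": [0, 1]})
df_b = pd.DataFrame({"RecordID": [3], "In-hospital_death": [0]})

# pandas <2.0.0 (removed in 2.0.0):
# combined = df_a.append(df_b).reset_index(drop=True)

# pandas >=2.0.0:
combined = pd.concat([df_a, df_b]).reset_index(drop=True)
```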

@SagiPolaczek SagiPolaczek marked this pull request as draft April 13, 2023 14:44
@SagiPolaczek SagiPolaczek changed the title Support PyTorch Lightning >= 2.0 Support **Lightning >= 2.0.0** and **Pandas >= 2.0.0** Apr 16, 2023
@SagiPolaczek SagiPolaczek changed the title Support **Lightning >= 2.0.0** and **Pandas >= 2.0.0** Support Lightning >= 2.0.0 and Pandas >= 2.0.0 Apr 16, 2023
@SagiPolaczek (Collaborator, Author) left a comment:

minor explanations

@@ -246,9 +246,10 @@ def _read_raw_data(raw_data_path: str) -> Tuple[pd.DataFrame, pd.DataFrame]:
outcomes = ["Outcomes-a.txt", "Outcomes-b.txt"]
for o in outcomes:
o_file = os.path.join(raw_data_path + "/" + o)
- df_outcomes = df_outcomes.append(pd.read_csv(o_file)[["RecordID", "In-hospital_death"]]).reset_index(
+ df_outcomes = pd.concat([df_outcomes, pd.read_csv(o_file)[["RecordID", "In-hospital_death"]]]).reset_index(
Collaborator Author:

append was removed in 2.0.0

setup.py review comment (outdated, resolved)
@SagiPolaczek SagiPolaczek linked an issue Apr 16, 2023 that may be closed by this pull request
@SagiPolaczek SagiPolaczek marked this pull request as ready for review April 16, 2023 12:06
@mosheraboh (Collaborator) left a comment:

Looks good; see if it's possible to be backward compatible though.

@@ -8,7 +8,7 @@ matplotlib>=3.3.3
scikit-learn>=0.23.2
termcolor>=1.1.0
pycocotools>=2.0.1
- pytorch_lightning>=1.6,<2.0.0 # temp - need to make all tests pass with this version first
+ pytorch_lightning>=2.0.0 # Lightning 2.0.0 is not backward compatible
Collaborator:

Can we be backward compatible in some way? It seems too restrictive.

Collaborator Author:

I agree

setup.py review comment (outdated, resolved)
@SagiPolaczek (Collaborator, Author):

> Looks good; see if it's possible to be backward compatible though.

Thanks!
I thought about it a bit later, and it might indeed be possible. Will try that on Wednesday.

@SagiPolaczek (Collaborator, Author) commented on Apr 19, 2023

@mosheraboh
Now we support both <2.0.0 and >=2.0.0!

Please see the last two runs, which differ only by the Lightning version:
[Screenshot 2023-04-19 at 13:26: the two runs]

I'll do another self-review now.

NOTE

To support both versions I had to delete arguments (and argument values) that break backward compatibility, such as strategy and auto_scale_batch_size.
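
For reference, a runtime version check is one common way to keep both ranges working; a minimal sketch, assuming a hypothetical PL_VERSION_GE_2 helper (the PR itself may simply drop the incompatible arguments, as described above):

```python
import pytorch_lightning as pl
from packaging import version

# Hypothetical helper (not necessarily what the PR uses): detect the installed Lightning version.
PL_VERSION_GE_2 = version.parse(pl.__version__) >= version.parse("2.0.0")

trainer_kwargs = dict(max_epochs=1, num_sanity_val_steps=-1)
if not PL_VERSION_GE_2:
    # This argument only exists in Lightning <2.0.0; in >=2.0.0 batch-size scaling moved to the Tuner.
    trainer_kwargs["auto_scale_batch_size"] = "binsearch"

trainer = pl.Trainer(**trainer_kwargs)
```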

num_sanity_val_steps=-1,
- auto_scale_batch_size="binsearch",
+ # auto_scale_batch_size="binsearch", # should use Tuner - https://lightning.ai/pages/releases/2.0.0/#tuner
Collaborator Author:

This option varies between <2.0.0 and >=2.0.0!
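
Under Lightning >=2.0.0 the same behavior goes through the Tuner class; a minimal sketch based on the release notes linked above (the function around it is illustrative, not the exact Fuse/KNIGHT code):

```python
import pytorch_lightning as pl
from pytorch_lightning.tuner import Tuner


def tune_batch_size(model: pl.LightningModule, trainer: pl.Trainer) -> None:
    # Lightning <2.0.0:  pl.Trainer(auto_scale_batch_size="binsearch") followed by trainer.tune(model)
    # Lightning >=2.0.0: batch-size scaling moved out of Trainer into the Tuner class.
    tuner = Tuner(trainer)
    # The model is expected to expose a batch_size attribute (or hparam) for the tuner to adjust.
    tuner.scale_batch_size(model, mode="binsearch")
```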

@SagiPolaczek SagiPolaczek changed the title Support Lightning >= 2.0.0 and Pandas >= 2.0.0 Support Lightning >=2.0.0 and Pandas >=2.0.0 Apr 19, 2023
mosheraboh previously approved these changes Apr 20, 2023

@mosheraboh (Collaborator) left a comment:

Looks good!

mosheraboh previously approved these changes Apr 24, 2023

@mosheraboh (Collaborator) left a comment:

Looks great, one question inline

@@ -180,13 +180,20 @@ def run_train(params: dict) -> None:
distributed=distributed,
)

+ # A workaround to support multiple GPUs with a custom batch_sampler for both Lightning versions
+ # see: https://lightning.ai/pages/releases/2.0.0/#sampler-replacement
+ kwargs = {}
Collaborator:

Shouldn't it be wrapped with if distributed?

Collaborator Author:

It could be wrapped, but it does nothing when not in distributed mode (for example, the default value is True).

Collaborator Author:

I'll wrap it anyway for clarity, thanks!
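
A minimal sketch of how those version-dependent kwargs might be filled, wrapped in the "if distributed" check discussed above (the flag names follow the Lightning documentation; the helper itself is an assumption, not the exact Fuse code):

```python
import pytorch_lightning as pl
from packaging import version

PL_VERSION_GE_2 = version.parse(pl.__version__) >= version.parse("2.0.0")


def custom_sampler_kwargs(distributed: bool) -> dict:
    """Tell Lightning not to replace our custom batch_sampler when running distributed."""
    kwargs = {}
    if distributed:
        if PL_VERSION_GE_2:
            kwargs["use_distributed_sampler"] = False  # Lightning >=2.0.0 name for the flag
        else:
            kwargs["replace_sampler_ddp"] = False  # Lightning <2.0.0 name (default True)
    return kwargs


trainer = pl.Trainer(accelerator="auto", devices="auto", **custom_sampler_kwargs(distributed=True))
```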

@mosheraboh (Collaborator) left a comment:

Looks great

@SagiPolaczek SagiPolaczek merged commit 2d74f98 into master Apr 27, 2023
@SagiPolaczek SagiPolaczek deleted the sagi/support_pl_2.0 branch April 27, 2023 12:18
SagiPolaczek added a commit to BiomedSciAI/fuse-drug that referenced this pull request Jun 15, 2023
# ✅ 

Same as [Fuse's](BiomedSciAI/fuse-med-ml#301)

## IMPORTANT
Couldn't fix the following unit-test: 
```
/dccstor/mm_hcls/usr/sagi/git_repos/fuse-drug/fusedrug_examples/tests/test_bimodal_mca.py BimodalMCATestCase.test_runner
```

Got some weird issue where it just crashes at the end of the multiprocessing part (**without multiprocessing it works OK!**):
```
column names used=['molecule_sequence', 'molecule_id'] - if it does not look like column names make sure that the following args are properly set: first_row_is_columns_names, columns_names
allow_access_by_id is enabled, building in-memory offset map (num_workers=128)
multiprocess pool created with 128 workers.
100%|██████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 441/441 [00:35<00:00, 12.55it/s]
[rank: 0] Received SIGTERM: 15████████████████████████████████████████████████████████████████████████████████████████▍| 439/441 [00:35<00:00, 14.29it/s]
[rank: 0] Received SIGTERM: 15
[rank: 0] Received SIGTERM: 15
[rank: 0] Received SIGTERM: 15
[rank: 0] Received SIGTERM: 15
[rank: 0] Received SIGTERM: 15
[rank: 0] Received SIGTERM: 15
.
.
.
```

I spent a few hours trying to fix it but didn't succeed. The closest thing
I found was
[this](https://lightning.ai/forums/t/support-for-pytorchdata-dataloader2-multiprocessing-issue/2756)
Lightning x PyData issue.

---------

Co-authored-by: Sagi Polaczek <sagi.polaczek@ibm.com>
Successfully merging this pull request may close these issues.

Support pytorch lightning 2.0.0