
WandbLogger does not mark uploaded model as 'artifact' #4903

Closed

kyoungrok0517 opened this issue Nov 30, 2020 · 10 comments · Fixed by #6231
Labels
3rd party (Related to a 3rd-party) · bug (Something isn't working) · help wanted (Open to be worked on) · logger (Related to the Loggers) · priority: 1 (Medium priority task) · won't fix (This will not be worked on)

Comments

kyoungrok0517 commented Nov 30, 2020

🐛 Bug

I'm using WandbLogger with the latest pytorch-lightning==1.0.8. It seems that the trained checkpoint is treated as a mere file rather than a model artifact, even though I turned on log_model=True. It would be much more convenient to use the model artifact from other scripts, so I hope pytorch-lightning can do this automatically.
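For reference, a minimal sketch of the setup described above (the project name and Trainer arguments are illustrative, not taken from this report):

```python
import pytorch_lightning as pl
from pytorch_lightning.loggers import WandbLogger

# Hypothetical project name; log_model=True is the option discussed in this issue.
wandb_logger = WandbLogger(project="my-project", log_model=True)
trainer = pl.Trainer(logger=wandb_logger, max_epochs=10)
# trainer.fit(model)
# At the end of the run the checkpoint files are uploaded to W&B,
# but only as plain files under the run, not as a model artifact.
```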

Environment

* CUDA:
        - GPU:
                - GeForce RTX 3090
        - available:         True
        - version:           11.0
* Packages:
        - numpy:             1.19.2
        - pyTorch_debug:     True
        - pyTorch_version:   1.7.0
        - pytorch-lightning: 1.0.8
        - tensorboard:       2.3.0
        - tqdm:              4.51.0
* System:
        - OS:                Linux
        - architecture:
                - 64bit
                - ELF
        - processor:         x86_64
        - python:            3.8.5
        - version:           #60-Ubuntu SMP Fri Nov 6 10:37:59 UTC 2020
@kyoungrok0517 added the bug and help wanted labels on Nov 30, 2020
@Borda added the 3rd party and logger labels on Nov 30, 2020
Borda (Member) commented Nov 30, 2020

@borisdayma mind having a look?

borisdayma (Contributor) commented:

This is a great idea!
Right now, wandb uploads the saved model folder at the end of the run, which may include one or several models (when saving checkpoints).

We could upload artifacts at the end of the run or as the model trains.
I think it may make more sense to upload at the end of training, to avoid uploading the model unnecessarily many times, for example if we are only interested in the final version or only the top 5 best models.

Finally, I think each artifact name should be related to the run id, so if we want to supersede a specific artifact we will need to use the same run id (knowing that W&B still lets you attach any number of aliases, such as latest, best, etc., to each artifact).

Let me know if you have any comments before I try to implement it.
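A rough sketch of the naming and aliasing idea, using the plain wandb API (the artifact name, checkpoint path, and aliases here are illustrative):

```python
import wandb

run = wandb.init(project="my-project")

# Tie the artifact name to the run id so re-running with the same run id produces
# new versions of the same artifact instead of a brand-new artifact.
artifact = wandb.Artifact(name=f"model-{run.id}", type="model")
artifact.add_file("checkpoints/epoch=4-step=500.ckpt")  # hypothetical checkpoint path
run.log_artifact(artifact, aliases=["latest", "best"])
run.finish()
```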

kyoungrok0517 (Author) commented:

Thanks for the response. I think following the checkpoint settings (e.g. preserving the best top-k) would be nice, as you mentioned in your comment. Then we could try the models from another script using the wandb API (e.g. automatic download).
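A sketch of how another script could pull a logged model back down with the wandb API (the entity, project, and artifact name below are placeholders):

```python
import wandb

run = wandb.init(project="my-project", job_type="evaluation")

# "model-abc123:best" is a placeholder; use the name/alias the training run actually logged.
artifact = run.use_artifact("my-entity/my-project/model-abc123:best", type="model")
artifact_dir = artifact.download()  # checkpoint files are downloaded locally
print(artifact_dir)
```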

stale bot commented Jan 8, 2021

This issue has been automatically marked as stale because it hasn't had any recent activity. This issue will be closed in 7 days if no further activity occurs. Thank you for your contributions, Pytorch Lightning Team!

The stale bot added the won't fix label on Jan 8, 2021
borisdayma (Contributor) commented:

Keep issue active

The stale bot removed the won't fix label on Jan 8, 2021
tchaton (Contributor) commented Jan 18, 2021

Hey @borisdayma,

Did you start working on it?

Best,
T.C

borisdayma (Contributor) commented:

I haven't had the time yet, but it's definitely on my TODO list.
I have a PR in progress and I'd prefer to wait for it to be merged first.

@edenlightning added the priority: 1 label on Feb 9, 2021
borisdayma (Contributor) commented Feb 26, 2021

I am now working on it and I just realized that #5537 introduced a change in behavior at this line.

Models were supposed to be saved in the W&B run folder only when log_model=True (since those files may be automatically uploaded at the end of a run).

With artifacts, we can let PL save the files directly where it wants and upload models as artifacts separately.
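A sketch of that separation, with PL's ModelCheckpoint saving wherever it is configured to and the kept checkpoints then being logged as artifacts afterwards (paths, the top-k value, and the monitored metric are illustrative):

```python
import pytorch_lightning as pl
import wandb
from pytorch_lightning.callbacks import ModelCheckpoint
from pytorch_lightning.loggers import WandbLogger

# PL saves checkpoints where it is told to, independent of the W&B run folder ...
checkpoint_callback = ModelCheckpoint(dirpath="checkpoints/", save_top_k=5, monitor="val_loss")
wandb_logger = WandbLogger(project="my-project", log_model=True)
trainer = pl.Trainer(logger=wandb_logger, callbacks=[checkpoint_callback], max_epochs=10)
# trainer.fit(model)

# ... and the kept checkpoints are uploaded as a model artifact separately
# (done here in user code purely for illustration).
artifact = wandb.Artifact(name=f"model-{wandb_logger.experiment.id}", type="model")
for path in checkpoint_callback.best_k_models:
    artifact.add_file(path)
wandb_logger.experiment.log_artifact(artifact, aliases=["latest"])
```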

stale bot commented Mar 28, 2021

This issue has been automatically marked as stale because it hasn't had any recent activity. This issue will be closed in 7 days if no further activity occurs. Thank you for your contributions, Pytorch Lightning Team!

The stale bot added the won't fix label on Mar 28, 2021
The stale bot closed this as completed on Apr 5, 2021
borisdayma (Contributor) commented:

PR has now been merged so this issue should be closed!
