Added validation to the training script and exposed more of the settings for network #113

Open · cfd1 wants to merge 2 commits into main from mgn/train_validation
Conversation

@cfd1 (Contributor) commented Sep 25, 2023

Modulus Pull Request

Description

  • Added validation to the training script
  • Exposed more parameters in constants.py

Checklist

  • I am familiar with the Contributing Guidelines.
  • New or existing tests cover these changes.
  • The documentation is up to date with these changes.
  • The CHANGELOG.md is up to date with these changes.

Dependencies

@NickGeneva NickGeneva requested a review from mnabian September 26, 2023 18:11
@NickGeneva NickGeneva added external Issues/PR filed by people outside the team enhancement New feature or request labels Sep 26, 2023
@cfd1 cfd1 force-pushed the mgn/train_validation branch from 5b543a5 to 1a36344 Compare October 3, 2023 12:26
@NickGeneva NickGeneva added the 2 - In Progress Currently a work in progress label Oct 3, 2023
LimitingFactor and others added 2 commits October 4, 2023 19:28
…ngs for network

Signed-off-by: LimitingFactor <aswift0n3@gmail.com>
Updated the wandb initialisation

Signed-off-by: LimitingFactor <aswift0n3@gmail.com>
@cfd1 cfd1 force-pushed the mgn/train_validation branch from f715d7e to 8ad3efc Compare October 4, 2023 18:28
@mnabian (Collaborator) commented Oct 10, 2023

Hi @cfd1 , thanks for the PR! Is this ready for review?

@cfd1 (Contributor, Author) commented Oct 11, 2023

@mnabian yes, it's ready for review, please.

do_concat_trick: bool = False
num_processor_checkpoint_segments: int = 0
# activation_fn: str = "relu"

# performance configs
amp: bool = False
jit: bool = False

# test & visualization configs
Collaborator:
Change to "visualization configs"
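The suggested rename, applied to a minimal sketch of the constants dataclass (field names follow the diff context above; the dataclass layout and the example field under the renamed section are assumptions):

```python
from dataclasses import dataclass


@dataclass
class Constants:
    # field names taken from the diff context above; the dataclass
    # layout itself is an assumption about constants.py
    do_concat_trick: bool = False
    num_processor_checkpoint_segments: int = 0

    # performance configs
    amp: bool = False
    jit: bool = False

    # visualization configs  (renamed per the review comment above)
    frame_skip: int = 10  # hypothetical example field under the renamed section
```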

def get_options():
    parser = argparse.ArgumentParser()

    parser.add_argument("--entity", "-e", type=str, default=None)
Collaborator:
Can we add these to the config file and not use argparse here?
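One way to honor this suggestion is to fold the CLI options into the same dataclass-style config instead of a separate argparse parser. A sketch under stated assumptions: the `wandb_entity` field name and the `get_options` signature are hypothetical, not the PR's actual code.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Constants:
    # replaces parser.add_argument("--entity", "-e", type=str, default=None);
    # the field name wandb_entity is hypothetical
    wandb_entity: Optional[str] = None


def get_options(constants: Constants = Constants()) -> Constants:
    # with options on the config dataclass, callers override fields
    # directly instead of parsing CLI flags
    return constants
```

Callers would then write `get_options(Constants(wandb_entity="my-team"))` rather than passing `--entity` on the command line.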

# initialize distributed manager
DistributedManager.initialize()
dist = DistributedManager()

# initialize loggers
if wandb:
Collaborator:
Is this if statement necessary? This can be done by changing the mode argument in initialize_wandb
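The reviewer's point: `wandb.init` (and wrappers such as `initialize_wandb`) accept a `mode` argument, and `mode="disabled"` turns every subsequent logging call into a no-op, so the surrounding `if wandb:` guard becomes unnecessary. A sketch of the mode selection (the helper name and the commented usage line are hypothetical):

```python
def select_wandb_mode(use_wandb: bool) -> str:
    # "disabled" makes wandb initialization and all later log calls no-ops,
    # so no if-statement around the initialization is needed
    return "online" if use_wandb else "disabled"


# hypothetical usage:
# initialize_wandb(project="meshgraphnet", mode=select_wandb_mode(C.wandb))
```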


if __name__ == "__main__":
    @torch.no_grad()
    def validation(self):
Collaborator:
This is very similar to the predict method in `inference.py`. Can we make the prediction code more modular to avoid code duplication?
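One way to address the duplication is to extract the prediction step into a shared helper that both `validation` and `inference.py` call. A structural sketch with plain callables standing in for the model and graphs (all names here are hypothetical; in the real script the helper would run under `torch.no_grad()`):

```python
def predict(model, graph):
    # shared single-step prediction used by both validation and inference;
    # in the real script this would be wrapped in torch.no_grad()
    return model(graph)


class Trainer:
    def __init__(self, model):
        self.model = model

    def validation(self, graphs):
        # the validation loop reuses the shared predict helper instead of
        # re-implementing the prediction logic from inference.py
        return [predict(self.model, g) for g in graphs]
```

`inference.py` would then call `predict(model, graph)` directly, so the prediction logic lives in exactly one place.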

# Train the model
tmp_start = time.time()
loss_train_agg = 0
for graph in tqdm(trainer.dataloader):
Collaborator:
Does tqdm play nicely in multi-GPU runs?
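tqdm itself is safe under multi-process launches, but every rank renders its own progress bar, which garbles the console. The usual fix is tqdm's `disable` argument keyed on the rank; here a plain integer stands in for `DistributedManager().rank`:

```python
from tqdm import tqdm

rank = 0  # stand-in for DistributedManager().rank

results = []
for graph in tqdm(range(3), disable=(rank != 0)):
    # only rank 0 renders a progress bar; other ranks iterate silently
    results.append(graph)
```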

Labels: 2 - In Progress (Currently a work in progress) · enhancement (New feature or request) · external (Issues/PR filed by people outside the team)
3 participants