Commit 8aa52ad
DPS and SDA guidance for diffusion (#1381)
* utils tests are passing now
* Cleaning up distributed tests
* Patching tests working again in nn
* Fix sdf test
* Fix zenith angle tests
* Some organization of tests. Checkpoints is moved into utils.
* Remove launch.utils and launch.config. Checkpointing is moved to
phsyicsnemo.utils, launch.config is just gone. It was empty.
* Most nn tests are passing
* Further cleanup. Getting there!
* Remove constants file
* Add import linting to pre-commit.
* Refactor (#1208)
* Move filesystems and version_check to core
* Fix version check tests
* Reorganize distributed, domain_parallel, and begin nn / utils cleanup.
* Move modules and meta to core. Move registry to core.
No tests fixed yet.
* Add missing init files
* Update build system and specify some deps.
* Reorganize tests.
* Update init files
* Clean up neighbor tools.
* Update testing
* Fix compat tests
* Move core model tests to tests/core/
* Add import lint config
* Relocate layers
* Move graphcast utils into model directory
* Relocating util functionalities.
* Add FIGConvNet to crash example (#1207)
* Add FIGConvNet to crash example.
* Add FIGConvNet to crash example
* Update model config
* propose fix some typos (#1209)
Signed-off-by: John E <jeis4wpi@outlook.com>
Co-authored-by: Corey adams <6619961+coreyjadams@users.noreply.github.com>
* Further clean up and organize tests.
* utils tests are passing now
* Cleaning up distributed tests
* Patching tests working again in nn
* Fix sdf test
* Fix zenith angle tests
* Some organization of tests. Checkpoints is moved into utils.
* Remove launch.utils and launch.config. Checkpointing is moved to
phsyicsnemo.utils, launch.config is just gone. It was empty.
* Most nn tests are passing
* Further cleanup. Getting there!
* Remove constants file
* Add import linting to pre-commit.
---------
Signed-off-by: John E <jeis4wpi@outlook.com>
Co-authored-by: Alexey Kamenev <alex.kamenev@gmail.com>
Co-authored-by: John Eismeier <42679190+jeis4wpi@users.noreply.github.com>
* Unmigrate the insolation utils (#1211)
* unmigrate the insolation utils
* Revert test and compat map
* Update importlinter
* Move gnn layers and start to fix several model tests.
* AFNO is now passing.
* Rnn models passing.
* Fix improt
* Healpix tests are working
* Domino and unet working
* Refactor (#1216)
* Move filesystems and version_check to core
* Fix version check tests
* Reorganize distributed, domain_parallel, and begin nn / utils cleanup.
* Move modules and meta to core. Move registry to core.
No tests fixed yet.
* Add missing init files
* Update build system and specify some deps.
* Reorganize tests.
* Update init files
* Clean up neighbor tools.
* Update testing
* Fix compat tests
* Move core model tests to tests/core/
* Add import lint config
* Relocate layers
* Move graphcast utils into model directory
* Relocating util functionalities.
* Further clean up and organize tests.
* utils tests are passing now
* Cleaning up distributed tests
* Patching tests working again in nn
* Fix sdf test
* Fix zenith angle tests
* Some organization of tests. Checkpoints is moved into utils.
* Remove launch.utils and launch.config. Checkpointing is moved to
phsyicsnemo.utils, launch.config is just gone. It was empty.
* Most nn tests are passing
* Further cleanup. Getting there!
* Remove constants file
* Add import linting to pre-commit.
* Move gnn layers and start to fix several model tests.
* AFNO is now passing.
* Rnn models passing.
* Fix improt
* Healpix tests are working
* Domino and unet working
* Update activations path in dlwp tests (#1217)
* Update activations path in dlwp tests
* Update example paths
* Updating to address some test issues
* MGN tests passing again
* Most graphcast tests passing again
* Move nd conv layers.
* update fengwu and pangu
* Update sfno and pix2pix test
* update tests for figconvnet, swinrnn, superresnet
* updating more models to pass
* Update distributed tests, now passing.
* Domain parallel tests now passing.
* Fix active learning imports so tests pass in refactor
* Fix some metric imports
* Remove deploy package
* Remove unused test file
* unmigrate these files ... again?
* Update import linter.
* Refactor (#1224)
* Move filesystems and version_check to core
* Fix version check tests
* Reorganize distributed, domain_parallel, and begin nn / utils cleanup.
* Move modules and meta to core. Move registry to core.
No tests fixed yet.
* Add missing init files
* Update build system and specify some deps.
* Reorganize tests.
* Update init files
* Clean up neighbor tools.
* Update testing
* Fix compat tests
* Move core model tests to tests/core/
* Add import lint config
* Relocate layers
* Move graphcast utils into model directory
* Relocating util functionalities.
* Further clean up and organize tests.
* utils tests are passing now
* Cleaning up distributed tests
* Patching tests working again in nn
* Fix sdf test
* Fix zenith angle tests
* Some organization of tests. Checkpoints is moved into utils.
* Remove launch.utils and launch.config. Checkpointing is moved to
phsyicsnemo.utils, launch.config is just gone. It was empty.
* Most nn tests are passing
* Further cleanup. Getting there!
* Remove constants file
* Add import linting to pre-commit.
* Update crash readme (#1212)
* update license headers- second try
* update readme
* Bump multi-storage-client to v0.33.0 with rust client (#1156)
* Move gnn layers and start to fix several model tests.
* AFNO is now passing.
* Rnn models passing.
* Fix improt
* Healpix tests are working
* Domino and unet working
* Add jaxtyping to requirements.txt for crash sample (#1218)
* update license headers- second try
* Update requirements.txt
* Updating to address some test issues
* MGN tests passing again
* Most graphcast tests passing again
* Move nd conv layers.
* update fengwu and pangu
* Update sfno and pix2pix test
* update tests for figconvnet, swinrnn, superresnet
* updating more models to pass
* Update distributed tests, now passing.
* Domain parallel tests now passing.
* Fix active learning imports so tests pass in refactor
* Fix some metric imports
* Remove deploy package
* Remove unused test file
* unmigrate these files ... again?
* Update import linter.
---------
Co-authored-by: Mohammad Amin Nabian <m.a.nabiyan@gmail.com>
Co-authored-by: Yongming Ding <yongmingd@nvidia.com>
* Cleaning up diffusion models. Not quite done yet.
* Restore deleted files
* Updating more tests.
* Further updates to tests. Datapipes almost working.
* Refactor (#1231)
* Move filesystems and version_check to core
* Fix version check tests
* Reorganize distributed, domain_parallel, and begin nn / utils cleanup.
* Move modules and meta to core. Move registry to core.
No tests fixed yet.
* Add missing init files
* Update build system and specify some deps.
* Reorganize tests.
* Update init files
* Clean up neighbor tools.
* Update testing
* Fix compat tests
* Move core model tests to tests/core/
* Add import lint config
* Relocate layers
* Move graphcast utils into model directory
* Relocating util functionalities.
* Further clean up and organize tests.
* utils tests are passing now
* Cleaning up distributed tests
* Patching tests working again in nn
* Fix sdf test
* Fix zenith angle tests
* Some organization of tests. Checkpoints is moved into utils.
* Remove launch.utils and launch.config. Checkpointing is moved to
phsyicsnemo.utils, launch.config is just gone. It was empty.
* Most nn tests are passing
* Further cleanup. Getting there!
* Remove constants file
* Add import linting to pre-commit.
* Update crash readme (#1212)
* update license headers- second try
* update readme
* Bump multi-storage-client to v0.33.0 with rust client (#1156)
* Move gnn layers and start to fix several model tests.
* AFNO is now passing.
* Rnn models passing.
* Fix improt
* Healpix tests are working
* Domino and unet working
* Add jaxtyping to requirements.txt for crash sample (#1218)
* update license headers- second try
* Update requirements.txt
* Updating to address some test issues
* Replace 'License' link with 'Dev blog' link (#1215)
Co-authored-by: Corey adams <6619961+coreyjadams@users.noreply.github.com>
* MGN tests passing again
* Most graphcast tests passing again
* Move nd conv layers.
* update fengwu and pangu
* Update sfno and pix2pix test
* update tests for figconvnet, swinrnn, superresnet
* updating more models to pass
* Update distributed tests, now passing.
* Validation fu added to examples/structural_mechanics/crash/train.py (#1204)
* validation added: works for multi-node job.
* rename and rearrange validation function
* validate_every_n_epochs, save_ckpt_every_n_epochs added in config
* corrected bug (args of model) in inference
* args in validation code updated
* val path added and args name changed
* validation split added -> write_vtp=False
* fixed inference bug
* bug fix: write_vtp
* Domain parallel tests now passing.
* Fix active learning imports so tests pass in refactor
* Fix some metric imports
* Remove deploy package
* Remove unused test file
* unmigrate these files ... again?
* Update import linter.
* Add saikrishnanc-nv to github actors (#1225)
* Integrate Curator instructions to the Crash example (#1213)
* Integrate Curator instructions
* Update docs
* Formatting changes
* Adding code of conduct (#1214)
* Adding code of conduct
Adopting the code of conduct from the https://www.contributor-covenant.org/
* Update CODE_OF_CONDUCT.MD
Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>
* Create .markdownlintignore
* Revise README for PhysicsNeMo resources and guidance
Updated the 'Getting Started' section and added new resources for learning AI Physics.
* Update README.md
---------
Co-authored-by: Mohammad Amin Nabian <m.a.nabiyan@gmail.com>
Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>
Co-authored-by: Corey adams <6619961+coreyjadams@users.noreply.github.com>
* Cleaning up diffusion models. Not quite done yet.
* Restore deleted files
* Updating more tests.
* Further updates to tests. Datapipes almost working.
---------
Co-authored-by: Mohammad Amin Nabian <m.a.nabiyan@gmail.com>
Co-authored-by: Yongming Ding <yongmingd@nvidia.com>
Co-authored-by: ram-cherukuri <104155145+ram-cherukuri@users.noreply.github.com>
Co-authored-by: Deepak Akhare <dakhare@nvidia.com>
Co-authored-by: Sai Krishnan Chandrasekar <157182662+saikrishnanc-nv@users.noreply.github.com>
Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>
* update import paths
* Starting to clean up dependency tree.
* Refactor (#1233)
* Move filesystems and version_check to core
* Fix version check tests
* Reorganize distributed, domain_parallel, and begin nn / utils cleanup.
* Move modules and meta to core. Move registry to core.
No tests fixed yet.
* Add missing init files
* Update build system and specify some deps.
* Reorganize tests.
* Update init files
* Clean up neighbor tools.
* Update testing
* Fix compat tests
* Move core model tests to tests/core/
* Add import lint config
* Relocate layers
* Move graphcast utils into model directory
* Relocating util functionalities.
* Further clean up and organize tests.
* utils tests are passing now
* Cleaning up distributed tests
* Patching tests working again in nn
* Fix sdf test
* Fix zenith angle tests
* Some organization of tests. Checkpoints is moved into utils.
* Remove launch.utils and launch.config. Checkpointing is moved to
phsyicsnemo.utils, launch.config is just gone. It was empty.
* Most nn tests are passing
* Further cleanup. Getting there!
* Remove constants file
* Add import linting to pre-commit.
* Update crash readme (#1212)
* update license headers- second try
* update readme
* Bump multi-storage-client to v0.33.0 with rust client (#1156)
* Move gnn layers and start to fix several model tests.
* AFNO is now passing.
* Rnn models passing.
* Fix improt
* Healpix tests are working
* Domino and unet working
* Add jaxtyping to requirements.txt for crash sample (#1218)
* update license headers- second try
* Update requirements.txt
* Updating to address some test issues
* Replace 'License' link with 'Dev blog' link (#1215)
Co-authored-by: Corey adams <6619961+coreyjadams@users.noreply.github.com>
* MGN tests passing again
* Most graphcast tests passing again
* Move nd conv layers.
* update fengwu and pangu
* Update sfno and pix2pix test
* update tests for figconvnet, swinrnn, superresnet
* updating more models to pass
* Update distributed tests, now passing.
* Validation fu added to examples/structural_mechanics/crash/train.py (#1204)
* validation added: works for multi-node job.
* rename and rearrange validation function
* validate_every_n_epochs, save_ckpt_every_n_epochs added in config
* corrected bug (args of model) in inference
* args in validation code updated
* val path added and args name changed
* validation split added -> write_vtp=False
* fixed inference bug
* bug fix: write_vtp
* Domain parallel tests now passing.
* Fix active learning imports so tests pass in refactor
* Fix some metric imports
* Remove deploy package
* Remove unused test file
* unmigrate these files ... again?
* Update import linter.
* Add saikrishnanc-nv to github actors (#1225)
* Integrate Curator instructions to the Crash example (#1213)
* Integrate Curator instructions
* Update docs
* Formatting changes
* Adding code of conduct (#1214)
* Adding code of conduct
Adopting the code of conduct from the https://www.contributor-covenant.org/
* Update CODE_OF_CONDUCT.MD
Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>
* Create .markdownlintignore
* Revise README for PhysicsNeMo resources and guidance
Updated the 'Getting Started' section and added new resources for learning AI Physics.
* Update README.md
---------
Co-authored-by: Mohammad Amin Nabian <m.a.nabiyan@gmail.com>
Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>
Co-authored-by: Corey adams <6619961+coreyjadams@users.noreply.github.com>
* Cleaning up diffusion models. Not quite done yet.
* Restore deleted files
* Updating more tests.
* Fixed minor bug in shape validation in SongUNet (#1230)
Signed-off-by: Charlelie Laurent <claurent@nvidia.com>
* Add Zarr reader for Crash (#1228)
* Add Zarr reader for Crash
* Update README
* Update validation logic of point data in Zarr reader
Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>
* Update examples/structural_mechanics/crash/zarr_reader.py
Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>
* Add a test for 2D feature arrays
* Update examples/structural_mechanics/crash/zarr_reader.py
Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>
---------
Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>
* Further updates to tests. Datapipes almost working.
* update import paths
* Starting to clean up dependency tree.
---------
Signed-off-by: Charlelie Laurent <claurent@nvidia.com>
Co-authored-by: Mohammad Amin Nabian <m.a.nabiyan@gmail.com>
Co-authored-by: Yongming Ding <yongmingd@nvidia.com>
Co-authored-by: ram-cherukuri <104155145+ram-cherukuri@users.noreply.github.com>
Co-authored-by: Deepak Akhare <dakhare@nvidia.com>
Co-authored-by: Sai Krishnan Chandrasekar <157182662+saikrishnanc-nv@users.noreply.github.com>
Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>
Co-authored-by: Charlelie Laurent <84199758+CharlelieLrt@users.noreply.github.com>
* Added coding standards for model implementations as a custom context for greptile (#1219)
* Added initial set of coding standards for model implementations
Signed-off-by: Charlelie Laurent <claurent@nvidia.com>
* Fixed typos + review comments + added details
Signed-off-by: Charlelie Laurent <claurent@nvidia.com>
* Added more rules for models
Signed-off-by: Charlelie Laurent <claurent@nvidia.com>
* Added model rules to PR checklist
Signed-off-by: Charlelie Laurent <claurent@nvidia.com>
* Added cusror rules for models
Signed-off-by: Charlelie Laurent <claurent@nvidia.com>
* Linked the wiki page to the PR template
Signed-off-by: Charlelie Laurent <claurent@nvidia.com>
* Fixed typo in PR checklist
Signed-off-by: Charlelie Laurent <claurent@nvidia.com>
---------
Signed-off-by: Charlelie Laurent <claurent@nvidia.com>
* Fixing and adjusting a broad suite of tests.
* Update test/domain_parallel/conftest.py
Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>
* Minor fix
* Refactor (#1234)
* Move filesystems and version_check to core
* Fix version check tests
* Reorganize distributed, domain_parallel, and begin nn / utils cleanup.
* Move modules and meta to core. Move registry to core.
No tests fixed yet.
* Add missing init files
* Update build system and specify some deps.
* Reorganize tests.
* Update init files
* Clean up neighbor tools.
* Update testing
* Fix compat tests
* Move core model tests to tests/core/
* Add import lint config
* Relocate layers
* Move graphcast utils into model directory
* Relocating util functionalities.
* Further clean up and organize tests.
* utils tests are passing now
* Cleaning up distributed tests
* Patching tests working again in nn
* Fix sdf test
* Fix zenith angle tests
* Some organization of tests. Checkpoints is moved into utils.
* Remove launch.utils and launch.config. Checkpointing is moved to
phsyicsnemo.utils, launch.config is just gone. It was empty.
* Most nn tests are passing
* Further cleanup. Getting there!
* Remove constants file
* Add import linting to pre-commit.
* Update crash readme (#1212)
* update license headers- second try
* update readme
* Bump multi-storage-client to v0.33.0 with rust client (#1156)
* Move gnn layers and start to fix several model tests.
* AFNO is now passing.
* Rnn models passing.
* Fix improt
* Healpix tests are working
* Domino and unet working
* Add jaxtyping to requirements.txt for crash sample (#1218)
* update license headers- second try
* Update requirements.txt
* Updating to address some test issues
* Replace 'License' link with 'Dev blog' link (#1215)
Co-authored-by: Corey adams <6619961+coreyjadams@users.noreply.github.com>
* MGN tests passing again
* Most graphcast tests passing again
* Move nd conv layers.
* update fengwu and pangu
* Update sfno and pix2pix test
* update tests for figconvnet, swinrnn, superresnet
* updating more models to pass
* Update distributed tests, now passing.
* Validation fu added to examples/structural_mechanics/crash/train.py (#1204)
* validation added: works for multi-node job.
* rename and rearrange validation function
* validate_every_n_epochs, save_ckpt_every_n_epochs added in config
* corrected bug (args of model) in inference
* args in validation code updated
* val path added and args name changed
* validation split added -> write_vtp=False
* fixed inference bug
* bug fix: write_vtp
* Domain parallel tests now passing.
* Fix active learning imports so tests pass in refactor
* Fix some metric imports
* Remove deploy package
* Remove unused test file
* unmigrate these files ... again?
* Update import linter.
* Add saikrishnanc-nv to github actors (#1225)
* Integrate Curator instructions to the Crash example (#1213)
* Integrate Curator instructions
* Update docs
* Formatting changes
* Adding code of conduct (#1214)
* Adding code of conduct
Adopting the code of conduct from the https://www.contributor-covenant.org/
* Update CODE_OF_CONDUCT.MD
Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>
* Create .markdownlintignore
* Revise README for PhysicsNeMo resources and guidance
Updated the 'Getting Started' section and added new resources for learning AI Physics.
* Update README.md
---------
Co-authored-by: Mohammad Amin Nabian <m.a.nabiyan@gmail.com>
Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>
Co-authored-by: Corey adams <6619961+coreyjadams@users.noreply.github.com>
* Cleaning up diffusion models. Not quite done yet.
* Restore deleted files
* Updating more tests.
* Fixed minor bug in shape validation in SongUNet (#1230)
Signed-off-by: Charlelie Laurent <claurent@nvidia.com>
* Add Zarr reader for Crash (#1228)
* Add Zarr reader for Crash
* Update README
* Update validation logic of point data in Zarr reader
Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>
* Update examples/structural_mechanics/crash/zarr_reader.py
Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>
* Add a test for 2D feature arrays
* Update examples/structural_mechanics/crash/zarr_reader.py
Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>
---------
Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>
* Further updates to tests. Datapipes almost working.
* update import paths
* Starting to clean up dependency tree.
* Add AR RT and OT schemes to Crash FIGConvNet (#1232)
* Add AR and OT schemes for FIGConvNet
* Add tests
* Soothe the linter
* Fix the tests
* Fixing and adjusting a broad suite of tests.
* Update test/domain_parallel/conftest.py
Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>
* Minor fix
---------
Signed-off-by: Charlelie Laurent <claurent@nvidia.com>
Co-authored-by: Mohammad Amin Nabian <m.a.nabiyan@gmail.com>
Co-authored-by: Yongming Ding <yongmingd@nvidia.com>
Co-authored-by: ram-cherukuri <104155145+ram-cherukuri@users.noreply.github.com>
Co-authored-by: Deepak Akhare <dakhare@nvidia.com>
Co-authored-by: Sai Krishnan Chandrasekar <157182662+saikrishnanc-nv@users.noreply.github.com>
Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>
Co-authored-by: Charlelie Laurent <84199758+CharlelieLrt@users.noreply.github.com>
Co-authored-by: Alexey Kamenev <alex.kamenev@gmail.com>
* Not seeing any errors in testing ...
* Breakdown of rules into smaller rules (#1236)
* Breakdown of rules into smaller rules
Signed-off-by: Charlelie Laurent <claurent@nvidia.com>
* Fix mismatches in rule IDs referenced in rule text
Signed-off-by: Charlelie Laurent <claurent@nvidia.com>
---------
Signed-off-by: Charlelie Laurent <claurent@nvidia.com>
* Refactor (#1240)
* Move filesystems and version_check to core
* Fix version check tests
* Reorganize distributed, domain_parallel, and begin nn / utils cleanup.
* Move modules and meta to core. Move registry to core.
No tests fixed yet.
* Add missing init files
* Update build system and specify some deps.
* Reorganize tests.
* Update init files
* Clean up neighbor tools.
* Update testing
* Fix compat tests
* Move core model tests to tests/core/
* Add import lint config
* Relocate layers
* Move graphcast utils into model directory
* Relocating util functionalities.
* Further clean up and organize tests.
* utils tests are passing now
* Cleaning up distributed tests
* Patching tests working again in nn
* Fix sdf test
* Fix zenith angle tests
* Some organization of tests. Checkpoints is moved into utils.
* Remove launch.utils and launch.config. Checkpointing is moved to
phsyicsnemo.utils, launch.config is just gone. It was empty.
* Most nn tests are passing
* Further cleanup. Getting there!
* Remove constants file
* Add import linting to pre-commit.
* Move gnn layers and start to fix several model tests.
* AFNO is now passing.
* Rnn models passing.
* Fix improt
* Healpix tests are working
* Domino and unet working
* Updating to address some test issues
* MGN tests passing again
* Most graphcast tests passing again
* Move nd conv layers.
* update fengwu and pangu
* Update sfno and pix2pix test
* update tests for figconvnet, swinrnn, superresnet
* updating more models to pass
* Update distributed tests, now passing.
* Domain parallel tests now passing.
* Fix active learning imports so tests pass in refactor
* Fix some metric imports
* Remove deploy package
* Remove unused test file
* unmigrate these files ... again?
* Update import linter.
* Cleaning up diffusion models. Not quite done yet.
* Restore deleted files
* Updating more tests.
* Further updates to tests. Datapipes almost working.
* update import paths
* Starting to clean up dependency tree.
* Fixing and adjusting a broad suite of tests.
* Update test/domain_parallel/conftest.py
Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>
* Minor fix
* Not seeing any errors in testing ...
* Formatting active learning module docstrings (#1238)
* docs: fixing Protocol class reference formatting
Signed-off-by: Kelvin Lee <kinlongkelvi@nvidia.com>
* docs: removing mermaid diagram from protocols
Signed-off-by: Kelvin Lee <kinlongkelvi@nvidia.com>
* docs: adding active learning index
* docs: revising docstrings for sphinx formatting
* docs: fix placeholder URL for active learning main docs
---------
Signed-off-by: Kelvin Lee <kinlongkelvi@nvidia.com>
---------
Signed-off-by: Kelvin Lee <kinlongkelvi@nvidia.com>
Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>
Co-authored-by: Kelvin Lee <kin.long.kelvin.lee@gmail.com>
* Refactor (#1247)
* Move filesystems and version_check to core
* Fix version check tests
* Reorganize distributed, domain_parallel, and begin nn / utils cleanup.
* Move modules and meta to core. Move registry to core.
No tests fixed yet.
* Add missing init files
* Update build system and specify some deps.
* Reorganize tests.
* Update init files
* Clean up neighbor tools.
* Update testing
* Fix compat tests
* Move core model tests to tests/core/
* Add import lint config
* Relocate layers
* Move graphcast utils into model directory
* Relocating util functionalities.
* Further clean up and organize tests.
* utils tests are passing now
* Cleaning up distributed tests
* Patching tests working again in nn
* Fix sdf test
* Fix zenith angle tests
* Some organization of tests. Checkpoints is moved into utils.
* Remove launch.utils and launch.config. Checkpointing is moved to
phsyicsnemo.utils, launch.config is just gone. It was empty.
* Most nn tests are passing
* Further cleanup. Getting there!
* Remove constants file
* Add import linting to pre-commit.
* Move gnn layers and start to fix several model tests.
* AFNO is now passing.
* Rnn models passing.
* Fix improt
* Healpix tests are working
* Domino and unet working
* Updating to address some test issues
* MGN tests passing again
* Most graphcast tests passing again
* Move nd conv layers.
* update fengwu and pangu
* Update sfno and pix2pix test
* update tests for figconvnet, swinrnn, superresnet
* updating more models to pass
* Update distributed tests, now passing.
* Domain parallel tests now passing.
* Fix active learning imports so tests pass in refactor
* Fix some metric imports
* Remove deploy package
* Remove unused test file
* unmigrate these files ... again?
* Update import linter.
* Cleaning up diffusion models. Not quite done yet.
* Restore deleted files
* Updating more tests.
* Further updates to tests. Datapipes almost working.
* update import paths
* Starting to clean up dependency tree.
* Fixing and adjusting a broad suite of tests.
* Update test/domain_parallel/conftest.py
Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>
* Minor fix
* Not seeing any errors in testing ...
* A new X-MeshGraphNet example for reservoir simulation. (#1186)
* X-MGN for reservoir simulation
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* installation bug fix
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* well object docstring fix
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* more well object docstring fix
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve path_utils
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* fix while space in config
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* fix version inconsistency in requirement.txt
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* add versions for some libs in requirement.txt
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve exception handling in mldlow_utils
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve mldlow_utils
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve datetiem in mlflow_utils
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve exception handling in inference
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve inference
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve ecl_reader
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* formatting
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve preprocessor
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve preprocessor loop
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* fmt
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* fmt
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* fmt
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* fmt
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* grad accum bug fix
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* total loss bug fix
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* added some safe guard for connection indexing
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* fmt
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* bug fix
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* bug fix
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* cleanup
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* fmt
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* update ecl_reader
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* cleanup utils
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* cleanup
* cleanup
* update configs
* Update README.md
style guide rule changes
* Update README.md
* fmt
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve docstring fmt
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* update license yr
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* cleanup well
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* cleanup preproc fmt
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* cleanup preproc fmt
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* cimprove infrence fmt
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve datetime
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve readme fmt
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve readme
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve train.py fmt
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve readme fmt
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve requirement
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* ilcense header
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve ecl reader logging
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* cleanup
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* license header
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve graph builder (parallel) + added results to readme
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* delete some unsed files
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* address PR comments
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve inference grdecl header
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve readme
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve readme
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* support time series
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* update config
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* minor update
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve graph builder
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* update ecl_reader logging
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* replace pickle with json
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* add license headers
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* remove unused png files
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* remove unsed import
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* remove emojis
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* replace print with logger
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* update docstring
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* update readme
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* minor updates
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* update readme
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* update header
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
---------
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
Co-authored-by: megnvidia <mmiranda@nvidia.com>
* Add knn to autodoc table. (#1244)
---------
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>
Co-authored-by: tonishi-nv <tonishi@nvidia.com>
Co-authored-by: megnvidia <mmiranda@nvidia.com>
* Enable import linting on internal imports.
* Remove ensure_available function, it's confusing
* Add logging imports to utils, and fix imports in examples.
* Update imports in minimal examples
* Update structural mechanics examples
* Update import paths: reservoir_sim
* Update import paths: additive manufacturing
* Update import paths: topodiff
* Update import paths: weather part 1
* Update import paths: weather part 2
* Update import paths: molecular dynamics
* Update import paths: geophysics
* Update import paths: cfd + external_aero 1
* Update import paths: cfd + external_aero 2
* Remove more DGL examples
* Remove more DGL examples
* cfd examples 3
* Last batch of example import fixes!
* Enforce and protect external deps in utils.
* Remove DGL. :party:
* Don't force models yet
* Refactor (#1249)
* Move filesystems and version_check to core
* Fix version check tests
* Reorganize distributed, domain_parallel, and begin nn / utils cleanup.
* Move modules and meta to core. Move registry to core.
No tests fixed yet.
* Add missing init files
* Update build system and specify some deps.
* Reorganize tests.
* Update init files
* Clean up neighbor tools.
* Update testing
* Fix compat tests
* Move core model tests to tests/core/
* Add import lint config
* Relocate layers
* Move graphcast utils into model directory
* Relocating util functionalities.
* Further clean up and organize tests.
* utils tests are passing now
* Cleaning up distributed tests
* Patching tests working again in nn
* Fix sdf test
* Fix zenith angle tests
* Some organization of tests. Checkpoints is moved into utils.
* Remove launch.utils and launch.config. Checkpointing is moved to
phsyicsnemo.utils, launch.config is just gone. It was empty.
* Most nn tests are passing
* Further cleanup. Getting there!
* Remove constants file
* Add import linting to pre-commit.
* Move gnn layers and start to fix several model tests.
* AFNO is now passing.
* Rnn models passing.
* Fix improt
* Healpix tests are working
* Domino and unet working
* Updating to address some test issues
* MGN tests passing again
* Most graphcast tests passing again
* Move nd conv layers.
* update fengwu and pangu
* Update sfno and pix2pix test
* update tests for figconvnet, swinrnn, superresnet
* updating more models to pass
* Update distributed tests, now passing.
* Domain parallel tests now passing.
* Fix active learning imports so tests pass in refactor
* Fix some metric imports
* Remove deploy package
* Remove unused test file
* unmigrate these files ... again?
* Update import linter.
* Cleaning up diffusion models. Not quite done yet.
* Restore deleted files
* Updating more tests.
* Further updates to tests. Datapipes almost working.
* update import paths
* Starting to clean up dependency tree.
* Fixing and adjusting a broad suite of tests.
* Update test/domain_parallel/conftest.py
Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>
* Minor fix
* Not seeing any errors in testing ...
* A new X-MeshGraphNet example for reservoir simulation. (#1186)
* X-MGN for reservoir simulation
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* installation bug fix
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* well object docstring fix
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* more well object docstring fix
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve path_utils
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* fix while space in config
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* fix version inconsistency in requirement.txt
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* add versions for some libs in requirement.txt
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve exception handling in mldlow_utils
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve mldlow_utils
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve datetiem in mlflow_utils
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve exception handling in inference
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve inference
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve ecl_reader
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* formatting
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve preprocessor
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve preprocessor loop
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* fmt
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* fmt
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* fmt
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* fmt
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* grad accum bug fix
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* total loss bug fix
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* added some safe guard for connection indexing
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* fmt
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* bug fix
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* bug fix
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* cleanup
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* fmt
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* update ecl_reader
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* cleanup utils
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* cleanup
* cleanup
* update configs
* Update README.md
style guide rule changes
* Update README.md
* fmt
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve docstring fmt
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* update license yr
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* cleanup well
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* cleanup preproc fmt
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* cleanup preproc fmt
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* cimprove infrence fmt
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve datetime
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve readme fmt
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve readme
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve train.py fmt
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve readme fmt
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve requirement
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* ilcense header
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve ecl reader logging
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* cleanup
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* license header
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve graph builder (parallel) + added results to readme
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* delete some unsed files
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* address PR comments
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve inference grdecl header
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve readme
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve readme
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* support time series
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* update config
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* minor update
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve graph builder
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* update ecl_reader logging
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* replace pickle with json
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* add license headers
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* remove unused png files
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* remove unsed import
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* remove emojis
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* replace print with logger
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* update docstring
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* update readme
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* minor updates
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* update readme
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* update header
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
---------
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
Co-authored-by: megnvidia <mmiranda@nvidia.com>
* Add knn to autodoc table. (#1244)
* Enable import linting on internal imports.
* Remove ensure_available function, it's confusing
* Add logging imports to utils, and fix imports in examples.
* Update imports in minimal examples
* Update structural mechanics examples
* Update import paths: reservoir_sim
* Update import paths: additive manufacturing
* Update import paths: topodiff
* Update import paths: weather part 1
* Update import paths: weather part 2
* Update import paths: molecular dynamics
* Update import paths: geophysics
* Update import paths: cfd + external_aero 1
* Update import paths: cfd + external_aero 2
* Remove more DGL examples
* Remove more DGL examples
* cfd examples 3
* Last batch of example import fixes!
* Enforce and protect external deps in utils.
* Remove DGL. :party:
* Don't force models yet
---------
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>
Co-authored-by: tonishi-nv <tonishi@nvidia.com>
Co-authored-by: megnvidia <mmiranda@nvidia.com>
* Automated model registry (#1252)
* Deleted RegistreableModule
Signed-off-by: Charlelie Laurent <claurent@nvidia.com>
* Removed 'PhysicsNeMo' suffix in Module.from_torch method
Signed-off-by: Charlelie Laurent <claurent@nvidia.com>
* Implemented automatic registration for Module subclasses
Signed-off-by: Charlelie Laurent <claurent@nvidia.com>
* Fixed unused name
Signed-off-by: Charlelie Laurent <claurent@nvidia.com>
---------
Signed-off-by: Charlelie Laurent <claurent@nvidia.com>
* Metadata name deprecation (#1257)
* Initiated deprecation of field 'name' in ModelMetaData
Signed-off-by: Charlelie Laurent <claurent@nvidia.com>
* Removed all occurences of 'name' field in ModelMetaData
Signed-off-by: Charlelie Laurent <claurent@nvidia.com>
---------
Signed-off-by: Charlelie Laurent <claurent@nvidia.com>
* Refactor (#1258)
* Move filesystems and version_check to core
* Fix version check tests
* Reorganize distributed, domain_parallel, and begin nn / utils cleanup.
* Move modules and meta to core. Move registry to core.
No tests fixed yet.
* Add missing init files
* Update build system and specify some deps.
* Reorganize tests.
* Update init files
* Clean up neighbor tools.
* Update testing
* Fix compat tests
* Move core model tests to tests/core/
* Add import lint config
* Relocate layers
* Move graphcast utils into model directory
* Relocating util functionalities.
* Further clean up and organize tests.
* utils tests are passing now
* Cleaning up distributed tests
* Patching tests working again in nn
* Fix sdf test
* Fix zenith angle tests
* Some organization of tests. Checkpoints is moved into utils.
* Remove launch.utils and launch.config. Checkpointing is moved to
phsyicsnemo.utils, launch.config is just gone. It was empty.
* Most nn tests are passing
* Further cleanup. Getting there!
* Remove constants file
* Add import linting to pre-commit.
* Move gnn layers and start to fix several model tests.
* AFNO is now passing.
* Rnn models passing.
* Fix improt
* Healpix tests are working
* Domino and unet working
* Updating to address some test issues
* MGN tests passing again
* Most graphcast tests passing again
* Move nd conv layers.
* update fengwu and pangu
* Update sfno and pix2pix test
* update tests for figconvnet, swinrnn, superresnet
* updating more models to pass
* Update distributed tests, now passing.
* Domain parallel tests now passing.
* Fix active learning imports so tests pass in refactor
* Fix some metric imports
* Remove deploy package
* Remove unused test file
* unmigrate these files ... again?
* Update import linter.
* Cleaning up diffusion models. Not quite done yet.
* Restore deleted files
* Updating more tests.
* Further updates to tests. Datapipes almost working.
* update import paths
* Starting to clean up dependency tree.
* Fixing and adjusting a broad suite of tests.
* Update test/domain_parallel/conftest.py
Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>
* Minor fix
* Not seeing any errors in testing ...
* A new X-MeshGraphNet example for reservoir simulation. (#1186)
* X-MGN for reservoir simulation
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* installation bug fix
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* well object docstring fix
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* more well object docstring fix
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve path_utils
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* fix while space in config
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* fix version inconsistency in requirement.txt
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* add versions for some libs in requirement.txt
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve exception handling in mldlow_utils
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve mldlow_utils
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve datetiem in mlflow_utils
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve exception handling in inference
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve inference
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve ecl_reader
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* formatting
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve preprocessor
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve preprocessor loop
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* fmt
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* fmt
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* fmt
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* fmt
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* grad accum bug fix
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* total loss bug fix
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* added some safe guard for connection indexing
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* fmt
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* bug fix
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* bug fix
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* cleanup
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* fmt
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* update ecl_reader
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* cleanup utils
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* cleanup
* cleanup
* update configs
* Update README.md
style guide rule changes
* Update README.md
* fmt
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve docstring fmt
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* update license yr
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* cleanup well
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* cleanup preproc fmt
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* cleanup preproc fmt
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* cimprove infrence fmt
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve datetime
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve readme fmt
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve readme
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve train.py fmt
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve readme fmt
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve requirement
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* ilcense header
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve ecl reader logging
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* cleanup
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* license header
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve graph builder (parallel) + added results to readme
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* delete some unsed files
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* address PR comments
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve inference grdecl header
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve readme
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve readme
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* support time series
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* update config
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* minor update
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* improve graph builder
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* update ecl_reader logging
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* replace pickle with json
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* add license headers
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* remove unused png files
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* remove unsed import
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* remove emojis
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* replace print with logger
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* update docstring
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* update readme
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* minor updates
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* update readme
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
* update header
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
---------
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
Co-authored-by: megnvidia <mmiranda@nvidia.com>
* Add knn to autodoc table. (#1244)
* Enable import linting on internal imports.
* Remove ensure_available function, it's confusing
* Add logging imports to utils, and fix imports in examples.
* Update imports in minimal examples
* Update structural mechanics examples
* Update import paths: reservoir_sim
* Update import paths: additive manufacturing
* Update import paths: topodiff
* Update import paths: weather part 1
* Update import paths: weather part 2
* Update import paths: molecular dynamics
* Update import paths: geophysics
* Update import paths: cfd + external_aero 1
* Update import paths: cfd + external_aero 2
* Remove more DGL examples
* Remove more DGL examples
* cfd examples 3
* Last batch of example import fixes!
* Enforce and protect external deps in utils.
* Remove DGL. :party:
* Don't force models yet
* Update version (#1193)
* Fix depenedncies to enable hello world (#1195)
* Remove zero-len arrays from test dataset (#1198)
* Merge updates to Gray Scott example (#1239)
* Remove pyevtk
* update dependency
* update dimensions
* ci issues
* Interpolation model example (#1149)
* Temporal interpolation training recipe
* Add README
* Docs changes based on comments
* Update docstrings and README
* Add temporal interpolation animation
* Add animation link
* Add shape check in loss
* Updates of configs + trainer
* Update config comments
* Update README.md
style guide edits
* Added wandb logging
Signed-off-by: Charlelie Laurent <claurent@nvidia.com>
* Reformated sections in docstring for GeometricL2Loss
Signed-off-by: Charlelie Laurent <claurent@nvidia.com>
* Update README and configs
* README changes + type hint fixes
* Update README.md
* Draft of validation script
* Update validation and README
* Fixed command in README.md for temporal_interpolation example
Signed-off-by: Charlelie Laurent <claurent@nvidia.com>
* Removed unused import in datapipe/climate_interp.py
Signed-off-by: Charlelie Laurent <claurent@nvidia.com>
* Updated license headers in temporal_interpolation example
Signed-off-by: Charlelie Laurent <claurent@nvidia.com>
* Renamed methods to avoid implicit shadowing in Trainer class
Signed-off-by: Charlelie Laurent <claurent@nvidia.com>
* Cosmetic changes in train.py and removed unused import in validate.py
Signed-off-by: Charlelie Laurent <claurent@nvidia.com>
* Added clamp in validate.py to make sure step does not go out of bounds
Signed-off-by: Charlelie Laurent <claurent@nvidia.com>
* Added the temporal_interpolation example to the docs + updated CHANGELOG.md
Signed-off-by: Charlelie Laurent <claurent@nvidia.com>
* Addressing remaining comments
* Merged two data source classes in climate_interp.py
Signed-off-by: Charlelie Laurent <claurent@nvidia.com>
---------
Signed-off-by: Charlelie Laurent <claurent@nvidia.com>
Co-authored-by: Charlelie Laurent <claurent@nvidia.com>
Co-authored-by: megnvidia <mmiranda@nvidia.com>
Co-authored-by: Charlelie Laurent <84199758+CharlelieLrt@users.noreply.github.com>
* update versions
---------
Signed-off-by: Tsubasa Onishi <tonishi@nvidia.com>
Signed-off-by: Charlelie Laurent <claurent@nvidia.com>
Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>
Co-authored-by: tonishi-nv <tonishi@nvidia.com>
Co-authored-by: megnvidia <mmiranda@nvidia.com>
Co-authored-by: Kaustubh Tangsali <71059996+ktangsali@users.noreply.github.com>
Co-authored-by: Jussi Leinonen <jleinonen@nvidia.com>
Co-authored-by: Charlelie Laurent <claurent@nvidia.com>
Co-authored-by: Charlelie Laurent <84199758+CharlelieLrt@users.noreply.github.com>
Co-authored-by: Kaustubh Tangsali <ktangsali@nvidia.com>
* Remove IPDB
* Few more dep fixes.
* Refactor (#1261)
* Move filesystems and version_check to core
* Fix version check tests
* Reorganize distributed, domain_parallel, and begin nn / utils cleanup.
* Move modules and meta to core. Move registry to core.
No tests fixed yet.
* Add missing init files
* Update build system and specify some deps.
* Reorganize tests.
* Update init files
* Clean up neighbor tools.
* Update testing
* Fix compat tests
* Move core model tests to tests/core/
* Add import lint config
* Relocate layers
* Move graphcast utils into model directory
* Relocating util functionalities.
* Further clean up and organize tests.
* utils tests are passing now
* Cleaning up distributed tests
* Patching tests working again in nn
* Fix sdf test
* Fix zenith angle tests
* Some organization of tests. Checkpoints is moved into utils.
* Remove launch.utils and launch.config. Checkpointing is moved to
phsyicsnemo.utils, launch.config is just gone. It was empty.
* Most nn tests are passing
* Further cleanup. Getting there!
* Remove constants file
* Add import linting to pre-commit.
* Move gnn layers and start to fix several model tests.
* AFNO is now passing.
* Rnn models passing.
* Fix improt
* Healpix tests are working
* Domino and unet working
* Updating to address some test issues
* MGN tests passing again
* Most graphcast tests passing again
* Move nd conv layers.
* update fengwu and pangu
* Update sfno and pix2pix test
* update tests for figconvnet, swinrnn, superresnet
* updating more models to pass
* Update distributed tests, now passing.
* Domain parallel tests now passing.
* Fix active learning imports so tests pass in refactor
* Fix some metric imports
* Remove deploy package
* Remove unused test file
* unmigrate these files ... again?
* Update import linter.
* Cleaning up diffusion models. Not quite done yet.
* Restore deleted files
* Updating more tests.
* Further updates to tests. Datapipes almost working.
* update import paths
* Starting to clean up dependency tree.
* Fixing and adjusting a broad suite of tests.
* Update test/domain_parallel/conftest.py
Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>
* Minor fix
* Not seeing any errors in testing ...
* Enable import linting on internal imports.
* Remove ensure_available function, it's confusing
* Add logging imports to utils, and fix imports in examples.
* Update imports in minimal examples
* Update structural mechanics examples
* Update import paths: reservoir_sim
* Update import paths: additive manufacturing
* Update import paths: topodiff
* Update import paths: weather part 1
* Update import paths: weather part 2
* Update import paths: molecular dynamics
* Update import paths: geophysics
* Update import paths: cfd + external_aero 1
* Update import paths: cfd + external_aero 2
* Remove more DGL examples
* Remove more DGL examples
* cfd examples 3
* Last batch of example import fixes!
* Enforce and protect external deps in utils.
* Remove DGL. :party:
* Don't force models yet
* Remove IPDB
* Few more dep fixes.
* Enhance checkpoint configuration for DLWP Healpix and GraphCast (#1253)
* feat(weather): Improve configuration for DLWP Healpix and GraphCast examples
- Added configurable checkpoint directory to DLWP Healpix config and training script.
- Implemented Trainer logic to use specific checkpoint directory.
- Updated utils.py to respect exact checkpoint path.
- Made Weights & Biases entity and project configurable in GraphCast example.
* fix(dlwp_healpix): remove deprecated configs
- Removed the deprecated `verbose` parameter from the `CosineAnnealingLR` configuration in DLWP HEALPix, which was causing a TypeError.
- Removed unused configs from examples/weather/dlwp_healpix/
* Transolver volume (#1242)
* Implement transolver ++ physics attention
* Enable ++ in Transolver.
* Fix temperature correction terms.
* Starting work adapting the domino datapipe techniques to transolver.
* Working towards transolver volume training by mergeing with domino dataset.
Surface dataloading is prototyped, not finished yet.
* Updating
* Remove printout
* Enable transolver for volumetric data
* Update transolver training script to support either surface or volume data.
Applied some cleanup to make the datapipe similar to domino, which
is a step towards unification.
* Updating datapipe
* Tweak transolver volume configs
* Add transolverX model
* Enable nearly-uniform sampling of very very large arrays
* limit benchmarking to train epoch, enable profiler in config
* Update volume config slightly
* Update training scripts to properly enable data preloading
* Working towards adding a muon optimzier in transolver
* Add peter's implementation of muon with a combined optimizer. switch to a flat LR.
* Add updated inference script that can also calculate drag and lift
* Add better docstrings for typhon
* Move typhon to experimental
* Move forwards docstring
* Adding typhon model and configs.
* Update readme.
* Update
* Remove extra model. Update recipes.
* Update cae_dataset.py
Implement abstract methods in base classes.
* Update Physics_Attention.py
Ensure plus parameter is passed to base class.
* Update test_mesh_datapipe.py
Update import path for mesh datapipe.
* Fix ruff issues
---------
Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>
Co-authored-by: Dileep Ranganathan <8152399+dran-dev@users.noreply.github.com>
* Add external import coding standards.
* Update external import standards.
* Ensure vtk functions are protected.
* Protect pyvista import
* Closing more import gaps
* Remove DGL from meshgraphkan
* All models now comply with external import linting.
* Remove DGL datapipes
* cae datapipes in compliance
* Update pyproject.toml
* Add version numbers to deps
* Refactor (#1261)
* Move filesystems and version_check to core
* Fix version check tests
* Reorganize distributed, domain_parallel, and begin nn / utils cleanup.
* Move modules and meta to core. Move registry to core.
No tests fixed yet.
* Add missing init files
* Update build system and specify some deps.
* Reorganize tests.
* Update init files
* Clean up neighbor tools.
* Update testing
* Fix compat tests
* Move core model tests to tests/core/
* Add import lint config
* Relocate layers
* Move graphcast utils into model directory
* Relocating util functionalities.
* Further clean up and organize tests.
* utils tests are passing now
* Cleaning up distributed tests
* Patching tests working again in nn
* Fix sdf test
* Fix zenith angle tests
* Some organization of tests. Checkpoints is moved into utils.
* Remove launch.utils and launch.config. Checkpointing is moved to
phsyicsnemo.utils, launch.config is just gone. It was empty.
* Most nn tests are passing
* Further cleanup. Getting there!
* Remove constants file
* Add import linting to pre-commit.
* Move gnn layers and start to fix several model tests.
* AFNO is now passing.
* Rnn models passing.
* Fix improt
* Healpix tests are working
* Domino and unet working
* Updating to address some test issues
* MGN tests passing again
* Most graphcast tests passing again
* Move nd conv layers.
* update fengwu and pangu
* Update sfno and pix2pix test
* update tests for figconvnet, swinrnn, superresnet
* updating more models to pass
* Update distributed tests, now passing.
* Domain parallel tests now passing.
* Fix active learning imports so tests pass in refactor
* Fix some metric imports
* Remove deploy package
* Remove unused test file
* unmigrate these files ... again?
* Update import linter.
* Cleaning up diffusion models. Not quite done yet.
* Restore deleted files
* Updating more tests.
* Further updates to tests. Datapipes almost working.
* update import paths
* Starting to clean up dependency tree.
* Fixing and adjusting a broad suite of tests.
* Update test/domain_parallel/conftest.py
Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>
* Minor fix
* Not seeing any errors in testing ...
* Enable import linting on internal imports.
* Remove ensure_available function, it's confusing
* Add logging imports to utils, and fix imports in examples.
* Update imports in minimal examples
* Update structural mechanics examples
* Update import paths: reservoir_sim
* Update import paths: additive manufacturing
* Update import paths: topodiff
* Update import paths: weather part 1
* Update import paths: weather part 2
* Update import paths: molecular dynamics
* Update import paths: geophysics
* Update import paths: cfd + external_aero 1
* Update import paths: cfd + external_aero 2
* Remove more DGL examples
* Remove more DGL examples
* cfd examples 3
* Last batch of example import fixes!
* Enforce and protect external deps in utils.
* Remove DGL. :party:
* Don't force models yet
* Remove IPDB
* Few more dep fixes.
* Enhance checkpoint configuration for DLWP Healpix and GraphCast (#1253)
* feat(weather): Improve configuration for DLWP Healpix and GraphCast examples
- Added configurable checkpoint directory to DLWP Healpix config and training script.
- Implemented Trainer logic to us…1 parent 04236a2 commit 8aa52ad
File tree
5 files changed
+1044
-30
lines changed- physicsnemo/diffusion
- denoisers
- guidance
- noise_schedulers
5 files changed
+1044
-30
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
99 | 99 | | |
100 | 100 | | |
101 | 101 | | |
102 | | - | |
| 102 | + | |
103 | 103 | | |
104 | 104 | | |
105 | 105 | | |
| |||
This file was deleted.
Lines changed: 13 additions & 8 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
14 | 14 | | |
15 | 15 | | |
16 | 16 | | |
17 | | - | |
| 17 | + | |
18 | 18 | | |
19 | | - | |
20 | | - | |
21 | | - | |
22 | | - | |
23 | | - | |
24 | | - | |
25 | | - | |
| 19 | + | |
| 20 | + | |
| 21 | + | |
| 22 | + | |
| 23 | + | |
26 | 24 | | |
| 25 | + | |
| 26 | + | |
| 27 | + | |
| 28 | + | |
| 29 | + | |
| 30 | + | |
| 31 | + | |
0 commit comments