PolymathicAI
diff --git a/‎.github/workflows/mypy-check.yml‎
Lines changed: 57 additions & 0 deletions b/‎.github/workflows/mypy-check.yml‎
Lines changed: 57 additions & 0 deletions
diff --git a/‎.github/workflows/tests.yaml‎
Lines changed: 35 additions & 0 deletions b/‎.github/workflows/tests.yaml‎
Lines changed: 35 additions & 0 deletions
diff --git a/‎.gitignore‎
Lines changed: 92 additions & 0 deletions b/‎.gitignore‎
Lines changed: 92 additions & 0 deletions
diff --git a/‎.pre-commit-config.yaml‎
Lines changed: 24 additions & 0 deletions b/‎.pre-commit-config.yaml‎
Lines changed: 24 additions & 0 deletions
diff --git a/‎LICENSE‎
Lines changed: 21 additions & 0 deletions b/‎LICENSE‎
Lines changed: 21 additions & 0 deletions
diff --git a/‎README.md‎
Lines changed: 128 additions & 0 deletions b/‎README.md‎
Lines changed: 128 additions & 0 deletions
diff --git a/‎assets/ArchitectureWIP.png‎
458 KB b/‎assets/ArchitectureWIP.png‎
458 KB
diff --git a/‎assets/the_well_logo.png‎
22.7 KB b/‎assets/the_well_logo.png‎
22.7 KB
diff --git a/‎assets/walrus_logo.png‎
1.25 MB b/‎assets/walrus_logo.png‎
1.25 MB
@@ -0,0 +1,57 @@
+name: MyPy Check
+
+on:
+  pull_request:
+jobs:
+  mypy-check:
+    runs-on: ubuntu-latest
+    strategy:
+      fail-fast: false
+      matrix:
+        python-version: ["3.12"]
+    env:
+      PY_COLORS: "1"
+    steps:
+      - uses: actions/checkout@v4
+      - uses: actions/setup-python@v5
+        with:
+          python-version: ${{ matrix.python-version }}
+          cache: "pip"
+      - name: Set up authentication for private repo
+        run: |
+          git config --global url."https://${{ secrets.POLYMATHIC_REPO_ACCESS }}@github.com/".insteadOf "https://github.com/"
+      - name: Install package
+        run: |
+          python -m pip install --upgrade pip
+          pip install --extra-index-url https://download.pytorch.org/whl/cpu ".[test]"
+          pip install mypy
+      - name: Run MyPy
+        id: mypy
+        run: |
+          mypy walrus tests --install-types --non-interactive &> mypy_output.txt || true
+      - name: Read MyPy Output
+        id: read-mypy
+        run: |
+          output=$(cat mypy_output.txt | grep -e "error:" -e "note:" -e "warning:" -e "Found")
+          echo "output<<EOF" >> $GITHUB_OUTPUT
+          echo "$output" >> $GITHUB_OUTPUT
+          echo "EOF" >> $GITHUB_OUTPUT
+        continue-on-error: true
+      - name: Comment PR
+        uses: thollander/actions-comment-pull-request@v2
+        with:
+          message: |
+            ## [Automatically-Generated MyPy Results]
+
+            <details>
+            <summary>Click to expand/collapse MyPy results</summary>
+
+            ```
+            ${{ steps.read-mypy.outputs.output }}
+            ```
+
+            </details>
+
+            _Generated by the file [`mypy-check.yml`](https://github.com/${{ github.repository }}/blob/${{ github.sha }}/.github/workflows/mypy-check.yml)._
+          comment_tag: mypy-results
+          GITHUB_TOKEN: ${{ secrets.POLYMATHIC_COMMENT_BOT }}
@@ -0,0 +1,35 @@
+name: Tests
+
+on:
+  pull_request:
+  push:
+    branches:
+      - main
+jobs:
+  build:
+    runs-on: ubuntu-latest
+    strategy:
+      fail-fast: false
+      matrix:
+        python-version: ["3.10", "3.12"]
+    env:
+      PY_COLORS: "1"
+    steps:
+      - uses: actions/checkout@v4
+      - uses: actions/setup-python@v5
+        with:
+          python-version: ${{ matrix.python-version }}
+          cache: "pip"
+      - name: Set up authentication for private repo
+        run: |
+          git config --global url."https://${{ secrets.POLYMATHIC_REPO_ACCESS }}@github.com/".insteadOf "https://github.com/"
+      - name: Install package
+        run: |
+          python -m pip install --upgrade pip
+          pip install --extra-index-url https://download.pytorch.org/whl/cpu ".[test]"
+      - name: Run linter
+        run: |
+          ruff check walrus tests
+          ruff check --select I walrus tests
+      - name: Run tests
+        run: python -m pytest tests
@@ -0,0 +1,92 @@
+# Ignore files generated by the build process
+build/
+dist/
+*.egg-info/
+
+# Ignore system and IDE files
+.DS_Store
+Thumbs.db
+.idea/
+
+#Ignoring the data
+datasets/active_matter/data/
+
+datasets/euler_quadrants/data/
+datasets/euler_quadrants/data_storage_non_benchmarked/
+
+datasets/pattern_formation/data/
+
+
+datasets/helmholtz_staircase/data/
+datasets/viscoelastic_instability/data/
+
+2D/neutron_star_disks/
+2D/planetswe/data/
+
+datasets/rayleigh_benard/data/
+
+datasets/shear_flow/data/
+datasets/supernova_explosion_128/data/
+datasets/supernova_explosion_64/data/
+datasets/turbulence_gravity_cooling/data/
+datasets/rayleigh_taylor_instability/data/
+datasets/turbulent_radiative_layer_3D/data/
+datasets/split_turbulent_radiative_layer_3D/
+datasets/turbulent_radiative_layer_2D/data/
+datasets/acoustic_scattering_discontinuous_2d/data/
+datasets/acoustic_scattering_inclusions_2d/data/
+datasets/acoustic_scattering_maze_2d/data/
+datasets/planetswe/data/
+datasets/post_neutron_star_merger/data/
+datasets/acoustic_scattering_discontinuous_2d/gif/
+datasets/acoustic_scattering_inclusions_2d/gif/
+datasets/acoustic_scattering_maze_2d/gif/
+2D/
+3D/
+3D/rayleigh_taylor_instability/data/
+the_well/benchmark/scripts_to_launch/
+the_well/benchmark/write_bash_script.ipynb
+the_well/benchmark/checkpoints/
+datasets/convective_envelope_rsg/data/
+datasets/MHD_64/data/
+datasets/MHD_256/data/
+datasets/convective_envelope_rsg/sim.mp4
+testing_before_adding/
+viz/
+venv_benchmark_well/
+# Ignore logs and temporary files
+*.log
+*.tmp
+*.pt
+*.gif
+
+# Ignore compiled binaries and libraries
+*.exe
+*.dll
+*.so
+
+# Ignore package manager directories
+node_modules/
+vendor/
+
+# Ignore environment-specific files
+.env
+.env.local
+.env.*.local
+
+# Ignore sensitive or private information
+secrets.txt
+credentials.json
+
+# Ignore backup files
+*.bak
+*.swp
+
+# Ignore generated files
+*.min.js
+*.min.css
+__pycache__
+
+# Ignore run generated output
+outputs/
+wandb/
@@ -0,0 +1,24 @@
+repos:
+- repo: https://github.com/pre-commit/pre-commit-hooks
+  rev: v5.0.0
+  hooks:
+    - id: check-merge-conflict
+    - id: end-of-file-fixer
+    - id: mixed-line-ending
+      args: [--fix=lf]
+    - id: trailing-whitespace
+- repo: https://github.com/astral-sh/ruff-pre-commit
+  # Ruff version.
+  rev: "v0.8.3"
+  hooks:
+    # Run the linter.
+    - id: ruff
+      args: [--fix]
+    # Sort imports
+    - id: ruff
+      args: [--select, I, --fix]
+- repo: https://github.com/pre-commit/mirrors-mypy
+  rev: "v1.13.0"  # Use the sha / tag you want to point at
+  hooks:
+    - id: mypy
+      args: [--install-types, --non-interactive]
@@ -0,0 +1,21 @@
+MIT License
+
+Copyright (c) 2024 Polymathic AI
+
+Permission is hereby granted, free of charge, to any person obtaining a copy
+of this software and associated documentation files (the "Software"), to deal
+in the Software without restriction, including without limitation the rights
+to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
+copies of the Software, and to permit persons to whom the Software is
+furnished to do so, subject to the following conditions:
+
+The above copyright notice and this permission notice shall be included in all
+copies or substantial portions of the Software.
+
+THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
+IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
+FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
+AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
+LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
+OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
+SOFTWARE.
@@ -0,0 +1,128 @@
+<!-- ![Walrus logo](assets/walrus_circle.png) -->
+<!-- <img src="assets/walrus_logo.png" width="360"> -->
+
+# Walrus: A Cross-domain Foundation Model for Continuum Dynamics
+<div align="center">
+
+[![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)
+[![PyTorch](https://img.shields.io/badge/PyTorch-≥2.4.0-ee4c2c.svg)](https://pytorch.org/)
+[![arXiv](https://img.shields.io/badge/arXiv-2511.15684-b31b1b.svg)](https://arxiv.org/abs/2511.15684)
+[![Model on HF](https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-Model-yellow)](https://huggingface.co/polymathic-ai/walrus)
+
+[Getting Started](#getting-started) • [Tutorials](#tutorials) • [Model Overview](#model-overview) 
+
+</div>
+
+---
+
+## Overview
+<div align="center">
+    <img src="assets/ArchitectureWIP.png" alt="Walrus schematic" width="600">
+</div>
+
+This repo is built for training and evaluating Walrus, a multi-domain foundation model for continuum dynamics trained primarily on fluid-like behaviors.
+Walrus was trained on 19 different physical scenarios spanning 63 physical variables in both 2 and 3D. Walrus utilizes new tools for adaptive computation and improved stability
+in order to achieve accurate long-term rollouts while co-adapting sampling and distribution to improve training throughput despite handling varying dimensions, resolutions, and
+aspect ratios.
+
+## Getting Started
+
+### Installation
+
+Clone the repository and install locally. Requirements are documented in the pyproject.toml file. 
+
+Most of the data used in experiments is from [the Well](https://github.com/PolymathicAI/the_well),
+so it may be easier to get started if you have the Well at least partially available. 
+
+```bash
+git clone git@github.com:PolymathicAI/walrus.git
+cd walrus
+pip install .
+```
+By default, this repository does not include all dependencies of non-Walrus models. To install optional dependencies,
+instead run:
+
+```bash
+pip install .[external_modes]
+```
+## Running Experiments
+
+This project is orchestrated using [Hydra](https://github.com/facebookresearch/hydra) to give users modular access over 
+various model components, datasets, and runtime options. All training was done in slurm environments. Example invocations
+for training, validating, and finetuning Walrus models can be found in [walrus/run_scripts](walrus/run_scripts).
+
+Most of these use relative paths and slurm. For example, one would launch the training script with:
+```bash
+cd /path/to/thisfolder/walrus/walrus
+sbatch run_scripts/pretrain_example_distributed_walrus.sh
+```
+Local invocations also available for smaller training and validation runs.
+
+## Tutorials
+
+To help get you started, we include a set of demo notebooks. These are:
+
+- [Transforming data into Well Format](demo_notebooks/walrus_example_0_ConvertingDataIntoWellFormat.ipynb) - Since our repository largely uses structures from the Well, we include a guide 
+to transforming data so it can be used easily within this repository. 
+- [Running Walrus](demo_notebooks/walrus_example_1_RunningWalrus.ipynb) - This notebook walks through the basics of how to use Walrus both with Well data and with external data.
+
+If you already have a local copy of the Well, you can skip straight to example 1. However, if you do not, it's recommended to follow example 0 to get some data to use. We also include several example run scripts for training and validation, both using slurm and torchrun for distribution and just running single-GPU code. 
+
+## Model Overview
+
+### Architecture
+
+Walrus uses and encoder-processor-decoder structure. Encoder/decoder are hMLPs/transposed hMLPs using stride modulation to dynamically adjust the
+internal resolution. The processor consists of blocks containing factorized space and time attention. 
+
+### Patch Jittering
+
+Walrus suppressed the growth of long-run instabilities through the use of *patch jittering*. Patch jittering involves randomly translating the reference frame (with padding for boundaries)
+before each step. While the paper goes into more theoretical detail on why this works, the core idea is that the specific downsampling pattern leads to predictable accumulation
+of error and that randomizing this process can help alleviate this pathology.
+
+### Adaptive Compute
+
+To handle varying compute budgets and problem complexities, we also employ [stride modulation](https://arxiv.org/pdf/2507.09264) to allow users to adjust
+their downstream resolution. During pretraining, this was used to keep internal resolution fairly consistent (32/33 per dim in 2D, 16/17 in 3D). In this approach, the downsampling
+layers of the encoder/decoder will dynamically adjust their stride based on a target internal resolution. 
+
+### Efficient Training
+
+Heterogeneous data can easily lead to training bottlenecks as many distribution primitives require regular syncing. Our approach
+adjusts downsampling to ensure consistent token counts, but also adjust context length and batch size to minimize gaps. This isn't 
+enough to completely eliminate discrepencies, so we must also adjust our sampling strategy to be aware of where these dead cycles can emerge.
+We implement sampling such that when training with HSDP, all nodes within a sharding group are forced to sample the same data source. This
+balances batch diversity and efficiency. 
+
+## License
+This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.
+
+## Contact
+
+- **Issues**: [GitHub Issues](https://github.com/PolymathicAI/AION/issues)
+
+For other queries, please reach out to the corresponding author: mmccabe@flatironinstitute.org. 
+
+## Acknowledgements
+
+Walrus is built by [Polymathic AI](https://polymathic-ai.org/) as part of our mission of advancing the frontier of AI for scientific application. Polymathic AI gratefully acknowledges funding from the Simons Foundation and Schmidt Sciences, LLC. This work was performed with compute from the Scientific Computing Core, a
+division of the Flatiron Institute, a division of the Simons Foundation and from the National AI Research Resource Pilot, including support from NVIDIA
+and NVIDIA’s DGX Cloud product which includes the NVIDIA AI Enterprise Software Platform.
+
+## Citing Walrus
+
+Please use the following citation for Walrus:
+
+```
+@misc{mccabe2025walruscrossdomainfoundationmodel,
+      title={Walrus: A Cross-Domain Foundation Model for Continuum Dynamics}, 
+      author={Michael McCabe and Payel Mukhopadhyay and Tanya Marwah and Bruno Regaldo-Saint Blancard and Francois Rozet and Cristiana Diaconu and Lucas Meyer and Kaze W. K. Wong and Hadi Sotoudeh and Alberto Bietti and Irina Espejo and Rio Fear and Siavash Golkar and Tom Hehir and Keiya Hirashima and Geraud Krawezik and Francois Lanusse and Rudy Morel and Ruben Ohana and Liam Parker and Mariel Pettee and Jeff Shen and Kyunghyun Cho and Miles Cranmer and Shirley Ho},
+      year={2025},
+      eprint={2511.15684},
+      archivePrefix={arXiv},
+      primaryClass={cs.LG},
+      url={https://arxiv.org/abs/2511.15684}, 
+}
+```
+