This is the implementation for the NeurIPS 2023 paper "Training neural operators to preserve invariant measures of chaotic attractors".
```bibtex
@article{jiang2023training,
  title={Training neural operators to preserve invariant measures of chaotic attractors},
  author={Jiang, Ruoxi and Lu, Peter Y and Orlova, Elena and Willett, Rebecca},
  journal={Advances in Neural Information Processing Systems},
  year={2023}
}
```
To generate the Lorenz 96 data in the `l96_data` folder, run:

```bash
python generate_data.py
```
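For reference, the Lorenz 96 system is the coupled ODE dx_i/dt = (x_{i+1} - x_{i-2}) x_{i-1} - x_i + F with periodic indices. Below is a minimal sketch of how such a trajectory could be generated with SciPy; the system size, forcing, time grid, solver, and output path are illustrative assumptions, and the repo's `generate_data.py` may differ on all of them.

```python
# Illustrative sketch of Lorenz 96 trajectory generation (not the repo's exact script).
import numpy as np
from scipy.integrate import solve_ivp

def lorenz96(t, x, F=10.0):
    # dx_i/dt = (x_{i+1} - x_{i-2}) * x_{i-1} - x_i + F, with periodic boundary indices
    return (np.roll(x, -1) - np.roll(x, 2)) * np.roll(x, 1) - x + F

K = 40                                              # number of state variables (assumed)
x0 = 10.0 * np.ones(K) + 0.01 * np.random.randn(K)  # small perturbation off the fixed point
t_eval = np.arange(0.0, 20.0, 0.01)                 # sampling times (assumed)
sol = solve_ivp(lorenz96, (0.0, 20.0), x0, t_eval=t_eval, method="RK45")
trajectory = sol.y.T                                # shape (num_steps, K)
np.save("l96_data/example_trajectory.npy", trajectory)  # hypothetical output path
```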
Then, to create noisy observations and speed up data loading during training, run the following from the parent folder:

```bash
python dataloader/dataloader_l96.py
```
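As a rough illustration of what this preprocessing step does, one could add Gaussian observation noise to the clean trajectories and cache the result as arrays. The noise model, noise level, and file names below are assumptions; the actual choices live in `dataloader/dataloader_l96.py`.

```python
# Illustrative sketch only: add Gaussian observation noise and cache for fast loading.
import numpy as np

clean = np.load("l96_data/example_trajectory.npy")       # hypothetical file from the previous step
noise_level = 0.1                                         # assumed relative noise scale
noisy = clean + noise_level * clean.std() * np.random.randn(*clean.shape)
np.save("l96_data/example_trajectory_noisy.npy", noisy)   # hypothetical cached output
```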
This implementation supports DistributedDataParallel training; the default configuration uses 4 GPUs. By default, the optimal transport problem is solved with the Sinkhorn divergence.
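For orientation, the snippet below is a minimal sketch of the DistributedDataParallel setup that a launcher such as `torchrun` drives; the provided `srun.sh` scripts handle this for you, and the model here is a placeholder rather than the repo's neural operator.

```python
# Minimal DistributedDataParallel sketch (illustrative; not the repo's exact setup).
import os
import torch
import torch.distributed as dist
from torch.nn.parallel import DistributedDataParallel as DDP

dist.init_process_group(backend="nccl")        # one process per GPU
local_rank = int(os.environ["LOCAL_RANK"])     # set by torchrun / the launch script
torch.cuda.set_device(local_rank)

model = torch.nn.Linear(40, 40).cuda()         # placeholder for the neural operator
model = DDP(model, device_ids=[local_rank])
```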
To train the neural operator with the optimal transport (OT) method for Lorenz 96 data, run:

```bash
bash experiments/OT_l96/srun.sh
```
The hyperparameters of the OT method are (see the sketch below for how they enter the loss):
- Weight of the OT loss: `--lambda_geomloss 3.`
- Regularization value in the Sinkhorn algorithm; smaller regularization usually gives higher accuracy but slows the training: `--blur 0.02`
- Amount of physical knowledge used during training; set this to a value larger than 0 if you only have partial knowledge of the system: `--with_geomloss_kd 0.`
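To make the role of `--lambda_geomloss` and `--blur` concrete, here is a minimal sketch of an OT loss term built with the geomloss package. The tensor shapes and the way samples are drawn from the predicted and reference trajectories are illustrative assumptions, not the repo's exact implementation.

```python
# Illustrative Sinkhorn-divergence loss term (not the repo's exact code).
import torch
from geomloss import SamplesLoss

lambda_geomloss = 3.0   # corresponds to --lambda_geomloss
blur = 0.02             # corresponds to --blur (Sinkhorn regularization scale)

sinkhorn = SamplesLoss(loss="sinkhorn", p=2, blur=blur)

# Treat time steps of a trajectory as samples from the invariant measure (assumed shapes).
pred_samples = torch.randn(1024, 40)    # samples from the model's predicted trajectory
true_samples = torch.randn(1024, 40)    # samples from the reference (noisy) trajectory

ot_loss = lambda_geomloss * sinkhorn(pred_samples, true_samples)
```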
To train the neural operator with the contrastive learning (CL) method for Lorenz 96 data, run:

```bash
bash experiments/CL_l96/srun.sh
```
The hyperparameters of the CL method are (see the sketch below for how they enter the loss):
- Memory bank size, which should be divisible by the number of GPUs times `batch_size_metricL`: `--bank_size 1000`
- Temperature value controlling the radius of the hypersphere feature space: `--T_metricL_traj_alone 0.3`
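As a rough sketch of how the temperature and memory bank enter a contrastive objective, the snippet below uses an InfoNCE-style loss on L2-normalized features; the feature extractor, feature dimension, batch size, and bank update rule are assumptions, and the repo's actual CL implementation may differ.

```python
# Illustrative InfoNCE-style contrastive loss with a memory bank (not the repo's exact code).
import torch
import torch.nn.functional as F

T = 0.3                                    # corresponds to --T_metricL_traj_alone
bank_size, feat_dim = 1000, 128            # --bank_size; feature dimension is assumed

query = F.normalize(torch.randn(32, feat_dim), dim=1)        # features of predicted trajectories
positive = F.normalize(torch.randn(32, feat_dim), dim=1)     # features of matching true trajectories
bank = F.normalize(torch.randn(bank_size, feat_dim), dim=1)  # negatives from the memory bank

pos_logits = (query * positive).sum(dim=1, keepdim=True) / T  # (32, 1)
neg_logits = query @ bank.t() / T                             # (32, bank_size)
logits = torch.cat([pos_logits, neg_logits], dim=1)
labels = torch.zeros(query.size(0), dtype=torch.long)         # positive pair sits at index 0
cl_loss = F.cross_entropy(logits, labels)
```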
If you want to adapt the codebase to your own example, you may want to run on a single GPU during code development. This can be done by setting `--nproc_per_node=1` in the bash file, which launches 1 process (and thus uses 1 GPU) per node instead of 4.
Only 1 GPU is required for evaluation. To run the evaluation for each method, run:

```bash
bash experiments/OT_l96/eval.sh
bash experiments/OT_l96/eval_partial.sh
bash experiments/CL_l96/eval.sh
```
Turning on the `--eval_LE` flag computes the leading Lyapunov exponent (LLE) for the trained neural operator, which takes ~2 hours for 200 test instances.
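For reference, the leading Lyapunov exponent can be estimated with the standard two-trajectory (Benettin-style) procedure: evolve a reference and a slightly perturbed trajectory, accumulate the log growth of their separation, and renormalize the separation periodically. The sketch below is illustrative only; the step sizes, renormalization interval, and the way the trained operator is rolled out in `eval_scripts/LE_l96.py` and the `--eval_LE` path may differ.

```python
# Illustrative Benettin-style estimate of the leading Lyapunov exponent
# (not the repo's exact procedure). `step` advances the state by dt and is a
# stand-in for either a true L96 integrator or the trained neural operator.
import numpy as np

def estimate_lle(step, x0, dt, n_steps, eps=1e-8):
    x = x0.copy()
    v = np.random.randn(*x0.shape)
    x_pert = x0 + eps * v / np.linalg.norm(v)   # perturbation of exact size eps
    log_growth = 0.0
    for _ in range(n_steps):
        x, x_pert = step(x, dt), step(x_pert, dt)
        d = np.linalg.norm(x_pert - x)
        log_growth += np.log(d / eps)
        x_pert = x + eps * (x_pert - x) / d     # renormalize the separation back to eps
    return log_growth / (n_steps * dt)          # leading Lyapunov exponent estimate
```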
If you want to calculate the ground-truth LLE values for the data, run:

```bash
python eval_scripts/LE_l96.py
```
After all the LLEs are calculated, you can compare them by running:

```bash
python eval_scripts/read_LE.py
```
