Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
26 changes: 26 additions & 0 deletions docs/source/AITraining/index.rst
Original file line number Diff line number Diff line change
@@ -0,0 +1,26 @@
AI Training Datasets
====================

The E3SM project and `Allen Institute for AI (Ai2) <https://allenai.org/>`_ have developed several datasets for AI and machine learning applications. These datasets have been postprocessed for ingestion by the `ACE <https://github.com/ai2cm/ace?tab=readme-ov-file#ai2-climate-emulator>`_/`FourCastNet <https://github.com/NVlabs/FourCastNet>`_ emulator.

Dataset Details
***************

- **EAMv2**: 73-year EAMv2 simulation (F2010, perpetual 2010 forcing, repeating annual SST cycle from 2005-2014 average). 6-hourly outputs. More details see: `Duncan et al. 2024 <https://agupubs.onlinelibrary.wiley.com/doi/full/10.1029/2024JH000136>`_

- **EAMv3**: 51-year EAMv3 AMIP-style simulation (1970-2020, F2010 with AMIP SSTs, constant 2010 CO2). Includes multiple ENSO cycles and global warming trend. More details see: `Wu et al. 2025 <https://agupubs.onlinelibrary.wiley.com/doi/10.1029/2025JH000774>`_

- **E3SMv3**: Coupled pre-industrial and historical training data (coming soon)

- **SCREAMv1**: Simple Cloud-Resolving E3SM Atmosphere Model version 1 training data (coming soon)

.. tip::
Check the ``archive_contents`` text file to see files included in each tar archive. You can selectively download the files you need.

Data Access
***********

.. toctree::
:maxdepth: 2

simulation_data/simulation_table
8 changes: 8 additions & 0 deletions docs/source/AITraining/simulation_data/index.rst
Original file line number Diff line number Diff line change
@@ -0,0 +1,8 @@
***************
Simulation Data
***************

.. toctree::
:maxdepth: 2

simulation_table
23 changes: 23 additions & 0 deletions docs/source/AITraining/simulation_data/simulation_table.rst
Original file line number Diff line number Diff line change
@@ -0,0 +1,23 @@
********************************************
AI Training Datasets simulation table
********************************************

+-------------------------------------------------------------------+-----------------+---------------------------------------------------------------------------+-------------------------------------------------------------------------------+---------------------------------------------------------------------------------------------------------------------+
| Dataset | Status | Data Size | HPSS Path | HPSS URL |
+===================================================================+=================+===========================================================================+===============================================================================+=====================================================================================================================+
| **EAMv2** | | | | |
+-------------------------------------------------------------------+-----------------+---------------------------------------------------------------------------+-------------------------------------------------------------------------------+---------------------------------------------------------------------------------------------------------------------+
| EAMv2 AI Training Dataset | Available | 1.2T | /home/projects/e3sm/www/AI_training_data/e3sm-v2-climsst-180x360-gaussian | `Link <https://portal.nersc.gov/archive/home/projects/e3sm/www/AI_training_data/e3sm-v2-climsst-180x360-gaussian>`_ |
+-------------------------------------------------------------------+-----------------+---------------------------------------------------------------------------+-------------------------------------------------------------------------------+---------------------------------------------------------------------------------------------------------------------+
| **EAMv3** | | | | |
+-------------------------------------------------------------------+-----------------+---------------------------------------------------------------------------+-------------------------------------------------------------------------------+---------------------------------------------------------------------------------------------------------------------+
| EAMv3 AI Training Dataset | Available | 1.3T | /home/projects/e3sm/www/AI_training_data/e3sm-v3-amip-180x360-gaussian | `Link <https://portal.nersc.gov/archive/home/projects/e3sm/www/AI_training_data/e3sm-v3-amip-180x360-gaussian>`_ |
+-------------------------------------------------------------------+-----------------+---------------------------------------------------------------------------+-------------------------------------------------------------------------------+---------------------------------------------------------------------------------------------------------------------+
| **E3SMv3** | | | | |
+-------------------------------------------------------------------+-----------------+---------------------------------------------------------------------------+-------------------------------------------------------------------------------+---------------------------------------------------------------------------------------------------------------------+
| E3SMv3 Coupled AI Training Dataset | Coming Soon | TBD | TBD | TBD |
+-------------------------------------------------------------------+-----------------+---------------------------------------------------------------------------+-------------------------------------------------------------------------------+---------------------------------------------------------------------------------------------------------------------+
| **SCREAMv1** | | | | |
+-------------------------------------------------------------------+-----------------+---------------------------------------------------------------------------+-------------------------------------------------------------------------------+---------------------------------------------------------------------------------------------------------------------+
| SCREAMv1 AI Training Dataset | Coming Soon | TBD | TBD | TBD |
+-------------------------------------------------------------------+-----------------+---------------------------------------------------------------------------+-------------------------------------------------------------------------------+---------------------------------------------------------------------------------------------------------------------+
1 change: 1 addition & 0 deletions docs/source/index.rst
Original file line number Diff line number Diff line change
Expand Up @@ -21,6 +21,7 @@ simulations.
v3/index
SCREAMv0/index
SCREAMv1/index
AITraining/index



Expand Down