Symmetry as Intervention;
Causal Estimation with Data Augmentation

Implementation for "An Analysis of Causal Effect Estimation using Outcome Invariant Data Augmentation" (NeurIPS 2025).

Overview

.
├── src/
│   ├── data_augmentors/    # Data augmentation modules
│   ├── experiments/        # Scripts for running paper experiments
│   ├── regressors/         # ERM, IV, IVL, and baseline models
│   ├── sem/                # Structural equation model definitions
│   └── main.py             # Entry point for training / evaluation
├── config.yaml             # Configuration file for experiments
├── environment.yaml        # Conda environment definition
├── requirements.txt        # Python dependencies
├── Dockerfile              # Container setup for reproducibility
├── LICENSE                 # Code license (MIT)
└── README.md

Setup

Clone this repository.

git clone https://github.com/uzairakbar/causal-data-augmentation.git
cd causal-data-augmentation

To use a GPU, set the --index-url value in requirements.txt to match your CUDA version (see the PyTorch installation instructions).

Proceed with setup using one of the options below, or simply open this project in Colab.

Conda environment (recommended)

environment=causal-data-augmentation
conda env create -f environment.yaml
conda activate "$environment"
export PYTORCH_ENABLE_MPS_FALLBACK=1

Python `venv` (tested with `3.10.14`)

environment='.causal-data-augmentation'
python -m venv "$environment"
"$environment"/bin/python -m pip install -r requirements.txt
source "$environment"/bin/activate
export PYTORCH_ENABLE_MPS_FALLBACK=1

Docker

Build the image from the provided Dockerfile and run it.

image=causal-data-augmentation-image
container=causal-data-augmentation-container
docker build --tag "$image" .
docker run --name "$container" \
    --volume "$PWD"/data:/app/data/ \
    --volume "$PWD"/artifacts:/app/artifacts/ \
    "$image"

To delete Docker artifacts after finishing experiments, run the following commands.

image=causal-data-augmentation-image
container=causal-data-augmentation-container
docker rm "$container"
docker image rm -f "$image"

Usage

Configuring experiments

Use the ./config.yaml file to specify the experiment parameters. The provided (default) configuration was used to generate the figures in the paper.

Comment out (or remove) the experiments in ./config.yaml that you are not interested in, then run the ./src/main.py script to run the remaining ones.
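Before launching a long run, it can help to check which experiments are still enabled. The sketch below is not part of the repository: the helper name `active_top_level_keys` is hypothetical, and it assumes each experiment is a top-level key in config.yaml, which may not match the actual file layout.

```python
from pathlib import Path


def active_top_level_keys(config_text: str) -> list[str]:
    """Return top-level YAML keys that are not commented out.

    Assumption (not confirmed by the repository): every experiment is
    a top-level mapping key in config.yaml.
    """
    keys = []
    for line in config_text.splitlines():
        # Skip blank lines, comment lines, and indented or list lines.
        if not line.strip() or line.lstrip().startswith("#") or line[0] in " \t-":
            continue
        key, sep, _ = line.partition(":")
        if sep:
            keys.append(key.strip())
    return keys


if __name__ == "__main__":
    path = Path("config.yaml")
    if path.exists():
        print(active_top_level_keys(path.read_text()))
```

The helper deliberately avoids a YAML parser so that it runs without extra dependencies. It only inspects unindented lines, so nested settings are ignored.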

The generated figures and artifacts are saved in the ./artifacts/ directory after the experiments finish execution.
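To see what a run produced, the output directory can be listed from Python. This is a minimal sketch: the helper name `list_artifacts` is hypothetical, and it assumes only that outputs land somewhere under ./artifacts/, as stated above.

```python
from pathlib import Path


def list_artifacts(root: str = "artifacts") -> list[str]:
    """Return sorted relative paths of all files under the artifacts directory."""
    base = Path(root)
    if not base.is_dir():
        return []  # nothing has been generated yet
    return sorted(
        p.relative_to(base).as_posix() for p in base.rglob("*") if p.is_file()
    )


if __name__ == "__main__":
    for name in list_artifacts():
        print(name)
```

When using Docker, the same listing works on the host, because the run command above mounts "$PWD"/artifacts into the container.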

CPU vs. GPU backend

The code uses the CPU backend for PyTorch by default (recommended for the optical_device and linear_simulation experiments). To use a GPU or MPS backend instead, set the CPU_ONLY variable in ./src/regressors/utils.py to False.
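For intuition, a flag like CPU_ONLY typically gates device selection along the following lines. This is an illustrative sketch, not the code in ./src/regressors/utils.py: the function `pick_device` and the CUDA-before-MPS priority are assumptions.

```python
def pick_device(cpu_only: bool, cuda_available: bool, mps_available: bool) -> str:
    """Return a PyTorch device string from a CPU-only flag and backend availability.

    Hypothetical helper: the priority order (CUDA, then MPS, then CPU)
    is an assumption, not taken from the repository.
    """
    if cpu_only:
        return "cpu"
    if cuda_available:
        return "cuda"
    if mps_available:
        return "mps"
    return "cpu"  # fall back when no accelerator is present


if __name__ == "__main__":
    # Example with availability supplied by hand; no PyTorch import needed.
    print(pick_device(cpu_only=False, cuda_available=False, mps_available=True))
```

With PyTorch installed, the availability arguments could come from `torch.cuda.is_available()` and `torch.backends.mps.is_available()`, and the returned string can be passed to `torch.device`.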

Running experiments

Run the main script with `python src/main.py`, or run the Docker container (see above).

Citation

If you find our work helpful, consider citing our paper and leaving a star ⭐.

@misc{akbar2025causalDataAugmentation,
      title={An Analysis of Causal Effect Estimation using Outcome Invariant Data Augmentation}, 
      author={Uzair Akbar and Niki Kilbertus and Hao Shen and Krikamol Muandet and Bo Dai},
      year={2025},
      eprint={2510.25128},
      archivePrefix={arXiv},
      primaryClass={cs.LG},
      url={https://arxiv.org/abs/2510.25128}, 
}
