Underwater Image Enhancement with ML

This is an automated machine learning pipeline that replaces manual image editing for underwater GoPro images captured during ROV surveys. It converts RAW GPR files to enhanced images that match the quality of manual Adobe Lightroom editing.

For People That Just Want to Process Images (GUI Application)

No programming required - Desktop application available for Windows, macOS, and Linux.

👉 See GUI Documentation

Quick steps:

Download the application for your platform
Download a trained model (.pth file)
Select a folder of images to enhance and hit go!

For People Wanting to Train Their Own Models or Use Command Line Inference

Train custom models or integrate into automated workflows.

👉 See Training Documentation

There are many options for customizing model architecture, training parameters, and datasets.

These are defined in the setup_and_train_config.yaml file.

The most important parameters are:

model - Model architecture to use:
- unet - Standard U-Net autoencoder (~31M params) - faster training, good baseline
- ushape_transformer - U-shape Transformer with CMSFFT+SGFMT (~31M params) - better quality, slower training
- ss_uie - State-Space UIE (Mamba + FFT) - requires CUDA + mamba-ssm
- 3d_lut - Image-adaptive 3D LUT (<1M params) - per-pixel colour transform that preserves fine texture by construction; best for mostly-global (Lightroom-style) edits
loss - Loss function: auto (per-model default), combined (L1+MSE), composite (L1 + MS-SSIM + Focal Frequency, texture-preserving), or ss_uie
repo_id - Which Hugging Face dataset to download and train on
image_size - What size of images to train on. Ideally this should be as large as your GPU memory allows.
batch_size - How many images to process at once. Again, larger is better, but limited by GPU memory.
num_epochs - How many passes through the dataset to train for.
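
As a rough illustration of how these options fit together, the sketch below checks a configuration dictionary against the model and loss names listed above. The validate_config helper and the numeric example values are hypothetical and not part of this repository. The repo_id is taken from the Datasets section, and the real settings live in setup_and_train_config.yaml, whose exact layout may differ.

```python
# Hypothetical helper for illustration only; not part of this repository.
# The allowed names are copied from the parameter list in this README.
ALLOWED_MODELS = {"unet", "ushape_transformer", "ss_uie", "3d_lut"}
ALLOWED_LOSSES = {"auto", "combined", "composite", "ss_uie"}
POSITIVE_INT_KEYS = ("image_size", "batch_size", "num_epochs")


def validate_config(config):
    """Return a list of problems found; an empty list means nothing was flagged."""
    problems = []
    if config.get("model") not in ALLOWED_MODELS:
        problems.append(f"model must be one of {sorted(ALLOWED_MODELS)}")
    if config.get("loss") not in ALLOWED_LOSSES:
        problems.append(f"loss must be one of {sorted(ALLOWED_LOSSES)}")
    if not config.get("repo_id"):
        problems.append("repo_id must name a Hugging Face dataset")
    for key in POSITIVE_INT_KEYS:
        value = config.get(key)
        # bool is a subclass of int, so reject it explicitly.
        if not isinstance(value, int) or isinstance(value, bool) or value <= 0:
            problems.append(f"{key} must be a positive integer")
    return problems


example = {
    "model": "3d_lut",
    "loss": "auto",
    "repo_id": "Seattle-Aquarium/Seattle_Aquarium_benthic_imagery",
    "image_size": 512,  # assumed value, limited by GPU memory in practice
    "batch_size": 8,    # assumed value, limited by GPU memory in practice
    "num_epochs": 50,   # assumed value
}
print(validate_config(example))
```

A check like this can run before a long training job so that a typo in a model or loss name fails in seconds, not after the dataset download.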

Quick start:

python3.10 -m venv env
source env/bin/activate  # On Windows use `env\Scripts\activate`
pip install -r requirements.txt

# Train a model (downloads the dataset automatically, then trains)
python training/setup_and_train.py

# Run inference on images
python inference/inference.py input.jpg --checkpoint output/best_model.pth

Run Inference (Command Line Image Processing)

See the scripts in the inference/ folder for details on the available arguments.

python3.10 -m venv env
source env/bin/activate  # On Windows use `env\Scripts\activate`
pip install -r requirements.txt

python inference/inference.py input.jpg --checkpoint checkpoints/best_model.pth
python inference/inference.py /path/to/images --checkpoint checkpoints/best_model.pth --output enhanced/
python inference/inference.py input.jpg --checkpoint checkpoints/best_model.pth --compare
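
For automated workflows, the commands above can be assembled from Python. This is a minimal sketch that uses only the flags shown in this section (--checkpoint, --output, --compare). The build_inference_command and run_inference helpers are hypothetical names, not part of this repository, and whether --output and --compare can be combined is an assumption to check against the script itself.

```python
import subprocess
import sys


def build_inference_command(source, checkpoint, output=None, compare=False):
    """Assemble the argument list for inference/inference.py (hypothetical helper)."""
    command = [
        sys.executable,  # reuse the interpreter of the active virtual environment
        "inference/inference.py",
        str(source),
        "--checkpoint",
        str(checkpoint),
    ]
    if output is not None:
        command += ["--output", str(output)]
    if compare:
        command.append("--compare")
    return command


def run_inference(source, checkpoint, output=None, compare=False):
    """Run the script from the repository root; raises CalledProcessError on failure."""
    command = build_inference_command(source, checkpoint, output, compare)
    return subprocess.run(command, check=True)


# Show the command that would be run for a folder of images.
print(build_inference_command("/path/to/images", "checkpoints/best_model.pth", output="enhanced/"))
```

Passing the arguments as a list avoids shell quoting problems with paths that contain spaces.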

Preprocess GPR Files

python3.10 -m venv env
source env/bin/activate  # On Windows use `env\Scripts\activate`
pip install -r requirements.txt

python preprocessing/preprocess_images.py /path/to/gpr/files --output-dir processed

Pre-trained Models

Example trained models are available for download here: https://huggingface.co/Seattle-Aquarium

Datasets

The Seattle Aquarium CCR Underwater Image Enhancement Dataset is available at: https://huggingface.co/datasets/Seattle-Aquarium/Seattle_Aquarium_benthic_imagery

References

Project Discussion & Sample Data
U-Net Paper
U-shape Transformer for Underwater Image Enhancement - Lintao Peng et al.
SS-UIE: Adaptive Dual-domain Learning for Underwater Image Enhancement - Lintao Peng et al. (AAAI 2025)
Learning Image-Adaptive 3D Lookup Tables for High Performance Photo Enhancement in Real-Time - Hui Zeng et al. (ECCV 2020 / TPAMI 2020) - basis for the 3d_lut model (code)
Focal Frequency Loss for Image Reconstruction and Synthesis - Liming Jiang et al. (ICCV 2021) - frequency term in the composite loss
Loss Functions for Image Restoration with Neural Networks - Hang Zhao et al. (IEEE TCI 2017) - MS-SSIM + L1 mixed loss

Contributing

This project is developed to support the Seattle Aquarium's ROV survey enhancement pipeline. For questions or contributions, refer to the main CCR development repository.

You can also submit PRs or issues here and we will route them accordingly.

Quick Links:

GUI Users: Start with gui/README.md
Training Models: Start with training/README.md
