GitHub - hwq0726/GoF-for-Watermark-Detection: Code for the paper https://arxiv.org/abs/2510.03944

Overview

This repository contains the code for the NeurIPS 2025 Spotlight paper:

On the Empirical Power of Goodness-of-Fit Tests in Watermark Detection

(Also available on arXiv)

Our implementations of three watermarking algorithms are based on:

Gumbel-max: Implementation based on link.
Inverse-Transform: Implementation based on link.
SynthID: Implementation based on MarkLLM with significant modifications.

📜 Important Notes:

Implementation Details: Our implementation uses the continuous version of SynthID, in which the $g$-values follow a uniform distribution on $[0,1]$ for each layer. This differs from both the original SynthID and MarkLLM repositories, which use discrete $g$-values following Bernoulli(0.5). Both versions were proposed in the SynthID paper; we chose the continuous version because its pivotal statistics provide richer information for watermark detection.
Flexibility in Watermark Detection: To use our goodness-of-fit tests for watermark detection, you can generate pivotal statistics with any compatible implementation; you are not restricted to our code for this step. The detection framework works with pivotal statistics regardless of their source.
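
As a hedged illustration of the note above, the sketch below runs a one-sample Kolmogorov-Smirnov test (one classical goodness-of-fit test) on an array of pivotal statistics from any source. It assumes that, for unwatermarked text, the pivotal statistics are i.i.d. Uniform(0, 1), as is the case for Gumbel-max. The function name `ks_uniform_pvalues`, the asymptotic p-value formula, and the synthetic data are illustrative choices and are not taken from this repository's `detect_utils.py`.

```python
import numpy as np


def ks_uniform_pvalues(pivots, terms=100):
    """KS test of each row against Uniform(0, 1); returns one p-value per row.

    Assumption: under the null (no watermark) each row is i.i.d. Uniform(0, 1).
    The p-value uses the asymptotic Kolmogorov distribution, so it is only
    approximate for short texts.
    """
    pivots = np.asarray(pivots, dtype=float)
    if pivots.ndim != 2:
        raise ValueError("expected shape (num_samples, num_tokens)")
    n = pivots.shape[1]
    x = np.sort(pivots, axis=1)
    i = np.arange(1, n + 1)
    # Largest gap between the empirical CDF and the Uniform(0, 1) CDF.
    d_plus = np.max(i / n - x, axis=1)
    d_minus = np.max(x - (i - 1) / n, axis=1)
    d = np.maximum(d_plus, d_minus)
    lam = np.sqrt(n) * d
    # Asymptotic tail: 2 * sum_{k>=1} (-1)^(k-1) * exp(-2 * k^2 * lam^2).
    k = np.arange(1, terms + 1)
    series = ((-1.0) ** (k - 1)) * np.exp(-2.0 * np.outer(lam**2, k**2))
    p = 2.0 * series.sum(axis=1)
    # For very small lam the tail probability is 1 to numerical precision.
    p = np.where(lam < 0.2, 1.0, p)
    return np.clip(p, 0.0, 1.0)


# Synthetic stand-ins (hypothetical data, not produced by a language model).
rng = np.random.default_rng(0)
null_pivots = rng.uniform(size=(100, 200))        # "unwatermarked" rows
alt_pivots = rng.uniform(size=(100, 200)) ** 0.5  # rows skewed toward 1

p_null = ks_uniform_pvalues(null_pivots)
p_alt = ks_uniform_pvalues(alt_pivots)
print("rejections at 0.01 (null):", int(np.sum(p_null < 0.01)))
print("rejections at 0.01 (skewed):", int(np.sum(p_alt < 0.01)))
```

A small p-value for a row is evidence against the uniform null, which in this setting suggests the text carries a watermark.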

🛠️ Usage

Watermark Detection with GoF Tests

To detect watermarked text using Goodness-of-Fit (GoF) tests:

Navigate to the GoF-tests directory
Prepare your pivotal statistics:
- Format: NumPy array with shape (num_samples, num_tokens)
- Can be generated using any compatible watermarking method
- Example data provided in example_data_gumbel.pkl (Gumbel-max watermark statistics)
Follow the step-by-step guide in demo.ipynb to run the detection tests.
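
The preparation step above can be sketched as follows. This is a minimal example that assumes the pickle file unpickles directly to a single array-like object; the internal layout of `example_data_gumbel.pkl` is not described here, so it may differ (for instance, a dict of arrays), in which case the loader needs adapting. The helper name `load_pivotal_statistics` and the demo file are hypothetical.

```python
import os
import pickle
import tempfile

import numpy as np


def load_pivotal_statistics(path):
    """Load pivotal statistics and check the (num_samples, num_tokens) layout.

    Assumption: the file unpickles to one array-like object.
    Only unpickle files from sources you trust.
    """
    with open(path, "rb") as f:
        obj = pickle.load(f)
    pivots = np.asarray(obj, dtype=float)
    if pivots.ndim != 2:
        raise ValueError(
            f"expected shape (num_samples, num_tokens), got {pivots.shape}"
        )
    if not np.isfinite(pivots).all():
        raise ValueError("pivotal statistics contain NaN or inf")
    return pivots


# Demo with a synthetic stand-in file (hypothetical contents).
rng = np.random.default_rng(0)
with tempfile.TemporaryDirectory() as tmp:
    demo_path = os.path.join(tmp, "demo_pivots.pkl")
    with open(demo_path, "wb") as f:
        pickle.dump(rng.uniform(size=(8, 50)), f)
    pivots = load_pivotal_statistics(demo_path)

num_samples, num_tokens = pivots.shape
print(num_samples, num_tokens)
```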

Reproducing Experimental Results

To reproduce the results from our experiments:

For Gumbel-max and Inverse-Transform watermarks: See Gumbel&Transform directory
For SynthID watermark: See SynthID directory

🧭 Repository Structure

`GoF-tests/`

demo.ipynb: A step-by-step guide on how to use the GoF tests to detect watermarked text.
example_data_gumbel.pkl: The example data for the pivotal statistics of Gumbel-max watermark.
detect_utils.py: Contains the code for the baseline and Goodness-of-fit tests used in the paper.
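
This README does not say which baselines `detect_utils.py` implements. Purely as an illustration of what a non-GoF baseline can look like, the sketch below computes a sum-based score that is commonly used for Gumbel-max pivotal statistics: under the assumption that unwatermarked statistics $Y_t$ are i.i.d. Uniform(0, 1), each $-\log(1 - Y_t)$ is Exp(1), so the row sum has mean and variance equal to the number of tokens. The function name, the normal approximation for the p-value, and the synthetic data are assumptions made for this example.

```python
import math

import numpy as np


def sum_score_pvalues(pivots):
    """Sum-based score per row with a normal-approximation p-value.

    Assumption: under the null each entry is i.i.d. Uniform(0, 1), so
    -log(1 - Y) is Exp(1) and the row sum has mean n and variance n.
    """
    pivots = np.asarray(pivots, dtype=float)
    if pivots.ndim != 2:
        raise ValueError("expected shape (num_samples, num_tokens)")
    n = pivots.shape[1]
    # Keep values strictly below 1 so the logarithm stays finite.
    clipped = np.clip(pivots, 0.0, 1.0 - 1e-12)
    scores = -np.log1p(-clipped).sum(axis=1)
    z = (scores - n) / math.sqrt(n)
    # One-sided upper tail of the standard normal distribution.
    return np.array([0.5 * math.erfc(v / math.sqrt(2.0)) for v in z])


# Synthetic stand-ins (hypothetical data).
rng = np.random.default_rng(1)
p_null = sum_score_pvalues(rng.uniform(size=(100, 200)))
p_alt = sum_score_pvalues(rng.uniform(size=(100, 200)) ** 0.5)
print("rejections at 0.01 (null):", int(np.sum(p_null < 0.01)))
print("rejections at 0.01 (skewed):", int(np.sum(p_alt < 0.01)))
```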

`Gumbel&Transform/`

attack.py: Tests robustness of watermarked text against various attacks:
- Word deletion
- Synonym substitution
- Dipper paraphrase (not used in the paper)
attack_info.py: Performs information-rich attacks on the watermarked text and then runs detection.
detect_human.py: Performs detection on human-written text.
generating_samples.py: Generates watermarked text using the Gumbel-max or the Inverse-Transform algorithm.
detect_utils.py: Contains the code for the baseline and Goodness-of-fit tests used in the paper.
get_score.py: Performs watermark detection using different methods.
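
To make the link between generation and detection concrete, here is a toy sketch of one Gumbel-max sampling step and the pivotal statistic it yields. It is not the code in `generating_samples.py`: a fixed five-token distribution replaces the language model, and a seeded NumPy generator stands in for the key-derived pseudo-random numbers (real implementations derive them from a secret key and the preceding context). The name `gumbel_max_step` is hypothetical.

```python
import numpy as np


def gumbel_max_step(probs, uniforms):
    """Pick argmax_w U_w ** (1 / p_w); return (token, pivotal statistic U_token)."""
    probs = np.asarray(probs, dtype=float)
    uniforms = np.asarray(uniforms, dtype=float)
    # log(U_w) / p_w has the same argmax as U_w ** (1 / p_w) and is more stable.
    with np.errstate(divide="ignore", invalid="ignore"):
        scores = np.where(probs > 0, np.log(uniforms) / probs, -np.inf)
    token = int(np.argmax(scores))
    return token, float(uniforms[token])


rng = np.random.default_rng(42)  # stand-in for the secret key (assumption)
probs = np.array([0.5, 0.2, 0.15, 0.1, 0.05])  # toy next-token distribution

tokens, pivots = [], []
for _ in range(2000):
    u = rng.uniform(size=probs.size)
    tok, y = gumbel_max_step(probs, u)
    tokens.append(tok)
    pivots.append(y)

freq = np.bincount(tokens, minlength=probs.size) / len(tokens)
mean_pivot = float(np.mean(pivots))
print("token frequencies:", np.round(freq, 3))
print("mean pivotal statistic:", round(mean_pivot, 3))
```

The token frequencies stay close to `probs`, so the sampling rule does not distort the distribution, while the pivotal statistics are pushed toward 1. For text written without the key they would instead be Uniform(0, 1), and that gap is what a detector measures.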

`SynthID/`

Contains the implementation of SynthID.

`MarkLLM/`

synthid_generate.py: Generates watermarked text using the SynthID algorithm
attack.py: Tests robustness of watermarked text against various attacks:
- Word deletion
- Synonym substitution
- Dipper paraphrase
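
The word deletion attack listed above can be sketched as follows. This is a minimal, whitespace-based version written for illustration; the actual `attack.py` may operate on tokens and choose positions differently. The function name `delete_words` and its parameters are hypothetical.

```python
import random


def delete_words(text, fraction, seed=0):
    """Remove a random `fraction` of whitespace-separated words from `text`."""
    if not 0.0 <= fraction <= 1.0:
        raise ValueError("fraction must be in [0, 1]")
    words = text.split()
    n_delete = round(fraction * len(words))
    rng = random.Random(seed)
    # Sample distinct positions to drop, then keep the rest in order.
    drop = set(rng.sample(range(len(words)), n_delete))
    return " ".join(w for i, w in enumerate(words) if i not in drop)


original = "the quick brown fox jumps over the lazy sleeping dog"
attacked = delete_words(original, fraction=0.3, seed=1)
print(attacked)
```

After such an edit, the pivotal statistics are recomputed on the attacked text, and detection power is compared with the unattacked case.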

`results/`

All experimental results of SynthID are stored in this directory:

read_results.py: Extracts pivotal values from watermarked texts
detect_utils.py: Contains the code for the baseline and Goodness-of-fit tests used in the paper.
get_score.py: Performs watermark detection using different methods. This script also includes the information-rich attack.
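
Once a detection method has produced one p-value per text, its empirical Type I error and empirical power at a significance level are the rejection rates on human-written and watermarked texts, respectively. The sketch below shows that bookkeeping under stated assumptions: the helper `rejection_rate` and the p-values are hypothetical and are not outputs of `get_score.py`.

```python
import numpy as np


def rejection_rate(p_values, alpha=0.01):
    """Fraction of texts whose p-value is at or below the level `alpha`."""
    p_values = np.asarray(p_values, dtype=float)
    if p_values.size == 0:
        raise ValueError("need at least one p-value")
    return float(np.mean(p_values <= alpha))


# Hypothetical p-values from some detector (illustration only).
human_p = [0.62, 0.08, 0.35, 0.91, 0.004, 0.47]
watermarked_p = [1e-6, 0.003, 0.2, 1e-4, 0.009, 2e-5]

type_one_error = rejection_rate(human_p, alpha=0.01)
power = rejection_rate(watermarked_p, alpha=0.01)
print("empirical Type I error:", type_one_error)
print("empirical power:", power)
```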

