May the Feedback Be with You! Unlocking the Power of Feedback-Driven Deep Learning Framework Fuzzing via LLMs
FUEL (Feedback-driven fUzzing for dEep Learning frameworks via LLMs) is an advanced deep learning (DL) framework fuzzing tool designed to detect bugs in mainstream DL frameworks such as PyTorch and TensorFlow. FUEL combines a powerful generation LLM with an analysis LLM to fully leverage feedback information during the fuzzing loop, generating high-quality test cases that uncover potential bugs in DL frameworks. Additionally, FUEL features a feedback-aware simulated annealing algorithm and a program self-repair strategy, which improve model diversity and validity, respectively.
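At a high level, the generate–execute–analyze loop described above can be sketched as follows. This is a minimal illustration with the two LLM calls stubbed out; the function names (`generate_model`, `run_test`, `analyze_feedback`) are hypothetical stand-ins, not FUEL's actual API.

```python
def generate_model(analysis_summary):
    """Generation LLM (stubbed): produce a DL model test case guided by feedback."""
    return f"model guided by: {analysis_summary}"

def run_test(model_code):
    """Execute the test case and collect feedback (stubbed coverage/exception)."""
    return {"coverage": len(model_code), "exception": None}

def analyze_feedback(feedback):
    """Analysis LLM (stubbed): turn raw feedback into guidance for the next round."""
    if feedback["exception"] is not None:
        return f"avoid the pattern that raised {feedback['exception']}"
    return f"explore beyond coverage {feedback['coverage']}"

def fuzz(max_round):
    """One feedback-driven fuzzing loop: generate -> execute -> analyze."""
    summary = "initial seed"
    history = []
    for _ in range(max_round):
        code = generate_model(summary)
        feedback = run_test(code)
        summary = analyze_feedback(feedback)
        history.append((code, feedback))
    return history
```

Each round feeds the analysis of the previous round's execution back into generation, which is what distinguishes FUEL from one-shot LLM-based test generation.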
- **Intelligent Code Generation**: Leverages large language models to generate complex and effective deep learning model test cases
- **Feedback-Driven**: Smart feedback mechanism based on code coverage, bug reports, and exception logs to continuously optimize test generation strategies via LLMs
- **Program Self-Repair**: Automatically distinguishes between framework bugs and invalid test cases, then intelligently repairs invalid models using LLM-guided analysis
- **Heuristic Search**: Integrates heuristic algorithms like Feedback-Aware Simulated Annealing (FASA) for intelligent API operator selection
- **Differential Testing**: Supports multiple differential testing modes (hardware differences, compiler differences, etc.)
- **Efficient Detection**: Successfully discovered 104 new bugs, with 93 confirmed and 49 fixed
- Support for PyTorch and TensorFlow framework testing
- Multiple differential testing modes (CPU/CUDA hardware differences, compiler differences)
- Intelligent operator selection and combination
- Real-time code coverage feedback
- Exception detection and bug report generation
- Configurable LLM backends (local models/API services)
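Among the features above, Feedback-Aware Simulated Annealing (FASA) drives operator selection. The core idea can be illustrated with a plain simulated-annealing selector over a feedback score; the scoring function and cooling schedule below are illustrative assumptions, not FUEL's actual implementation.

```python
import math
import random

def select_operator(ops, score, rounds=200, t0=1.0, cooling=0.97, rng=None):
    """Simulated-annealing operator selection driven by a feedback score.

    `score(op)` should grow with how much interesting feedback (new
    coverage, suspicious logs) an operator's past test cases produced.
    """
    rng = rng or random.Random(0)
    current = rng.choice(ops)
    temperature = t0
    for _ in range(rounds):
        candidate = rng.choice(ops)
        delta = score(candidate) - score(current)
        # Always accept a better candidate; accept a worse one with
        # probability exp(delta / T), which shrinks as T cools down.
        if delta >= 0 or rng.random() < math.exp(delta / temperature):
            current = candidate
        temperature *= cooling
    return current
```

The high early temperature leaves room for exploring rarely used operators, while cooling gradually concentrates selection on operators whose feedback scores are consistently high.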
FUEL/
├── config/                  # Configuration files
│   ├── als_prompt/          # Analysis prompt configurations
│   ├── gen_prompt/          # Generation prompt configurations
│   ├── heuristic.yaml       # Heuristic algorithm configuration
│   └── model/               # LLM model configuration
├── data/                    # Data files
│   ├── pytorch_apis.txt     # PyTorch API list
│   └── tensorflow_apis.txt  # TensorFlow API list
├── fuel/                    # Core source code
│   ├── difftesting/         # Differential testing module
│   ├── exec/                # Code execution module
│   ├── feedback/            # Feedback mechanism module
│   ├── guidance/            # Heuristic search module
│   └── utils/               # Utility classes
├── experiments/             # Experiment and evaluation scripts
└── results/                 # Test result outputs
Important
General test-bed requirements
- OS: Ubuntu >= 20.04
- CPU: x86-64
- GPU: CUDA-capable (V100, A6000, A100, etc.)
- Memory: 128 GB of GPU memory available (if you use a 72B local model with vLLM)
- Storage: at least 100 GB available
- Network: reliable access to GitHub and the LLM API service
You need a DeepSeek API key to invoke the DeepSeek API service (alternatively, you can modify the configuration in ./config/model.yaml to use a different LLM backend).
git clone https://github.com/NJU-iSE/FUEL.git
cd FUEL
First, install the necessary Python dependencies. We strongly recommend using uv to manage the Python environment. Please follow the commands below.
# install uv
curl -LsSf https://astral.sh/uv/install.sh | sh
# sync the dependencies at the root directory
uv sync
# activate the environment
source .venv/bin/activate
When fuzzing the systems under test (SUTs), we use the nightly builds in order to detect new bugs.
Here we use CUDA 12.6 as an example. Please install the nightly build matching your CUDA version; you can get the corresponding commands from https://pytorch.org/
UV_HTTP_TIMEOUT=180 uv pip install --pre torch torchvision torchaudio --index-url https://download.pytorch.org/whl/nightly/cu126
In our experiments, we use the DeepSeek API to invoke the LLM service. The DeepSeek API is compatible with the OpenAI interface.
In the commands below, replace [YOUR_API_KEY] with your own DeepSeek API key.
key="[YOUR_API_KEY]"
echo "$key" > ./config/deepseek-key.txt
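Because the DeepSeek API follows the OpenAI chat-completions format, a request payload looks like the sketch below. The model name `deepseek-chat` and the endpoint URL in the comment are taken from DeepSeek's public documentation, but verify them against the current docs; the prompt contents here are purely illustrative.

```python
import json
from pathlib import Path

# Read the key stored by the command above; fall back to the placeholder
# if the file has not been created yet.
key_file = Path("./config/deepseek-key.txt")
key = key_file.read_text().strip() if key_file.exists() else "[YOUR_API_KEY]"

# A standard OpenAI-style chat-completion payload; "deepseek-chat" is
# DeepSeek's general-purpose model name (check their docs for current names).
payload = {
    "model": "deepseek-chat",
    "messages": [
        {"role": "system", "content": "You generate PyTorch test models."},
        {"role": "user", "content": "Generate a model mixing conv and pooling ops."},
    ],
}
headers = {
    "Authorization": f"Bearer {key}",
    "Content-Type": "application/json",
}
body = json.dumps(payload)  # POST this to https://api.deepseek.com/chat/completions
```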
Warning
The fuzzing process is time-consuming and may run for many hours to discover meaningful bugs.
python -m fuel.fuzz --lib pytorch run_fuzz \
--max_round 1000 \
--heuristic FASA \
--diff_type cpu_compiler
Parameter description:
- `--lib`: Target deep learning library (`pytorch` or `tensorflow`)
- `--max_round`: Maximum number of testing rounds
- `--heuristic`: Heuristic algorithm (`FASA`, `Random`, or `None`)
- `--diff_type`: Differential testing type (`hardware`, `cpu_compiler`, or `cuda_compiler`)
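The differential-testing oracle behind `--diff_type` boils down to running the same model under two backends and flagging numeric disagreement. Below is a backend-agnostic sketch; the tolerances, toy backends, and injected error are illustrative, not FUEL's actual implementation.

```python
import math

def diff_test(model, inputs, backend_a, backend_b, rtol=1e-4, atol=1e-5):
    """Run `model` on the same inputs under two backends and collect
    any outputs that disagree beyond the given tolerances."""
    out_a = backend_a(model, inputs)
    out_b = backend_b(model, inputs)
    return [
        (i, a, b)
        for i, (a, b) in enumerate(zip(out_a, out_b))
        if not math.isclose(a, b, rel_tol=rtol, abs_tol=atol)
    ]

# Toy backends: plain evaluation vs. one with a small injected error,
# standing in for eager execution vs. a (buggy) compiler backend.
eager = lambda m, xs: [m(x) for x in xs]
buggy_compiler = lambda m, xs: [m(x) + (0.1 if x == 2 else 0.0) for x in xs]

model = lambda x: x * x
mismatches = diff_test(model, [1, 2, 3], eager, buggy_compiler)
# `mismatches` flags input index 1, where the "compiled" result diverges
```

An empty result means the two backends agree within tolerance; any entry is a candidate bug to triage (it may still be a legitimate numerical difference rather than a framework defect).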
Note that the fuzzing experiment is really time-consuming; expect to check the results after roughly 20 hours.
Please check the generated models in `results/fuel/pytorch`.
If you want to get the detected bugs, please check `outputs/bug_reports.txt`.
Warning
These advanced features are not fully tested and may be unstable. We will continue improving our artifact.
python -m fuel.fuzz --lib pytorch run_fuzz \
--use_local_gen \
--max_round 1000 \
--heuristic FASA
python -m fuel.fuzz --lib pytorch run_fuzz \
--op_set data/custom_operators.txt \
--op_nums 8 \
--max_round 1000
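The `--op_set` file is assumed here to list one operator/API name per line, mirroring the plain-list format suggested by `data/pytorch_apis.txt` (adjust if FUEL expects a different format). Loading such a file and sampling `--op_nums` operators for one test model can be sketched as:

```python
import random

def load_op_set(path):
    """Read an operator list file: one API name per line, blanks ignored
    (assumed format, mirroring data/pytorch_apis.txt)."""
    with open(path) as f:
        return [line.strip() for line in f if line.strip()]

def sample_ops(ops, op_nums, seed=None):
    """Pick up to `op_nums` distinct operators to combine into one test model."""
    rng = random.Random(seed)
    return rng.sample(ops, min(op_nums, len(ops)))
```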
bash coverage.sh
So far, FUEL has detected 104 previously unknown bugs, with 93 already confirmed and 49 already fixed. 14 detected bugs were labeled as high-priority, and one was labeled as utmost priority. 14 detected bugs have been assigned CVE IDs. The evidence can be found in the Google Sheet.
- Shaoyu Yang: core developer
- Haifeng Lin: core developer
- Chunrong Fang: supervisor
We thank NNSmith, TitanFuzz, and WhiteFox for their admirable open-source spirit, which has largely inspired this work.