This project provides a complete workflow for fine-tuning Stable Diffusion 3.5 Medium using LoRA (Low-Rank Adaptation). Train on custom datasets with per-image prompts and generate high-quality images on consumer GPUs.
- Stable Diffusion 3.5 Medium - Latest model with excellent quality
- LoRA Fine-tuning - Efficient training without modifying base model
- Per-image Prompts - Custom prompts loaded from sidecar `.txt` files
- Comparison Tools - Side-by-side base vs LoRA model evaluation
- Web Scraping - Automated dataset collection with prompt file generation
- Python 3.9+
- NVIDIA GPU with 16GB+ VRAM (RTX 4080/4090, A6000, etc.)
- CUDA 12.2+ installed
- pipenv for dependency management
First, set up the Python environment and install the required packages.
```bash
# Install dependencies
pipenv install
```
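Before the first run, it can help to sanity-check the environment. This small script is an illustrative addition (not part of the repo): it verifies the Python version and uses the presence of `nvidia-smi` as a rough proxy for a working NVIDIA driver:

```python
import shutil
import sys

# Require Python 3.9+, per the prerequisites above
assert sys.version_info >= (3, 9), "Python 3.9 or newer is required"

# nvidia-smi on the PATH is a quick (imperfect) proxy for a usable GPU setup
if shutil.which("nvidia-smi") is None:
    print("Warning: nvidia-smi not found; GPU training may not work")
else:
    print("NVIDIA driver detected")
```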
Use webscraper.py to download images and create empty prompt files:
```bash
pipenv run python webscraper.py
```
- Enter search query and number of images
- Images saved to `training-images/` with matching `.txt` files
- Curate your dataset: remove low-quality/irrelevant images
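The sidecar generation step can be sketched in a few lines. This is a hypothetical standalone version, not the actual code from `webscraper.py`: it creates an empty `.txt` file next to every image that lacks one, ready for you to fill in.

```python
from pathlib import Path

IMAGE_EXTS = {".jpg", ".jpeg", ".png", ".webp"}

def create_prompt_sidecars(image_dir: str) -> int:
    """Create an empty .txt sidecar for every image that lacks one."""
    created = 0
    for path in Path(image_dir).iterdir():
        if path.suffix.lower() in IMAGE_EXTS:
            sidecar = path.with_suffix(".txt")
            if not sidecar.exists():
                sidecar.touch()
                created += 1
    return created
```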
Edit the .txt files to add descriptive prompts for each image:
```
training-images/
├── image1.jpg
├── image1.txt ← "A woman with brown hair smiling"
├── image2.jpg
├── image2.txt ← "A woman in a red dress outdoors"
└── ...
```
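Training needs each image paired with its prompt. A minimal loader for the layout above might look like this (illustrative only; `finetune.py`'s actual loader may differ), skipping any image whose sidecar is missing or empty:

```python
from pathlib import Path

IMAGE_EXTS = {".jpg", ".jpeg", ".png", ".webp"}

def load_dataset(image_dir: str) -> list:
    """Return (image_path, prompt) pairs from a directory of sidecar files."""
    pairs = []
    for path in sorted(Path(image_dir).iterdir()):
        if path.suffix.lower() not in IMAGE_EXTS:
            continue
        sidecar = path.with_suffix(".txt")
        prompt = sidecar.read_text().strip() if sidecar.exists() else ""
        if prompt:
            pairs.append((path, prompt))
        else:
            print(f"Skipping {path.name}: no prompt")
    return pairs
```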
Train your LoRA adapter with finetune.py:
```bash
pipenv run python finetune.py
```
Training will take some time, depending on your GPU and the number of images. The script will print the loss at each step. Once complete, the trained LoRA weights will be saved in the `lora_weights/` directory.
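The LoRA idea itself is compact: the base weight matrix `W` stays frozen, and training only learns two low-rank factors `A` and `B` whose product, scaled by `alpha / r`, is added to `W`. A dependency-free sketch of that update (for intuition; the real training uses tensor libraries):

```python
def lora_forward(x, W, A, B, alpha, r):
    """Compute y = x @ (W + (alpha / r) * B @ A) with plain lists.

    W: frozen base weights (d_in x d_out)
    B: d_in x r, A: r x d_out -- the only trained parameters
    """
    scale = alpha / r
    d_in, d_out = len(W), len(W[0])
    # Effective weight: frozen W plus the scaled low-rank update B @ A
    W_eff = [[W[i][j] + scale * sum(B[i][k] * A[k][j] for k in range(r))
              for j in range(d_out)] for i in range(d_in)]
    # y = x @ W_eff
    return [sum(x[i] * W_eff[i][j] for i in range(d_in)) for j in range(d_out)]
```

Because only `A` and `B` are trained, the adapter is tiny compared to the base model, which is why LoRA fits on consumer GPUs.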
Generate images with your fine-tuned model:
```bash
pipenv run python inference.py
```
Compare base model vs LoRA model side-by-side:
```bash
pipenv run python compare.py
```
Generates 4 comparison images:
- `base_baseline.png` - Base model + simple prompt
- `base_test.png` - Base model + your prompt
- `lora_baseline.png` - LoRA model + simple prompt
- `lora_test.png` - LoRA model + your prompt
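The four outputs are simply the cross product of two models and two prompt variants. A small sketch of that naming scheme (hypothetical helper, not code from `compare.py`):

```python
from itertools import product

def comparison_filenames(models=("base", "lora"), variants=("baseline", "test")):
    """One output image per (model, prompt-variant) combination."""
    return [f"{m}_{v}.png" for m, v in product(models, variants)]
```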
- Quality over Quantity - 20-50 high-quality images work better than hundreds of poor ones
- Diverse Prompts - Use varied, descriptive prompts for each image
- Consistent Style - Keep similar lighting/composition for style training
- Monitor Loss - Use training loss and comparison script to refine fine-tuning
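Raw per-step loss is noisy, so one common way to monitor it is a fixed-window moving average. This is a generic sketch, not code from `finetune.py`:

```python
from collections import deque

class LossTracker:
    """Smooth noisy per-step losses with a fixed-window moving average."""

    def __init__(self, window: int = 50):
        self.losses = deque(maxlen=window)

    def update(self, loss: float) -> float:
        """Record one step's loss and return the current moving average."""
        self.losses.append(loss)
        return sum(self.losses) / len(self.losses)
```

If the smoothed loss plateaus or the comparison images stop improving, that is usually the point to stop training rather than risk overfitting.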
Feel free to submit issues and pull requests to improve the project!
This project is open source. Please respect the licenses of the underlying models and libraries.