This project implements drivable area segmentation using the U-Net architecture on the BDD100K dataset. The model identifies three key areas in driving scenes: ego lane (direct drivable area), adjacent lanes (alternative drivable areas), and background (non-drivable areas).
U-Net Architecture - Original paper by Ronneberger et al.
BDD100K Drivable Area Segmentation (3 Classes)
The dataset is automatically downloaded from Google Drive when you run the training script for the first time.
- Dataset Source: Pre-processed BDD100K drivable area data (180×320 resolution)
- Google Drive ID: 1sX6kHxpYoEICMTfjxxhK9lTW3B7OUxql
- Size: ~100MB (compressed)
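The download itself is handled by the training script on first run; as a rough sketch of the same idea (assuming the gdown package, hypothetical local paths, and that the archive is a zip file):

```python
import os
import zipfile

import gdown  # pip install gdown

DRIVE_ID = "1sX6kHxpYoEICMTfjxxhK9lTW3B7OUxql"
ARCHIVE_PATH = "data/dataset.zip"   # hypothetical path, for illustration only
EXTRACT_DIR = "data/dataset"

if not os.path.isdir(EXTRACT_DIR):
    os.makedirs("data", exist_ok=True)
    # Download the compressed dataset (~100MB) from Google Drive by file ID
    gdown.download(id=DRIVE_ID, output=ARCHIVE_PATH, quiet=False)
    # Assumes a zip archive; the real script may use a different format
    with zipfile.ZipFile(ARCHIVE_PATH) as zf:
        zf.extractall(EXTRACT_DIR)
```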
| Class ID | Category | Color (RGB) | Description |
|---|---|---|---|
| 0 | direct | (171, 44, 236) | Current/ego lane - the lane the vehicle is driving in |
| 1 | alternative | (86, 211, 19) | Adjacent/alternative lanes - other drivable lanes |
| 2 | background | (0, 0, 0) | Non-drivable areas - sidewalks, buildings, etc. |
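For visualization, these class IDs map to the RGB palette in the table; a minimal sketch of decoding a class-ID mask into a color image could look like this (hypothetical helper, not necessarily the project's own code):

```python
import numpy as np

# Class ID -> RGB color, as listed in the table above
PALETTE = {
    0: (171, 44, 236),  # direct / ego lane
    1: (86, 211, 19),   # alternative / adjacent lanes
    2: (0, 0, 0),       # background / non-drivable
}

def colorize_mask(mask: np.ndarray) -> np.ndarray:
    """Convert an (H, W) class-ID mask into an (H, W, 3) RGB image."""
    rgb = np.zeros((*mask.shape, 3), dtype=np.uint8)
    for class_id, color in PALETTE.items():
        rgb[mask == class_id] = color
    return rgb
```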
This 3-class approach is essential for:
- Lane keeping assistance systems
- Autonomous navigation and path planning
- Drivable area detection for ADAS
- Real-time decision making in autonomous vehicles
| Metric | Value |
|---|---|
| Best Mean IoU | 75.07% |
| Best Validation Loss | 0.2200 |
| Final Training Loss | 0.0594 |
| Training Time | ~2 hours (RTX 3060) |
| Inference Speed | 30+ FPS (GPU) |
The model demonstrates excellent convergence: training and validation loss decrease steadily, mean IoU improves consistently, and no overfitting is observed (validation loss tracks training loss).
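Mean IoU here is the intersection-over-union computed per class and averaged over the three classes; the project's metric code lives in utils.py, but a minimal sketch of the idea (hypothetical function, assuming integer class-ID masks) is:

```python
import numpy as np

def mean_iou(pred: np.ndarray, target: np.ndarray, num_classes: int = 3) -> float:
    """Average per-class IoU between two integer class-ID masks."""
    ious = []
    for c in range(num_classes):
        pred_c, target_c = (pred == c), (target == c)
        intersection = np.logical_and(pred_c, target_c).sum()
        union = np.logical_or(pred_c, target_c).sum()
        if union > 0:  # skip classes absent from both masks
            ious.append(intersection / union)
    return float(np.mean(ious))
```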
The model accurately segments:
- Magenta regions: Ego lane (safe to drive straight)
- Green regions: Adjacent lanes (safe for lane changes)
- Black regions: Non-drivable areas (obstacles, sidewalks, buildings)
The model performs real-time segmentation on a variety of driving scenarios; demo videos are included in the media/ directory.
The U-Net architecture consists of:
Encoder (Contracting Path)
- 4 downsampling blocks with max pooling
- Layer channels: [64, 128, 256, 512]
- Each block: 2× (Conv2D → BatchNorm → ReLU)
Bottleneck
- Double convolution at lowest resolution
- 1024 channels for maximum feature extraction
Decoder (Expanding Path)
- 4 upsampling blocks with skip connections
- Transposed convolutions for spatial resolution recovery
- Feature fusion via concatenation with encoder outputs
Output Layer
- 1×1 convolution for 3-class pixel-wise classification
- Total Parameters: 31,037,763
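The block structure above can be summarized in a compact PyTorch sketch. This is an illustration, not necessarily identical to the implementation in unet_segmentation.py, and its parameter count may differ slightly from the figure quoted above:

```python
import torch
import torch.nn as nn

class DoubleConv(nn.Module):
    """Two (Conv2D -> BatchNorm -> ReLU) layers, as used in every block."""
    def __init__(self, in_ch, out_ch):
        super().__init__()
        self.block = nn.Sequential(
            nn.Conv2d(in_ch, out_ch, 3, padding=1, bias=False),
            nn.BatchNorm2d(out_ch),
            nn.ReLU(inplace=True),
            nn.Conv2d(out_ch, out_ch, 3, padding=1, bias=False),
            nn.BatchNorm2d(out_ch),
            nn.ReLU(inplace=True),
        )

    def forward(self, x):
        return self.block(x)

class UNet(nn.Module):
    def __init__(self, in_ch=3, num_classes=3, features=(64, 128, 256, 512)):
        super().__init__()
        self.pool = nn.MaxPool2d(2)
        # Encoder: 4 downsampling blocks
        self.downs = nn.ModuleList()
        ch = in_ch
        for f in features:
            self.downs.append(DoubleConv(ch, f))
            ch = f
        # Bottleneck at the lowest resolution (1024 channels)
        self.bottleneck = DoubleConv(features[-1], features[-1] * 2)
        # Decoder: transposed convolutions + skip-connection concatenation
        self.ups = nn.ModuleList()
        self.up_convs = nn.ModuleList()
        for f in reversed(features):
            self.ups.append(nn.ConvTranspose2d(f * 2, f, kernel_size=2, stride=2))
            self.up_convs.append(DoubleConv(f * 2, f))
        # 1x1 convolution for 3-class pixel-wise classification
        self.head = nn.Conv2d(features[0], num_classes, kernel_size=1)

    def forward(self, x):
        skips = []
        for down in self.downs:
            x = down(x)
            skips.append(x)
            x = self.pool(x)
        x = self.bottleneck(x)
        for up, up_conv, skip in zip(self.ups, self.up_convs, reversed(skips)):
            x = up(x)
            # 180x320 inputs produce odd intermediate sizes; align before concatenation
            if x.shape[-2:] != skip.shape[-2:]:
                x = nn.functional.interpolate(x, size=skip.shape[-2:])
            x = up_conv(torch.cat([skip, x], dim=1))
        return self.head(x)
```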
Loss Function: Dice Loss (multiclass)
Optimizer: Adam
Learning Rate: 3e-4 (OneCycleLR scheduler)
Batch Size: 8
Input Resolution: 180×320×3
Output Classes: 3
Data Split: 70% train, 20% val, 10% test
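With this configuration, a minimal training-setup sketch might look like the following. UNet refers to the architecture sketch above; EPOCHS and train_loader are hypothetical placeholders, and the multiclass Dice loss from segmentation_models_pytorch (acknowledged below) is assumed rather than confirmed:

```python
import torch
import segmentation_models_pytorch as smp

# Assumptions for illustration: the real script defines its own values.
EPOCHS = 30
# train_loader: an assumed DataLoader yielding (images, masks) batches of size 8,
# images shaped (B, 3, 180, 320) and masks shaped (B, 180, 320) with class IDs 0-2.
STEPS_PER_EPOCH = len(train_loader)

device = "cuda" if torch.cuda.is_available() else "cpu"
model = UNet(in_ch=3, num_classes=3).to(device)
criterion = smp.losses.DiceLoss(mode="multiclass")
optimizer = torch.optim.Adam(model.parameters(), lr=3e-4)
scheduler = torch.optim.lr_scheduler.OneCycleLR(
    optimizer, max_lr=3e-4, epochs=EPOCHS, steps_per_epoch=STEPS_PER_EPOCH
)

for epoch in range(EPOCHS):
    for images, masks in train_loader:
        optimizer.zero_grad()
        logits = model(images.to(device))           # (B, 3, 180, 320)
        loss = criterion(logits, masks.to(device).long())
        loss.backward()
        optimizer.step()
        scheduler.step()                            # OneCycleLR steps per batch
```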
Requirements
- Python 3.8+
- CUDA-capable GPU (recommended for training)
- 8GB+ RAM
- 2GB+ disk space
Installation
- Clone the repository

```bash
git clone https://github.com/Mark-Moawad/UNet-Drivable-Area-Segmentation.git
cd UNet-Drivable-Area-Segmentation
```

- Create a virtual environment

```bash
python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate
```

- Install dependencies

```bash
pip install -r requirements.txt
```

The dataset (BDD100K drivable area subset) will be automatically downloaded on first run.
To train the model, run:

```bash
python unet_segmentation.py
```

The script automatically:
- Downloads and extracts the BDD100K dataset (3,430 images)
- Splits data into train/val/test sets
- Trains the U-Net model with threshold-based early stopping
- Saves the best model checkpoint
- Generates training curves and prediction visualizations
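The exact pipeline lives in unet_segmentation.py; as a rough sketch, the 70/20/10 split over the pre-processed arrays could be done with a shuffled index split (array shapes and the seed below are assumptions):

```python
import numpy as np

images = np.load("data/dataset/image_180_320.npy")  # assumed shape: (3430, 180, 320, 3)
labels = np.load("data/dataset/label_180_320.npy")  # assumed shape: (3430, 180, 320)

rng = np.random.default_rng(seed=42)                 # hypothetical seed
idx = rng.permutation(len(images))
n_train = int(0.7 * len(idx))
n_val = int(0.2 * len(idx))

train_idx = idx[:n_train]
val_idx = idx[n_train:n_train + n_val]
test_idx = idx[n_train + n_val:]

x_train, y_train = images[train_idx], labels[train_idx]
x_val, y_val = images[val_idx], labels[val_idx]
x_test, y_test = images[test_idx], labels[test_idx]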
To run inference on your own driving videos:
- Enable video processing in unet_segmentation.py:

```python
process_videos_flag = True
```

- Place videos in data/dataset/testing/
- Run inference:

```bash
python unet_segmentation.py
```

Output videos with overlaid segmentation masks will be saved to data/processed/.
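The real overlay pipeline is implemented in unet_segmentation.py and utils.py; the following is only a rough sketch of the idea, assuming OpenCV, the UNet and colorize_mask sketches above, a state_dict checkpoint, a hypothetical input file name, and simple 0–1 input scaling (the real preprocessing may differ):

```python
import cv2
import torch

device = "cuda" if torch.cuda.is_available() else "cpu"
model = UNet(in_ch=3, num_classes=3).to(device).eval()
model.load_state_dict(torch.load("data/models/UNet_baseline.pt", map_location=device))

cap = cv2.VideoCapture("data/dataset/testing/drive.mp4")   # hypothetical file name
fps = cap.get(cv2.CAP_PROP_FPS)
out = cv2.VideoWriter("data/processed/drive_overlay.mp4",
                      cv2.VideoWriter_fourcc(*"mp4v"), fps, (320, 180))

while True:
    ok, frame = cap.read()
    if not ok:
        break
    frame = cv2.resize(frame, (320, 180))                   # match training resolution
    # BGR -> RGB, HWC -> CHW, scale to [0, 1]
    x = torch.from_numpy(frame[..., ::-1].copy()).permute(2, 0, 1).float() / 255.0
    with torch.no_grad():
        pred = model(x.unsqueeze(0).to(device)).argmax(1)[0].cpu().numpy()
    overlay = cv2.cvtColor(colorize_mask(pred), cv2.COLOR_RGB2BGR)
    out.write(cv2.addWeighted(frame, 0.6, overlay, 0.4, 0))

cap.release()
out.release()
```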
```
UNet-Drivable-Area-Segmentation/
├── unet_segmentation.py # Main training & inference pipeline
├── utils.py # Utility functions (metrics, visualization)
├── requirements.txt # Python dependencies
├── README.md # This file
│
├── data/
│ ├── dataset/ # BDD100K dataset (auto-downloaded)
│ │ ├── image_180_320.npy # Pre-processed images (3,430 samples)
│ │ ├── label_180_320.npy # Segmentation labels
│ │ └── testing/ # Test videos for inference
│ ├── models/ # Trained model checkpoints
│ │ ├── UNet_baseline.pt # Best model weights
│ │ └── UNet_baseline_training_stats.csv
│ ├── outputs/ # Training visualizations
│ │ ├── UNet_baseline_training_curves.png
│ │ └── UNet_baseline_predictions.png
│ ├── processed/ # Inference output videos
│
├── media/ # Photos and demo videos for README
│
└── venv/                    # Python virtual environment
```
- U-Net Paper: Convolutional Networks for Biomedical Image Segmentation (Ronneberger et al., 2015)
- BDD100K Dataset: A Diverse Driving Video Database (Yu et al., 2018)
- Berkeley DeepDrive: Official Website
This project is licensed under the MIT License - see the LICENSE file for details.
Mark Moawad
Autonomous Systems Engineer | Computer Vision Specialist
This project demonstrates practical computer vision and deep learning skills for autonomous driving applications, showcasing end-to-end development from model training to production-ready inference.
- Original U-Net architecture by Ronneberger, Fischer, and Brox
- BDD100K dataset team at UC Berkeley
- PyTorch and segmentation_models_pytorch communities
For questions, collaboration opportunities, or professional inquiries:
- GitHub: @Mark-Moawad
- Email: [email protected]
- LinkedIn: https://www.linkedin.com/in/markmoawad96/