TensorFlow GPU Docker Setup

This repository contains Docker configuration for running TensorFlow with GPU support. The setup includes optimizations for WSL2 environments and includes all necessary packages for data science and deep learning tasks.

Prerequisites

Docker installed
Docker Compose installed
NVIDIA GPU with compatible drivers
NVIDIA Container Toolkit installed
If using WSL2: Properly configured GPU passthrough

Quick Start

Using Docker Compose (Recommended)

# Build and run with Docker Compose
docker-compose up --build

# Run in detached mode
docker-compose up --build -d

# Access the running container
docker exec -it tensorflow-gpu-custom bash

# Stop the container
docker-compose down

Using Docker Commands

# Build the image
docker build -t tensorflow-gpu-custom -f Dockerfile.gpu .

# Run with GPU support
docker run --gpus all --name tensorflow-gpu-custom -it tensorflow-gpu-custom

# Run the comprehensive GPU test
docker run --gpus all -it tensorflow-gpu-custom python /app/test_gpu.py

Using Pre-built Container from GitHub

This repository publishes a pre-built container to GitHub Container Registry, which you can use directly:

# Pull the pre-built image
docker pull ghcr.io/mrgkanev/tensorflow-gpu-custom:latest

# Run with GPU support
docker run --gpus all -it ghcr.io/mrgkanev/tensorflow-gpu-custom

# Run the comprehensive GPU test
docker run --gpus all -it ghcr.io/mrgkanev/tensorflow-gpu-custom python /app/test_gpu.py

Included Packages

The Docker image includes the following packages:

TensorFlow 2.11.0 with GPU support
NumPy
Pandas
scikit-learn
CUDA and cuDNN libraries

Performance Optimizations

The Docker image includes several performance optimizations:

GPU memory growth configuration
Thread management for TensorFlow
cuDNN optimizations
Memory allocation to prevent OOM errors

Benchmarks

Usage

# Simple run (all benchmarks, standard sizes)
python /app/tf_benchmark.py

# Quick run with smaller sizes
python /app/tf_benchmark.py --small

# Run specific benchmarks
python /app/tf_benchmark.py --matrix  # Matrix multiplication only
python /app/tf_benchmark.py --dense   # Dense layer only
python /app/tf_benchmark.py --conv    # Convolution only

Detailed Setup Instructions

1. Using Docker Compose

Save the Dockerfile.gpu and docker-compose.yml in your project directory:

# Build and start the container with GPU support
docker-compose up --build

This will build the Docker image based on Dockerfile.gpu and start the container with proper GPU access.

2. Building the Docker Image Manually

Save the Dockerfile.gpu and test_gpu.py in your project directory, then build the image:

docker build -t tensorflow-gpu-custom -f Dockerfile.gpu .

This command creates a Docker image named tensorflow-gpu-custom based on the content of Dockerfile.gpu.

3. Run the Container with GPU Access

To run the container with GPU support:

docker run --gpus all -it tensorflow-gpu-custom

The --gpus all flag is what enables GPU access from within the container.

4. Verifying GPU Access

Once inside the container, you can verify GPU access with these methods:

Option 1: Quick verification

import tensorflow as tf
print(tf.config.list_physical_devices('GPU'))

Option 2: Run the comprehensive test script

python /app/check_gpu.py

or

python /app/test_gpu.py

These scripts perform:

GPU detection and environment checks
Matrix multiplication benchmarks comparing CPU vs GPU speed
Simple neural network training test

GPU Test Script

The included test_gpu.py script provides comprehensive testing of your GPU setup:

# Run directly from host
docker run --gpus all -it tensorflow-gpu-custom python /app/test_gpu.py

# Or run from inside the container
python /app/test_gpu.py

The test script will:

Check if TensorFlow can detect your GPU
Run performance benchmarks using matrix multiplication
Train a simple neural network to verify end-to-end functionality
Provide detailed diagnostics if GPU is not detected

Using with PyCharm

To use this Docker container with PyCharm:

Configure Docker integration in PyCharm:
- Go to Settings/Preferences → Build, Execution, Deployment → Docker
- Add your Docker connection
Configure Python Interpreter:
- Go to Settings/Preferences → Project → Python Interpreter
- Click the gear icon → Add → Docker
- Select your Docker server and the tensorflow-gpu-custom image
Fix common TensorFlow errors:
- If you encounter optimizer errors with LSTM models (like KeyError: 'The optimizer cannot recognize variable lstm/lstm_cell/kernel:0'), use the legacy optimizers:
```
# Change this:
from tensorflow.keras.optimizers import Adam

# To this:
from tensorflow.keras.optimizers.legacy import Adam
```
- Configure GPU memory growth at the start of your script:
```
gpus = tf.config.list_physical_devices('GPU')
for gpu in gpus:
    tf.config.experimental.set_memory_growth(gpu, True)
```
- See the included tensorflow_pycharm_fix.py script for a complete example
Configure Run Configuration:
- Create a new Run Configuration
- Set the Docker container as the target environment
- Set the working directory to your project folder
- Map your local project directory to a directory in the container

Common Issues and Troubleshooting

GPU Not Detected

If you see GPUs available: [] or errors about CUDA, check:

NVIDIA Driver Installation
```
# On host machine
nvidia-smi
```

NVIDIA Container Toolkit Installation

# On Linux host
dpkg -l | grep nvidia-container-toolkit

# Install if missing
sudo apt-get install -y nvidia-container-toolkit
sudo systemctl restart docker

Test with NVIDIA's Base Container

docker run --gpus all --rm nvidia/cuda:11.6.2-base-ubuntu20.04 nvidia-smi

WSL2-Specific Issues

Install NVIDIA Drivers for WSL
- Download the NVIDIA CUDA driver for WSL
- Install on your Windows host (not inside WSL)
Enable GPU in Docker Desktop
- Open Docker Desktop Settings
- Go to Resources → WSL Integration
- Check "Enable NVIDIA GPU support in WSL 2"
Configure WSL
- Create or edit %USERPROFILE%\.wslconfig file in Windows with:
```
[wsl2]
kernelCommandLine = systemd.unified_cgroup_hierarchy=0
```
Restart WSL
```
wsl --shutdown
```

Performance Optimization

Configure Memory Growth

gpus = tf.config.list_physical_devices('GPU')
for gpu in gpus:
  tf.config.experimental.set_memory_growth(gpu, True)

Monitor GPU Usage

# From host or another terminal
watch -n 0.5 nvidia-smi

Docker Commands Quick Reference

Build with Docker Compose: docker-compose up --build
Stop Docker Compose container: docker-compose down
Build the image: docker build -t tensorflow-gpu-custom -f Dockerfile.gpu .
Run with GPU access: docker run --gpus all -it tensorflow-gpu-custom
Run with volume mount: docker run --gpus all -v $(pwd):/app -it tensorflow-gpu-custom
Check existing images: docker images
Check running containers: docker ps
Execute commands in running container: docker exec -it tensorflow-gpu-custom bash

Sample Training Code

import numpy as np
import tensorflow as tf
from tensorflow.keras import layers
from tensorflow.keras.optimizers.legacy import Adam

# Check for GPU
gpus = tf.config.list_physical_devices('GPU')
print(f"GPUs available: {gpus}")

# Optional: Enable memory growth
for gpu in gpus:
    tf.config.experimental.set_memory_growth(gpu, True)

# Create a simple model
model = tf.keras.Sequential([
    layers.Dense(128, activation='relu', input_shape=(10,)),
    layers.Dense(64, activation='relu'),
    layers.Dense(1, activation='sigmoid')
])

# Compile the model
model.compile(optimizer=Adam(), loss='binary_crossentropy', metrics=['accuracy'])

# Generate some fake data
X = np.random.random((1000, 10))
y = np.random.randint(2, size=(1000, 1))

# Train the model
model.fit(X, y, epochs=5, batch_size=32)

Notes

The Docker image is based on TensorFlow 2.11.0-gpu for optimal compatibility
The container includes proper environment setup for CUDA paths
If you encounter memory issues, adjust batch sizes or enable memory growth
Environment variables are set in the Dockerfile for optimal TensorFlow performance

Fixing Common TensorFlow Errors

KeyError with Optimizers

If you see this error:

KeyError: 'The optimizer cannot recognize variable lstm/lstm_cell/kernel:0'

This happens because TensorFlow 2.11.0 has both experimental and legacy optimizers. The experimental optimizers have compatibility issues with certain layer types (especially LSTM).

Solution:

Use legacy optimizers:

# Change this
from tensorflow.keras.optimizers import Adam

# To this
from tensorflow.keras.optimizers.legacy import Adam

Make sure your input shapes are correct:

# Check input shape expectations
print(model.input_shape)

Fix shape mismatches in your data. If your error mentions:

Model was constructed with shape (None, 30, 1) but was called on input with incompatible shape (64, 1, 1)

Ensure your data matches the expected dimensions:

# Reshape your data to match expected input
X = X.reshape(batch_size, sequence_length, features)

See the provided tensorflow_pycharm_fix.py script for complete examples

Name		Name	Last commit message	Last commit date
Latest commit History 46 Commits
.devcontainer		.devcontainer
.github		.github
scripts		scripts
.gitignore		.gitignore
Dockerfile.gpu		Dockerfile.gpu
LICENSE		LICENSE
README.md		README.md
TROUBLESHOOTING.MD		TROUBLESHOOTING.MD
docker-compose.yml		docker-compose.yml
entrypoint.sh		entrypoint.sh
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

TensorFlow GPU Docker Setup

Prerequisites

Quick Start

Using Docker Compose (Recommended)

Using Docker Commands

Using Pre-built Container from GitHub

Included Packages

Performance Optimizations

Benchmarks

Usage

Detailed Setup Instructions

1. Using Docker Compose

2. Building the Docker Image Manually

3. Run the Container with GPU Access

4. Verifying GPU Access

Option 1: Quick verification

Option 2: Run the comprehensive test script

GPU Test Script

Using with PyCharm

Common Issues and Troubleshooting

GPU Not Detected

WSL2-Specific Issues

Performance Optimization

Docker Commands Quick Reference

Sample Training Code

Notes

Fixing Common TensorFlow Errors

KeyError with Optimizers

About

Uh oh!

Uh oh!

Contributors 3

Uh oh!

Languages

License

MrGKanev/TensorFlow-GPU-Docker-Setup

Folders and files

Latest commit

History

Repository files navigation

TensorFlow GPU Docker Setup

Prerequisites

Quick Start

Using Docker Compose (Recommended)

Using Docker Commands

Using Pre-built Container from GitHub

Included Packages

Performance Optimizations

Benchmarks

Usage

Detailed Setup Instructions

1. Using Docker Compose

2. Building the Docker Image Manually

3. Run the Container with GPU Access

4. Verifying GPU Access

Option 1: Quick verification

Option 2: Run the comprehensive test script

GPU Test Script

Using with PyCharm

Common Issues and Troubleshooting

GPU Not Detected

WSL2-Specific Issues

Performance Optimization

Docker Commands Quick Reference

Sample Training Code

Notes

Fixing Common TensorFlow Errors

KeyError with Optimizers

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Uh oh!

Contributors 3

Uh oh!

Languages