This comprehensive guide explains how to use the Neo4j Kubernetes Operator to back up and restore your Neo4j Enterprise clusters. The operator provides advanced backup and restore capabilities through Neo4jBackup and Neo4jRestore Custom Resources, supporting multiple storage backends, scheduled backups, point-in-time recovery, and more.
New to backup and restore? Start here for an immediate working backup solution:
```bash
# 1. Create admin credentials (if not already done)
kubectl create secret generic neo4j-admin-secret \
  --from-literal=username=neo4j --from-literal=password=admin123

# 2. Apply a simple backup to local storage
kubectl apply -f examples/backup-restore/backup-pvc-simple.yaml

# Watch backup status
kubectl get neo4jbackups simple-backup -w

# Check backup job logs
kubectl logs job/simple-backup-backup
```

- ✅ Backup Resource: Backs up your `single-node-cluster` to local PVC storage
- ✅ Compression: Automatically compresses backup data
- ✅ Verification: Validates backup integrity after creation
- ✅ Retention: Keeps the 5 most recent backups
🎯 Success Indicator: Status should show Completed with BackupSuccessful condition.
- 👥 Teams/Production: Continue to Cloud Storage Setup → Scheduled Backups
- 🔧 Developers: Try Database-Specific Backups → Restore Testing
- 🏢 Enterprise: Jump to Point-in-Time Recovery → Advanced Configuration
- Neo4j Enterprise cluster running version 5.26.0+ (semver) or 2025.01.0+ (calver)
- Kubernetes cluster with the Neo4j Operator installed
- Appropriate storage backend configured (S3, GCS, Azure, or PVC)
- Admin credentials for the Neo4j cluster
The backup and restore functionality requires Neo4j Enterprise version 5.26.0 or later, or calver versions 2025.01.0 and later. The operator will automatically validate the Neo4j version before performing backup or restore operations.
Supported Versions:
- Semver: 5.26.0, 5.26.1 (5.26.x is the final semver LTS line; no 5.27+ exists)
- Calver: 2025.01.0, 2025.06.1, 2026.01.0, etc.
- Enterprise Tags: 5.26.0-enterprise, 2025.01.0-enterprise, etc.
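The version floor above can be checked before you enable backups. Here is a small, hedged shell helper: `sort -V` orders release strings, so both semver and calver tags can be compared against the 5.26.0 minimum (how you obtain the running version, for example from the image tag, is up to you):

```shell
# Returns success when a Neo4j version string meets the 5.26.0 minimum.
# Calver tags (2025.01.0+) sort above 5.26.0 with sort -V, so they pass too.
meets_backup_minimum() {
  min="5.26.0"
  # The minimum must sort first (or be equal) for the version to qualify.
  [ "$(printf '%s\n%s\n' "$min" "$1" | sort -V | head -n 1)" = "$min" ]
}

meets_backup_minimum "5.26.1"    && echo "5.26.1: supported"
meets_backup_minimum "2025.01.0" && echo "2025.01.0: supported"
meets_backup_minimum "5.25.3"    || echo "5.25.3: too old"
```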
The Neo4j Kubernetes Operator uses a backup sidecar architecture for reliability:
1. Backup Sidecar Container: Every Neo4j pod includes an automatic backup sidecar
   - Handles backup execution directly on the Neo4j node
   - Allocated 1Gi memory to prevent OOM issues
   - Monitors the `/backup-requests` volume for backup jobs
2. Backup Job: When you create a `Neo4jBackup` resource, the operator:
   - Creates a Kubernetes Job that connects to the sidecar
   - Sends the backup request via a shared volume
   - Monitors backup progress and status
3. Path Management: The backup sidecar automatically:
   - Creates the full backup destination path before execution
   - Handles the Neo4j 5.26+ requirement that paths must exist
   - Manages backup retention and cleanup
4. RBAC Management: The operator automatically:
   - Creates necessary service accounts in each namespace
   - Sets up roles with `pods/exec` and `pods/log` permissions for backup jobs
   - Manages role bindings for secure backup execution
   - No manual RBAC configuration is required; all permissions are handled automatically
- Path Creation: Neo4j 5.26+ and 2025.x+ require the backup destination path to exist before running the backup command. The operator handles this automatically.
- Memory Requirements: The backup sidecar requires 1Gi memory for reliable operation
- Direct Execution: Backups run directly on Neo4j nodes, not through kubectl
- RBAC: Starting with the latest version, the operator automatically creates all necessary RBAC resources. No manual role or binding creation is needed.
- Permissions: The operator grants backup jobs the ability to execute commands in pods (`pods/exec`) and read pod logs (`pods/log`)
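For reference, the permissions the operator provisions boil down to a Role along these lines. This is an illustrative sketch only: the name and exact verbs are assumptions, and you never create this yourself.

```yaml
# Illustrative only: the operator creates and owns these resources.
apiVersion: rbac.authorization.k8s.io/v1
kind: Role
metadata:
  name: neo4j-backup-role   # name is an assumption for illustration
rules:
  - apiGroups: [""]
    resources: ["pods/exec"]
    verbs: ["create"]        # exec sessions are created
  - apiGroups: [""]
    resources: ["pods/log"]
    verbs: ["get"]           # logs are read
```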
The operator supports two types of backups:
- Cluster Backup: Backs up all databases in the cluster (default)
- Database Backup: Backs up a specific database
Backups can be performed as:
- One-time backups: Created immediately
- Scheduled backups: Automated using cron expressions
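The two modes use the same resource; the presence of `spec.schedule` (a standard five-field cron expression) is what makes a backup recurring. A minimal sketch:

```yaml
# One-time: omit schedule; the backup runs once when the resource is created.
# Scheduled: add a cron expression, e.g. hourly at minute 0:
spec:
  schedule: "0 * * * *"   # omit this line for a one-time backup
```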
The operator supports multiple storage backends:
| Backend | Type | Best For | Difficulty | Cost |
|---|---|---|---|---|
| PVC | `pvc` | Development, testing | 🟢 Beginner | Low |
| S3 | `s3` | Production, AWS environments | 🟡 Intermediate | Medium |
| GCS | `gcs` | Production, GCP environments | 🟡 Intermediate | Medium |
| Azure | `azure` | Production, Azure environments | 🟡 Intermediate | Medium |
💡 Choosing a Storage Backend:
- Just starting? Use PVC for simplicity and local testing
- Production ready? Choose the cloud provider matching your cluster
- Multi-cloud? S3-compatible storage offers the most flexibility
The simplest backup option using local Kubernetes storage:
```yaml
apiVersion: neo4j.neo4j.com/v1alpha1
kind: Neo4jBackup
metadata:
  name: simple-backup
spec:
  target:
    kind: Cluster
    name: single-node-cluster
  storage:
    type: pvc
    pvc:
      name: backup-storage
      size: 50Gi
      storageClassName: standard
  options:
    compress: true
    verify: true
  retention:
    maxCount: 5
```

✅ Perfect for: Development, testing, getting started ⏱️ Setup time: 2 minutes 📋 Prerequisites: None beyond basic cluster
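The `maxCount: 5` retention above is count-based: once a sixth backup lands, the oldest is pruned. A toy shell sketch of that policy (illustrative only; the backup sidecar implements this internally):

```shell
# Keep the $1 newest entries from a newest-first list; print what to prune.
prune_candidates() {
  keep="$1"; shift
  printf '%s\n' "$@" | tail -n +"$((keep + 1))"
}

# Newest-first list of four backups, keeping 2: the two oldest are pruned.
prune_candidates 2 backup-04 backup-03 backup-02 backup-01
```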
```yaml
apiVersion: neo4j.neo4j.com/v1alpha1
kind: Neo4jBackup
metadata:
  name: cluster-backup-s3
spec:
  target:
    kind: Cluster
    name: my-neo4j-cluster
  storage:
    type: s3
    bucket: my-backup-bucket
    path: neo4j-backups/cluster
    cloud:
      provider: aws
  options:
    compress: true
    verify: true
  retention:
    maxAge: "30d"
    maxCount: 10
```

✅ Perfect for: Production AWS environments, long-term retention ⏱️ Setup time: 10 minutes 📋 Prerequisites: AWS S3 bucket, IAM credentials or roles
```yaml
apiVersion: neo4j.neo4j.com/v1alpha1
kind: Neo4jBackup
metadata:
  name: database-backup-gcs
spec:
  target:
    kind: Database
    name: myapp-db
  storage:
    type: gcs
    bucket: my-gcs-backup-bucket
    path: neo4j-backups/myapp
    cloud:
      provider: gcp
  options:
    compress: true
  encryption:
    enabled: true
    keySecret: backup-encryption-key
    algorithm: AES256
```

✅ Perfect for: Multi-database environments, GCP production ⏱️ Setup time: 15 minutes 📋 Prerequisites: GCS bucket, service account credentials
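The `keySecret: backup-encryption-key` reference above must point at an existing Kubernetes Secret in the same namespace. A hedged sketch of what that Secret could look like (the data field name shown is an assumption; confirm the expected field in the Neo4jBackup API reference):

```yaml
apiVersion: v1
kind: Secret
metadata:
  name: backup-encryption-key
type: Opaque
stringData:
  # Field name is an assumption; check the operator's API reference.
  encryption-key: <your-aes-256-key-material>
```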
```yaml
apiVersion: neo4j.neo4j.com/v1alpha1
kind: Neo4jBackup
metadata:
  name: backup-azure
spec:
  target:
    kind: Cluster
    name: production-cluster
  storage:
    type: azure
    bucket: backups  # Container name
    path: neo4j/production
    cloud:
      provider: azure
  options:
    compress: true
    verify: true
```

✅ Perfect for: Azure production environments ⏱️ Setup time: 10 minutes 📋 Prerequisites: Azure storage account, access keys
```yaml
apiVersion: neo4j.neo4j.com/v1alpha1
kind: Neo4jBackup
metadata:
  name: daily-backup
spec:
  target:
    kind: Cluster
    name: production-cluster
  schedule: "0 2 * * *"  # Daily at 2 AM UTC
  storage:
    type: s3
    bucket: production-backups
    path: daily
    cloud:
      provider: aws
  retention:
    maxAge: "7d"
    maxCount: 7
    deletePolicy: Delete
  options:
    compress: true
    verify: true
```

✅ Perfect for: Production environments, consistent backup strategy ⏱️ Setup time: 15 minutes 📋 Prerequisites: Production cluster, cloud storage 🔄 Schedule: Daily at 2 AM UTC - adjust timezone as needed
```yaml
apiVersion: neo4j.neo4j.com/v1alpha1
kind: Neo4jBackup
metadata:
  name: weekly-backup
spec:
  target:
    kind: Cluster
    name: production-cluster
  schedule: "0 1 * * 0"  # Weekly on Sunday at 1 AM UTC
  storage:
    type: gcs
    bucket: long-term-backups
    path: weekly
    cloud:
      provider: gcp
  retention:
    maxAge: "90d"
    maxCount: 12
    deletePolicy: Archive
  options:
    compress: true
    verify: true
  encryption:
    enabled: true
    keySecret: backup-encryption-key
```

✅ Perfect for: Enterprise compliance, long-term archival ⏱️ Setup time: 20 minutes 📋 Prerequisites: Encryption setup, compliance policies 🔄 Schedule: Weekly on Sunday - complements daily backups
You can temporarily suspend scheduled backups:
```yaml
apiVersion: neo4j.neo4j.com/v1alpha1
kind: Neo4jBackup
metadata:
  name: maintenance-backup
spec:
  target:
    kind: Cluster
    name: my-cluster
  schedule: "0 3 * * *"
  suspend: true  # Suspends the backup schedule
  storage:
    type: s3
    bucket: backups
    path: maintenance
```

The operator supports multiple restore sources:
- Backup Reference: Restore from an existing `Neo4jBackup` resource
- Storage Location: Restore directly from a storage path
- Point-in-Time Recovery (PITR): Restore to a specific point in time
```yaml
apiVersion: neo4j.neo4j.com/v1alpha1
kind: Neo4jRestore
metadata:
  name: restore-from-backup
spec:
  targetCluster: my-neo4j-cluster
  databaseName: neo4j
  source:
    type: backup
    backupRef: daily-backup
  options:
    verifyBackup: true
    replaceExisting: true
    force: false
    stopCluster: true
```

✅ Perfect for: Quick recovery, testing restore procedures ⏱️ Restore time: 5-15 minutes 📋 Prerequisites: Existing backup resource
```yaml
apiVersion: neo4j.neo4j.com/v1alpha1
kind: Neo4jRestore
metadata:
  name: restore-from-storage
spec:
  targetCluster: recovery-cluster
  databaseName: myapp-db
  source:
    type: storage
    storage:
      type: s3
      bucket: backup-bucket
      path: neo4j-backups/cluster/backup-20250104-120000
    backupPath: /backup/cluster/backup-20250104-120000
  options:
    verifyBackup: true
    replaceExisting: true
    force: true
    stopCluster: true
```

✅ Perfect for: Cross-cluster recovery, disaster scenarios ⏱️ Restore time: 10-30 minutes 📋 Prerequisites: Direct storage access, backup path knowledge
PITR allows you to restore your database to a specific point in time using base backups and transaction logs. This is the most sophisticated restore option for precise recovery scenarios.
```yaml
apiVersion: neo4j.neo4j.com/v1alpha1
kind: Neo4jRestore
metadata:
  name: pitr-restore
spec:
  targetCluster: recovery-cluster
  databaseName: production-db
  source:
    type: pitr
    pointInTime: "2025-01-04T12:30:00Z"
    pitr:
      baseBackup:
        type: backup
        backupRef: daily-backup
      logStorage:
        type: s3
        bucket: transaction-logs
        path: neo4j-logs/production
        cloud:
          provider: aws
      logRetention: "7d"
      recoveryPointObjective: "5m"
      validateLogIntegrity: true
      compression:
        enabled: true
        algorithm: gzip
        level: 6
      encryption:
        enabled: true
        keySecret: log-encryption-key
        algorithm: AES256
  options:
    verifyBackup: true
    replaceExisting: true
    force: true
    stopCluster: true
    timeout: "2h"
```

✅ Perfect for: Compliance requirements, precise recovery points ⏱️ Restore time: 30-120 minutes 📋 Prerequisites: Base backup, transaction logs, advanced understanding
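`pointInTime` takes an RFC 3339 UTC timestamp (note the trailing `Z`). If you are working back from an incident time recorded in epoch seconds or a local timezone, GNU `date` can produce the exact format required (assumes GNU coreutils):

```shell
# Format an epoch-seconds instant as the RFC 3339 UTC string PITR expects.
epoch_to_point_in_time() {
  date -u -d "@$1" +%Y-%m-%dT%H:%M:%SZ
}

epoch_to_point_in_time 1735993800   # → 2025-01-04T12:30:00Z
```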
```yaml
apiVersion: neo4j.neo4j.com/v1alpha1
kind: Neo4jRestore
metadata:
  name: pitr-storage-restore
spec:
  targetCluster: disaster-recovery
  databaseName: critical-app
  source:
    type: pitr
    pointInTime: "2025-01-04T14:45:30Z"
    pitr:
      baseBackup:
        type: storage
        storage:
          type: gcs
          bucket: base-backups
          path: production/base-backup-20250104
        backupPath: /backup/base-backup-20250104
      logStorage:
        type: gcs
        bucket: transaction-logs
        path: production/logs
      validateLogIntegrity: true
  options:
    verifyBackup: true
    force: true
    stopCluster: true
```

✅ Perfect for: Disaster recovery, multi-region scenarios ⏱️ Restore time: 45-180 minutes 📋 Prerequisites: Complex storage setup, advanced operational knowledge
Pre and post-restore hooks allow you to execute custom operations before and after the restore process.
```yaml
apiVersion: neo4j.neo4j.com/v1alpha1
kind: Neo4jRestore
metadata:
  name: restore-with-hooks
spec:
  targetCluster: my-cluster
  databaseName: myapp
  source:
    type: backup
    backupRef: production-backup
  options:
    verifyBackup: true
    replaceExisting: true
    preRestore:
      cypherStatements:
        - "CALL dbms.backup.prepare()"
        - "CALL db.checkpoint()"
    postRestore:
      cypherStatements:
        - "CALL db.awaitIndexes()"
        - "CALL dbms.security.clearAuthCache()"
        - "MATCH (n:User) SET n.lastRestore = datetime()"
    force: false
    stopCluster: true
```

Hooks can also run as Kubernetes Jobs, which is useful when preparation or validation needs tooling beyond Cypher:

```yaml
apiVersion: neo4j.neo4j.com/v1alpha1
kind: Neo4jRestore
metadata:
  name: restore-with-job-hooks
spec:
  targetCluster: staging-cluster
  databaseName: app-data
  source:
    type: backup
    backupRef: staging-backup
  options:
    verifyBackup: true
    preRestore:
      job:
        template:
          container:
            image: my-registry/data-prep:latest
            command: ["/bin/sh"]
            args: ["-c", "echo 'Preparing for restore'; /scripts/pre-restore.sh"]
            env:
              - name: CLUSTER_NAME
                value: staging-cluster
              - name: DATABASE_NAME
                value: app-data
      timeout: "10m"
    postRestore:
      job:
        template:
          container:
            image: my-registry/data-validator:latest
            command: ["/bin/sh"]
            args: ["-c", "/scripts/validate-restore.sh"]
            env:
              - name: NEO4J_URI
                value: "neo4j://staging-cluster:7687"
              - name: NEO4J_PASSWORD
                valueFrom:
                  secretKeyRef:
                    name: staging-admin-secret
                    key: password
      timeout: "15m"
    stopCluster: true
```

```text
Are you just getting started?
├─ YES → Start with PVC backup (🟢 Beginner)
└─ NO ↓
Do you need production-grade reliability?
├─ YES → Use cloud storage (S3/GCS/Azure) (🟡 Intermediate)
└─ NO → PVC backup is sufficient
Do you need compliance/audit trails?
├─ YES → Weekly encrypted backups + PITR (🔴 Advanced)
└─ NO → Daily backups with retention
Do you need precise recovery points?
├─ YES → Point-in-Time Recovery (🔴 Advanced)
└─ NO → Regular backup/restore is sufficient
```
| Factor | PVC | S3 | GCS | Azure |
|---|---|---|---|---|
| Setup Complexity | 🟢 Simple | 🟡 Medium | 🟡 Medium | 🟡 Medium |
| Cost | Low | Medium | Medium | Medium |
| Durability | Cluster-dependent | 99.999999999% | 99.999999999% | 99.999999999% |
| Multi-region | ❌ No | ✅ Yes | ✅ Yes | ✅ Yes |
| Encryption | Optional | ✅ Built-in | ✅ Built-in | ✅ Built-in |
| Best For | Dev/Test | AWS prod | GCP prod | Azure prod |
| Environment | Frequency | Retention | Storage |
|---|---|---|---|
| Development | Manual | 3-5 backups | PVC |
| Staging | Daily | 7 days | Cloud |
| Production | Daily + Weekly | 30d + 90d | Cloud + Archive |
| Critical Systems | Daily + PITR | 90d + compliance | Multi-region cloud |
```bash
# List all backups
kubectl get neo4jbackups

# Get detailed backup status
kubectl describe neo4jbackup daily-backup

# View backup history
kubectl get neo4jbackup daily-backup -o jsonpath='{.status.history}'

# Check backup job logs
kubectl logs job/daily-backup-backup
```

```bash
# List all restores
kubectl get neo4jrestores

# Get detailed restore status
kubectl describe neo4jrestore restore-operation

# Check restore job logs
kubectl logs job/restore-operation-restore

# Monitor restore progress
kubectl get neo4jrestore restore-operation -w
```

```bash
# View events for backup operations
kubectl get events --field-selector involvedObject.name=daily-backup

# View events for restore operations
kubectl get events --field-selector involvedObject.name=restore-operation
```

- Regular Testing: Regularly test your backup and restore procedures
- Multiple Retention Policies: Use different retention policies for different backup frequencies
- Encryption: Always enable encryption for sensitive data
- Verification: Enable backup verification to ensure backup integrity
- Cross-Region: Store backups in different regions for disaster recovery
- Monitoring: Set up monitoring and alerting for backup operations
- Cluster Scaling: Use `stopCluster: true` for consistent restores
- Backup Verification: Always verify backups before restoring
- Test Environment: Test restores in non-production environments first
- Documentation: Document your restore procedures and test them regularly
- Point-in-Time Recovery: Use PITR for precise recovery requirements
- Hooks: Use pre/post hooks for application-specific requirements
- RBAC: The operator automatically manages RBAC for backup operations. No manual configuration needed.
- Secrets Management: Store encryption keys and credentials in Kubernetes secrets
- Network Policies: Implement network policies to restrict backup traffic
- Audit Logging: Enable audit logging for backup and restore operations
- Access Control: Limit access to backup storage and restoration capabilities
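The credential Secrets shown in the storage configuration examples use base64-encoded `data` fields. Encoding values by hand is easy to get wrong (a stray trailing newline corrupts the credential); `echo -n` avoids it:

```shell
# Kubernetes Secret 'data' values are base64-encoded.
# -n keeps echo from appending a newline to the encoded credential.
echo -n 'my-access-key-id' | base64
# → bXktYWNjZXNzLWtleS1pZA==

# Round-trip check: decode and compare.
echo 'bXktYWNjZXNzLWtleS1pZA==' | base64 -d
# → my-access-key-id
```

Alternatively, `kubectl create secret generic <name> --from-literal=KEY=value` handles the encoding for you.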
```yaml
# Using IAM roles (recommended)
spec:
  storage:
    type: s3
    bucket: my-bucket
    cloud:
      provider: aws
      # IAM role will be used via service account
---
# Using access keys (less secure)
apiVersion: v1
kind: Secret
metadata:
  name: aws-credentials
type: Opaque
data:
  AWS_ACCESS_KEY_ID: <base64-encoded-key>
  AWS_SECRET_ACCESS_KEY: <base64-encoded-secret>
```

```yaml
# Using a service account key
apiVersion: v1
kind: Secret
metadata:
  name: gcs-credentials
type: Opaque
data:
  service-account.json: <base64-encoded-service-account-json>
---
spec:
  storage:
    type: gcs
    bucket: my-gcs-bucket
    cloud:
      provider: gcp
      # Service account will be mounted automatically
```

```yaml
# Using a storage account key
apiVersion: v1
kind: Secret
metadata:
  name: azure-credentials
type: Opaque
data:
  AZURE_STORAGE_ACCOUNT: <base64-encoded-account-name>
  AZURE_STORAGE_KEY: <base64-encoded-storage-key>
---
spec:
  storage:
    type: azure
    bucket: my-container
    cloud:
      provider: azure
```

```yaml
spec:
  options:
    additionalArgs:
      - "--parallel-recovery"
      - "--temp-path=/tmp/backup"
      - "--verbose"
```

```yaml
# Backup a cluster in a different namespace
spec:
  target:
    kind: Cluster
    name: production-cluster
    namespace: production  # Different namespace
```

Something not working? Check these common issues first:
| Problem | Quick Check | Solution |
|---|---|---|
| Backup Failed | `kubectl describe neo4jbackup <name>` | Check events and conditions |
| Permission Denied | `kubectl logs job/<backup-name>-backup` | Verify storage credentials |
| Version Error | Check cluster Neo4j version | Ensure 5.26.0+ or 2025.01.0+ |
| Out of Space | `kubectl get pvc` | Check storage capacity |
| Network Issues | `kubectl get networkpolicies` | Verify connectivity rules |
For comprehensive troubleshooting, diagnostics, and advanced problem-solving: 👉 Complete Troubleshooting Guide
The troubleshooting guide includes:
- Step-by-step diagnostics for each backup/restore scenario
- Advanced debugging techniques and tools
- Performance tuning for large datasets
- Network and security troubleshooting
- Storage-specific problem resolution
- Neo4jBackup API Reference - Complete field specifications and options
- Neo4jRestore API Reference - Detailed restore configuration reference
- Working Examples - Copy-paste ready YAML files
- Getting Started Guide - Deploy your first cluster
- Installation Guide - Install the operator
- Troubleshooting Guide - Comprehensive problem-solving
- Security Best Practices - Secure your backup operations
- Performance Tuning - Optimize backup/restore performance
- GitHub Issues - Report bugs and request features
- Neo4j Community - Get help from the community
- Neo4j Documentation - Official Neo4j documentation
Just completed backup and restore setup? Here's what to explore next:
- ✅ Complete Quick Start → Set up monitoring → Test restore procedures
- ✅ Cloud storage setup → Implement scheduled backups → Configure alerting
- ✅ PITR implementation → Multi-cluster backup strategy → Compliance automation
Need help? Start with our Troubleshooting Guide or ask the community.