- Overview
- Requirements
- Setup
- Running the Application
- Project Structure
- Database Setup
- Additional Configuration
- User Interface Guide
## Overview

DeepBench Analysis is a visualization and analysis tool for evaluating the robustness of machine learning models against various image perturbations. It provides an interactive dashboard that allows researchers and practitioners to:
- Compare different ML models' performance under various image augmentations
- Analyze model stability across different use cases (Medical, Autonomous Driving, etc.)
- Visualize the impact of perturbations through detailed metrics and plots
- Browse and compare results across different model collections
## Requirements

- Python 3.10 or higher
- MongoDB instance (local or remote)
- Git
## Setup

- Clone the repository:

  ```bash
  git clone [repository-url]
  cd deepbench_analysis
  ```

- Create and activate a virtual environment:

  ```bash
  python -m venv venv

  # On Windows
  venv\Scripts\activate

  # On Unix or macOS
  source venv/bin/activate
  ```

- Install the package and development dependencies:

  ```bash
  pip install -e .
  ```

- Create a `.env` file in the root directory with the following variables:
  ```ini
  # MongoDB Credentials
  DBUSER=your_mongodb_username
  DBPASSWD=your_mongodb_password

  # Custom MongoDB URI
  MONGODB_URI=custom_host
  ```

## Running the Application

Start the Streamlit application:

```bash
streamlit run src/deepbench_analysis.py
```

The dashboard will be available at http://localhost:8501 by default.
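The credential and URI variables above are presumably combined into a connection string at startup. A minimal sketch of that logic (the function name, fallback host, and port are illustrative assumptions, not the tool's actual code):

```python
import os


def build_mongodb_uri() -> str:
    """Build a MongoDB connection URI from the .env variables described above.

    If MONGODB_URI is set, it wins; otherwise fall back to a credential-based
    URI against a local instance (host/port here are assumptions).
    """
    custom_uri = os.environ.get("MONGODB_URI")
    if custom_uri:
        return custom_uri
    user = os.environ.get("DBUSER", "")
    passwd = os.environ.get("DBPASSWD", "")
    return f"mongodb://{user}:{passwd}@localhost:27017/"
```

With `MONGODB_URI` set, the custom host takes precedence; otherwise `DBUSER`/`DBPASSWD` are used against a default local instance.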
## Project Structure

- `configs/` - Configuration files
- `src/` - Source code
- `src/mappings/` - Dataset mapping files
- `src/app/` - Streamlit application code
- `src/deepbench_analysis/` - Core analysis functionality
- `src/deepbench_analysis/additional_metrics/` - Additional metrics calculation code
- `src/deepbench_analysis/config/` - Configuration parsing code
- `src/deepbench_analysis/db/` - Database connection and query code
- `src/deepbench_analysis/logger/` - Logging and performance monitoring code
- `src/deepbench_analysis/mongodb_data_processing/` - Data extraction and visualization code
- `src/deepbench_analysis/tabs/` - Streamlit tab code
## Database Setup

The application requires a MongoDB instance with:
- A database named "Deepbench"
- Collections containing model evaluation results
- Each collection must follow the project's schema for model results:
  - the collection must contain more than 10 documents
  - only one model per collection
  - models are comparable only if they share the same augmentations
- General document structure:

```json
{
  "experiment_name": "test_collection_2025-04-19-23_44_55",
  "git": "ec853d3e45b5ae525b061ba66d24b5568d35a11f",
  "image": "/path/to/image.jpg",
  "gt": "0",
  "resolution": {
    "original": [256, 256],
    "scaled": [224, 224]
  },
  "augment_method": {
    "SatelliteImaging": {
      "Contrast": {
        "contrast": -100
      }
    }
  },
  "model": "model_name",
  "label_score": {
    "0": 0.87,
    "1": 0.09,
    "2": 0.03,
    "3": 0.01
  },
  "prime_img": false,
  "img_array": []
}
```
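The collection rules above can be checked mechanically. A sketch of such a validation, assuming plain Python dicts as returned by a MongoDB driver (the function and its return convention are illustrative, not part of the project):

```python
# Top-level fields from the document structure shown above.
REQUIRED_KEYS = {
    "experiment_name", "git", "image", "gt", "resolution",
    "augment_method", "model", "label_score", "prime_img", "img_array",
}


def validate_collection(docs):
    """Check the project's collection rules; return (ok, reason)."""
    if len(docs) <= 10:
        return False, "collection must contain more than 10 documents"
    if len({d.get("model") for d in docs}) != 1:
        return False, "only one model per collection"
    for d in docs:
        missing = REQUIRED_KEYS - d.keys()
        if missing:
            return False, f"document missing keys: {sorted(missing)}"
    return True, "ok"
```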
## Additional Configuration

- Modify `configs/default_config.toml` for:
  - MongoDB database name
  - Augmentation methods
  - Use cases
  - Debug options:
    - Performance logging: set to True to measure performance
    - Collection browsing: set to True to enable browsing MongoDB collections
- Streamlit settings:
  - Adjust `.streamlit/config.toml` for Streamlit-specific configurations
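A hypothetical shape for `configs/default_config.toml`, matching the options listed above (all key and table names here are guesses for illustration; consult the file shipped with the repository for the real schema):

```toml
# Illustrative only - key names are assumptions, not the shipped schema
[mongodb]
database = "Deepbench"

[debug]
performance_logging = false
collection_browsing = false

[use_cases]
enabled = ["Medical", "AutonomousDriving", "SatelliteImaging"]
```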
## User Interface Guide

### Sidebar

The application's sidebar contains the main controls for data selection and visualization:

- **Collection Selection**
  - Primary Collection: select the main model collection to analyze
  - Comparable Collections: choose one or more collections to compare against the primary collection
  - Only collections with a matching schema will be available for comparison
- **Feature Toggles**
  - Show Prime Images: display original (unaugmented) images when expanding augmentation results, if any are present in the collection
  - Display Additional Metrics: show advanced metrics such as confusion matrices, ECE, and ROC curves
### Analysis Tab

The main analysis dashboard, where you can:
- View performance comparisons between selected models
- Analyze accuracy across different augmentation methods
- Examine detailed metrics and visualizations
- Download data tables and plots
- Filter results by use case and augmentation type
- Expand sections to see detailed performance breakdowns
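The per-augmentation accuracy comparison can be sketched as a small aggregation over documents following the schema shown in Database Setup (prediction taken as the argmax of `label_score`; the helper below is illustrative, not the tool's code):

```python
from collections import defaultdict


def accuracy_by_augmentation(docs):
    """Accuracy per augmentation method, with prediction = argmax of label_score."""
    counts = defaultdict(lambda: [0, 0])  # method -> [n_correct, n_total]
    for d in docs:
        # augment_method nests use case -> method -> parameters,
        # e.g. {"SatelliteImaging": {"Contrast": {"contrast": -100}}}
        use_case = next(iter(d["augment_method"]))
        method = next(iter(d["augment_method"][use_case]))
        pred = max(d["label_score"], key=d["label_score"].get)
        counts[method][0] += int(pred == d["gt"])
        counts[method][1] += 1
    return {m: correct / total for m, (correct, total) in counts.items()}
```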
### Info Tab

Contains essential information about:
- Project TAHAI overview and objectives
- DeepBench Analysis tool description
- Links to related resources:
  - Research paper
  - Project websites
### Debug Tab

Available when debug mode is enabled in `configs/default_config.toml`. This tab is useful if you host a local MongoDB instance without its own UI. It lets you:
- Browse and inspect MongoDB collections
- View detailed document structures
- Download collection data in CSV or JSON format
- Filter and search through documents
- Delete collections
- Upload TinyDB JSON files to MongoDB
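The TinyDB upload amounts to flattening TinyDB's JSON layout (`{table: {doc_id: document}}`) into a list that a MongoDB driver can pass to `insert_many`. A minimal sketch (the function is illustrative, not the tool's actual upload code):

```python
import json


def tinydb_to_documents(tinydb_json: str, table: str = "_default"):
    """Flatten a TinyDB JSON dump into a list of documents.

    TinyDB stores each table as a mapping of string doc IDs to documents;
    MongoDB wants a flat list, so we drop the IDs and keep the values.
    """
    data = json.loads(tinydb_json)
    return list(data.get(table, {}).values())
```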
### Metrics and Visualizations

- Each augmentation method shows:
  - Accuracy comparison plots
  - Performance metrics
  - Downloadable data tables
  - Expandable sections for detailed analysis
- When "Display Additional Metrics" is enabled:
  - Confusion matrices
  - ROC curves
  - Expected Calibration Error (ECE)
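Expected Calibration Error, the last metric listed, can be sketched with the standard equal-width-binning formulation (the tool's own implementation in `src/deepbench_analysis/additional_metrics/` may differ in binning details):

```python
def expected_calibration_error(confidences, correct, n_bins=10):
    """Binned ECE: weighted average of |accuracy - confidence| per bin.

    Confidences in [0, 1] are sorted into n_bins equal-width bins
    [i/n, (i+1)/n); each bin contributes its sample fraction times the
    gap between its mean accuracy and mean confidence.
    """
    bins = [[0, 0.0, 0.0] for _ in range(n_bins)]  # [count, sum_correct, sum_conf]
    for conf, ok in zip(confidences, correct):
        i = min(int(conf * n_bins), n_bins - 1)
        bins[i][0] += 1
        bins[i][1] += bool(ok)
        bins[i][2] += conf
    n = len(confidences)
    ece = 0.0
    for count, s_correct, s_conf in bins:
        if count:
            ece += (count / n) * abs(s_correct / count - s_conf / count)
    return ece
```

A model whose confidence matches its empirical accuracy in every bin scores an ECE of 0; larger values indicate over- or under-confidence.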