# Awesome Human Activity Recognition

> A curated, researcher-driven guide to **Human Activity Recognition** -- 53 datasets, key frameworks, pretrained models, tutorials, and benchmark tools across vision, wearable, skeleton, and multimodal modalities.

[License: CC BY 4.0](https://creativecommons.org/licenses/by/4.0/)
[PRs Welcome](https://github.com/Leo-Cyberautonomy/awesome-human-activity-recognition/pulls)

## Quick Stats

| Modality | Datasets | Highlights |
|----------|----------|------------|
| Vision (RGB/Depth) | 14 | Kinetics-700, UCF-101, ActivityNet, AVA |
| Skeleton & MoCap | 7 | NTU RGB+D 60/120, AMASS, Human3.6M |
| Wearable Sensors | 13 | UCI-HAR, PAMAP2, CAPTURE-24 (3883 hrs) |
| Multimodal & Egocentric | 7 | Ego4D (3.3k hrs), EPIC-Kitchens-100 |
| Emerging & Frontier | 12 | HumanML3D, Motion-X++, Ego-Exo4D |

## Which Dataset Should I Use?

!!! tip "Pick your modality and task, then follow the recommendation."

=== "Video Classification"

    Start with **[Kinetics-700](../datasets/vision/kinetics-700.md)** for pretraining, then evaluate on **[UCF-101](../datasets/vision/ucf101.md)** or **[HMDB-51](../datasets/vision/hmdb51.md)** to compare with prior work. Browse all [Vision datasets](../datasets/vision/kinetics-700.md).
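
    A minimal loading sketch with `torchvision.datasets.UCF101`; the local paths and the `frames_per_clip` value are placeholders, and the videos plus official split files must already be downloaded.

    ```python
    # Hedged sketch: iterate UCF-101 clips for evaluation with torchvision.
    # Paths below are hypothetical -- point them at your local copy.
    from torch.utils.data import DataLoader
    from torchvision.datasets import UCF101

    dataset = UCF101(
        root="data/ucf101/videos",             # extracted UCF-101 videos (placeholder path)
        annotation_path="data/ucf101/splits",  # official train/test split files (placeholder path)
        frames_per_clip=16,
        train=False,
    )
    # Clips come from videos of different resolutions, so keep batches as lists.
    loader = DataLoader(dataset, batch_size=4, collate_fn=lambda batch: batch)

    for batch in loader:
        for video, audio, label in batch:      # video: (T, H, W, C) uint8 tensor
            pass                               # run your clip-level model here
        break
    ```
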

=== "Temporal Action Detection"

    **[ActivityNet](../datasets/vision/activitynet.md)** for temporal proposals, **[AVA](../datasets/vision/ava.md)** for spatio-temporal detection, **[MultiTHUMOS](../datasets/vision/multithumos.md)** for dense multi-label annotations.
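
    Evaluation in this setting revolves around temporal IoU between predicted and ground-truth segments; below is an illustrative helper, not any benchmark's official evaluation code.

    ```python
    # Temporal IoU between one proposal and a set of ground-truth segments.
    import numpy as np

    def temporal_iou(proposal, gt_segments):
        """proposal: (start, end) in seconds; gt_segments: (N, 2) array."""
        gt = np.asarray(gt_segments, dtype=float)
        inter_start = np.maximum(proposal[0], gt[:, 0])
        inter_end = np.minimum(proposal[1], gt[:, 1])
        inter = np.clip(inter_end - inter_start, 0.0, None)
        union = (proposal[1] - proposal[0]) + (gt[:, 1] - gt[:, 0]) - inter
        return inter / np.maximum(union, 1e-8)

    print(temporal_iou((2.0, 6.0), [[1.0, 5.0], [7.0, 9.0]]))  # [0.6 0. ]
    ```
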

=== "Skeleton / MoCap"

    **[NTU RGB+D 120](../datasets/vision/ntu-rgbd-120.md)** is the de facto standard. For text-motion alignment, use **[BABEL](../datasets/skeleton/babel.md)** or **[HumanML3D](../datasets/emerging/humanml3d.md)**.
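
    Skeleton models typically consume joint sequences as `(T, V, C)` tensors, often augmented with bone vectors. The sketch below uses a toy five-joint chain, not the NTU RGB+D 25-joint topology.

    ```python
    # Illustrative preprocessing: derive bone vectors (joint minus parent joint),
    # a common second input stream for ST-GCN-style models.
    import numpy as np

    def joints_to_bones(joints, parents):
        """joints: (T, V, 3) coordinates; parents[v] is the parent index of joint v."""
        bones = np.zeros_like(joints)
        for v, p in enumerate(parents):
            if p >= 0:                         # the root joint keeps a zero bone
                bones[:, v] = joints[:, v] - joints[:, p]
        return bones

    joints = np.random.randn(64, 5, 3).astype(np.float32)   # toy (T, V, C) sequence
    bones = joints_to_bones(joints, parents=[-1, 0, 1, 2, 3])
    print(bones.shape)  # (64, 5, 3)
    ```
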

=== "Wearable Sensors"

    **[UCI-HAR](../datasets/wearable/uci-har.md)** for baselines, **[PAMAP2](../datasets/wearable/pamap2.md)** for multi-sensor, **[CAPTURE-24](../datasets/wearable/capture24.md)** for real-world scale (151 subjects, 3883 hours).
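
    UCI-HAR ships precomputed 561-dimensional feature vectors, so a classical baseline fits in a few lines; the extraction path below is a placeholder for wherever you unzip the dataset.

    ```python
    # Hedged sketch of a UCI-HAR baseline on the dataset's precomputed features.
    import numpy as np
    from sklearn.linear_model import LogisticRegression

    root = "data/UCI HAR Dataset"   # placeholder path to the unzipped dataset
    X_train = np.loadtxt(f"{root}/train/X_train.txt")
    y_train = np.loadtxt(f"{root}/train/y_train.txt")
    X_test = np.loadtxt(f"{root}/test/X_test.txt")
    y_test = np.loadtxt(f"{root}/test/y_test.txt")

    clf = LogisticRegression(max_iter=1000).fit(X_train, y_train)
    print("test accuracy:", clf.score(X_test, y_test))
    ```
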

=== "Egocentric / Multimodal"

    **[Ego4D](../datasets/multimodal/ego4d.md)** for scale (3.3k hours), **[EPIC-Kitchens-100](../datasets/multimodal/epic-kitchens-100.md)** for kitchen actions, **[Ego-Exo4D](../datasets/emerging/ego-exo4d.md)** for cross-view.
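
    A recurring chore with these datasets is aligning non-video streams (IMU, audio, gaze) to clip windows by timestamp. The sketch below is generic and uses made-up stream names and rates; the real datasets provide their own loaders and annotation files.

    ```python
    # Illustrative timestamp alignment: gather IMU samples falling inside a clip window.
    import numpy as np

    def imu_window(imu_t, imu_x, start, end):
        """imu_t: (N,) timestamps in seconds; imu_x: (N, C) sensor samples."""
        mask = (imu_t >= start) & (imu_t < end)
        return imu_x[mask]

    imu_t = np.arange(0.0, 10.0, 1 / 200.0)    # toy 200 Hz IMU clock
    imu_x = np.random.randn(len(imu_t), 6)     # accel + gyro channels
    clip_imu = imu_window(imu_t, imu_x, start=2.0, end=4.0)
    print(clip_imu.shape)  # (400, 6)
    ```
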

=== "Text-to-Motion Generation"

    **[HumanML3D](../datasets/emerging/humanml3d.md)** for single-person, **[InterHuman](../datasets/emerging/interhuman.md)** for two-person, **[Motion-X++](../datasets/emerging/motionx-plus.md)** for whole-body with face and hands.
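
    Generation quality on these benchmarks is commonly summarized with an FID-style Fréchet distance between real and generated motion features; here is a minimal sketch of that statistic, assuming you already have features from a pretrained motion encoder.

    ```python
    # Frechet distance between two Gaussians fit to motion feature sets.
    import numpy as np
    from scipy import linalg

    def frechet_distance(feats_a, feats_b):
        """feats_*: (N, D) arrays of features from real / generated motions."""
        mu_a, mu_b = feats_a.mean(0), feats_b.mean(0)
        cov_a = np.cov(feats_a, rowvar=False)
        cov_b = np.cov(feats_b, rowvar=False)
        covmean = linalg.sqrtm(cov_a @ cov_b)
        if np.iscomplexobj(covmean):           # numerical noise can add tiny imaginary parts
            covmean = covmean.real
        diff = mu_a - mu_b
        return float(diff @ diff + np.trace(cov_a + cov_b - 2.0 * covmean))

    real = np.random.randn(256, 64)
    fake = np.random.randn(256, 64) + 0.5
    print(frechet_distance(real, fake))
    ```
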

## Explore

- **[Datasets](../datasets/vision/kinetics-700.md)** -- Browse all 53 dataset cards organized by modality
- **[Taxonomy](taxonomy.md)** -- Multi-dimensional classification of HAR approaches
- **[Surveys](surveys.md)** -- Curated survey papers across all modalities
- **[Benchmarking](benchmarking.md)** -- Compare datasets and methods
- **[Roadmap](roadmap.md)** -- What is coming next
- **[Contributing](../CONTRIBUTING.md)** -- How to add datasets or improve the list

## Citation

```bibtex
@misc{awesome_har_2025,
  title = {Awesome Human Activity Recognition: A Curated List},
  author = {Wenxuan Huang},
  year = {2025},
  url = {https://github.com/Leo-Cyberautonomy/awesome-human-activity-recognition},
  note = {GitHub repository}
}
```