
feat: End-to-End Automation MVP (Matrices, Docs UI, & Community) #1813

Open
ica-chao wants to merge 85 commits into main from table-update

@ica-chao (Collaborator) commented Feb 26, 2026

Title

🚀 feat: End-to-End Automation MVP (Matrices, Docs UI, & Community)

Description

Summary:
This Pull Request introduces the complete MVP for automating the vLLM TPU project. It implements three major pillars of automation:

  1. Support Matrices Data Pipelines (combining separate CSVs).
  2. Documentation UI Layouts (generating large HTML tables and inserting them into README.md).
  3. Community Workflows (Mission board, Badges, and Contributors wall actions).

It eliminates manual Markdown editing of the support tables, so the README stays consistent with the underlying test data, and it lays the groundwork for maintaining an active open-source community.


🛠️ 1. Support Matrices Automation

  • Combined Pipelines: Added scripts/combine_matrices.py and scripts/update_all_matrices.py to automatically merge the underlying test data into unified, user-facing CSVs (e.g., combined_model_support_matrix.csv).
  • Buildkite Integration: Updated .buildkite/scripts/commit_support_matrices.sh to hook into matrix formatting rules before CI commits to the repository.
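
As a rough illustration of the merge step described above, here is a minimal sketch that joins several per-TPU CSVs on a shared key column. It is not the code in scripts/combine_matrices.py: the function name, the "Model" key column, and the "<label> <column>" naming scheme are assumptions made for the example.

```python
import csv
import io


def combine_matrices(sources: dict[str, str], key: str = "Model") -> str:
    """Merge several CSV texts into one CSV, joined on the `key` column.

    `sources` maps a label (for example a TPU generation) to CSV text.
    Every non-key column is renamed to "<label> <column>" so that the
    origin of each value stays visible. All names here are hypothetical.
    """
    rows: dict[str, dict[str, str]] = {}  # key value -> merged row
    columns: list[str] = [key]            # output column order
    for label, text in sources.items():
        for record in csv.DictReader(io.StringIO(text)):
            merged = rows.setdefault(record[key], {key: record[key]})
            for name, value in record.items():
                if name == key:
                    continue
                column = f"{label} {name}"
                if column not in columns:
                    columns.append(column)
                merged[column] = value
    out = io.StringIO()
    # restval="" leaves a cell empty when a source has no row for that key.
    writer = csv.DictWriter(out, fieldnames=columns, restval="",
                            lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows.values())
    return out.getvalue()
```

Rows that appear in only one source keep empty cells for the others, which makes gaps in test coverage visible in the combined file.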

📝 2. Documentation & UI Automation (README.md)

  • Dynamic generation: Added update_readme.py and a GitHub Action (.github/workflows/update_readme.yml) to insert the matrix status tables directly into README.md whenever a CSV changes.
  • Structured UI Redesign: Converted large flat tables (Core Features, Quantization, Microbenchmarks) into HTML tables with spanning headers (e.g. v6e vs v7x) to reduce horizontal scrolling and improve scannability.
  • Metrics Merging: Added Python logic that merges the separate "Performance" and "Correctness" cells into a single user-facing status.
  • Minimalist Legend & Formatting: Redesigned the "Status Legend" as a compact blockquote list in GitHub's native grey system font to reduce visual clutter. Enforced <br> and &nbsp; usage to prevent unwanted wrapping in table cells.
  • Dynamic Timestamping: The script updates the *Last Updated:...* string at the top of the README with the UTC time of the run.
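
The spanning-header layout and the status merge can be sketched as follows. This is an illustration only, not the logic in update_readme.py: the merge rule, the ❌ and ❓ icons, the function names, and the sub-column names passed by the caller are all assumptions.

```python
from html import escape


def merge_status(correctness: str, performance: str) -> str:
    """Collapse two test results into one status icon (hypothetical rule)."""
    if correctness == "passed" and performance == "passed":
        return "✅"
    if correctness == "failed" or performance == "failed":
        return "❌"
    return "❓"  # anything else, e.g. not yet tested


def render_table(groups: dict[str, list[str]],
                 rows: dict[str, dict[tuple[str, str], str]]) -> str:
    """Render an HTML table with one spanning header per group.

    `groups` maps a top-level header (e.g. "v6e") to its sub-columns.
    `rows` maps a feature name to {(group, sub_column): status}.
    """
    top = ['<th rowspan="2">Feature</th>']
    sub = []
    for group, cols in groups.items():
        # colspan makes the group header stretch across its sub-columns.
        top.append(f'<th colspan="{len(cols)}">{escape(group)}</th>')
        sub.extend(f"<th>{escape(col)}</th>" for col in cols)
    lines = ["<table>",
             "<tr>" + "".join(top) + "</tr>",
             "<tr>" + "".join(sub) + "</tr>"]
    for feature, cells in rows.items():
        tds = [f"<td>{escape(feature)}</td>"]
        for group, cols in groups.items():
            for col in cols:
                tds.append(f"<td>{cells.get((group, col), '❓')}</td>")
        lines.append("<tr>" + "".join(tds) + "</tr>")
    lines.append("</table>")
    return "\n".join(lines)
```

Grouping sub-columns under one `colspan` header keeps each TPU generation visually together, so the header row stays short even as sub-columns are added.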

🤝 3. Community Automation

  • Auto Mission Board: Added scripts/auto_mission_board.py and .github/workflows/auto_mission_board.yml to collect "good first issues" and sync them to our README or issues board.
  • Dynamic Contributors Wall: Added scripts/update_contributors.py (run by a weekly Action) to crawl repo commits, issue authors, and PR reviewers, compute an "Effort Score" per contributor, and update the ranked avatar grid in the README.
  • Automated Badges: Added a workflow (update_badges.yml) to keep the repository health badges in sync.
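
The PR does not state how the "Effort Score" is computed. One plausible reading is a weighted count of activity per contributor, sketched below; the weights, event kinds, and function name are hypothetical and are not taken from scripts/update_contributors.py.

```python
from collections import Counter

# Hypothetical weights; the actual scoring rule is not given in this PR.
WEIGHTS = {"commit": 3, "review": 2, "issue": 1}


def rank_contributors(events, top_n=10):
    """Rank contributors by a weighted activity count.

    `events` is an iterable of (login, kind) pairs, where kind is one of
    the keys in WEIGHTS. Returns [(login, score)] with the best first.
    """
    scores = Counter()
    for login, kind in events:
        scores[login] += WEIGHTS.get(kind, 0)  # unknown kinds score 0
    # Sort by score descending, then by login so ties are reproducible.
    ranked = sorted(scores.items(), key=lambda item: (-item[1], item[0]))
    return ranked[:top_n]
```

Breaking ties by login keeps the generated avatar grid stable between runs, which avoids noisy README diffs when nothing has changed.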

Testing/Review Notes:

  • All extraneous AI mock scripts have been explicitly deleted from this branch.
  • To review the UI improvements, view the rendered README.md in this PR branch.

# Description
This PR introduces the **MVP for End-to-End Project Automation**, establishing a unified pipeline for benchmarking, documentation, and community engagement.

It consolidates all key automation features worked on recently into a single "Foundation" release, covering 3 core steps: **Data Collection**, **Matrix/Doc Generation**, and **Community Engagement**.

## 🚀 Key Features (The "3-Step MVP")

### 1. Matrix & Badge Automation (The Data Layer)
*   **Kernel Support Matrix:** Added `scripts/update_all_matrices.py` to standardize all 6 microbenchmark CSVs with a new, reader-friendly design (Icons `✅`, Footer Notes, 2-line headers).
*   **Badge System:** Added `scripts/update_badges.py` to dynamically update status badges in `README.md` based on issue states.
*   **Data Standardization:** Added `scripts/standardize_csvs.py` and `scripts/combine_matrices.py` to ensure consistent data sizing and formatting across different TPU versions.
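
A minimal sketch of what such standardization might look like: trimming headers, padding short rows to the header width, and mapping raw result strings to icons. The icon mapping and function name are assumptions and do not reflect the actual contents of `scripts/standardize_csvs.py`.

```python
import csv
import io

# Hypothetical mapping from raw result strings to display icons.
ICONS = {"passed": "✅", "failed": "❌", "unverified": "❓", "": "❓"}


def standardize_csv(text: str) -> str:
    """Normalize a support-matrix CSV so every file has the same shape."""
    reader = csv.reader(io.StringIO(text))
    header = [name.strip() for name in next(reader)]
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    writer.writerow(header)
    for row in reader:
        if not row:  # skip blank lines
            continue
        # Pad or trim so each row has exactly as many cells as the header.
        row = (row + [""] * len(header))[: len(header)]
        name, cells = row[0].strip(), row[1:]
        writer.writerow(
            [name] + [ICONS.get(c.strip().lower(), c.strip()) for c in cells]
        )
    return out.getvalue()
```

Values that are not in the mapping pass through unchanged (apart from whitespace trimming), so numeric benchmark cells are not lost.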

### 2. Documentation Sync (The Presentation Layer)
*   **README Redesign:** Refined the `README.md` to feature:
    *   **New Collapsible Tables** for better space management.
    *   **Live Metrics:** Tables now reflect the actual state of the code via the automation scripts.
    *   **Visual Polish:** Improved spacing, headers, and footer notes for the "Kernel Support" section.
*   **Presentation Generation:** Added `generate_presentation.py` to automate the creation of project slide decks from raw data.
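
One common way to keep README tables "live" is to regenerate only the text between a pair of marker comments. The sketch below assumes that approach; the marker format, the timestamp format, and all function names are hypothetical, and the PR does not confirm that `update_readme.py` works this way.

```python
import re
from datetime import datetime, timezone


def collapsible(summary: str, body: str) -> str:
    """Wrap content in a <details> block so it is collapsed by default."""
    # Blank lines around the body let GitHub render Markdown inside it.
    return f"<details>\n<summary>{summary}</summary>\n\n{body}\n\n</details>"


def replace_between_markers(readme: str, name: str, new_body: str) -> str:
    """Replace everything between <!-- name:start --> and <!-- name:end -->."""
    start, end = f"<!-- {name}:start -->", f"<!-- {name}:end -->"
    pattern = re.compile(re.escape(start) + r".*?" + re.escape(end), re.DOTALL)
    replacement = f"{start}\n{new_body}\n{end}"
    # A function replacement avoids backslash escapes being interpreted.
    updated, count = pattern.subn(lambda _: replacement, readme, count=1)
    if count == 0:
        raise ValueError(f"markers for {name!r} not found")
    return updated


def stamp_last_updated(readme: str, now: datetime) -> str:
    """Rewrite the first *Last Updated: ...* string with a UTC timestamp."""
    stamp = now.astimezone(timezone.utc).strftime("%Y-%m-%d %H:%M UTC")
    return re.sub(r"\*Last Updated:.*?\*",
                  lambda _: f"*Last Updated: {stamp}*", readme, count=1)
```

Raising an error when the markers are missing means a workflow run fails visibly rather than silently leaving a stale table in place.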

### 3. Community Engine (The Engagement Layer)
*   **Auto Mission Board:** Introduced `scripts/auto_mission_board.py` and `docs/auto_mission_board/*` to track contributor activities and highlight "Good First Issues".
*   **Contributor Wall:** Added `scripts/update_contributors.py` to automatically recognize and display active contributors.
*   **CI/CD Workflows:** Added GitHub Actions (`.github/workflows/*.yml`) to trigger these updates automatically on schedule or push.
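
The mission board step can be pictured as a pure function over issue records. The sketch below assumes records shaped like GitHub REST API issue objects (`number`, `title`, `html_url`, `labels`, and a `pull_request` key on pull requests); the function name, output format, and fallback text are hypothetical and not taken from `scripts/auto_mission_board.py`.

```python
def build_mission_board(issues, label="good first issue"):
    """Return a Markdown list of open issues that carry `label`.

    `issues` is a list of dicts shaped like GitHub REST API issue objects.
    Fetching them over the network is deliberately left out of this sketch.
    """
    lines = []
    for issue in sorted(issues, key=lambda item: item["number"]):
        # The issues endpoint also returns pull requests; skip those.
        if "pull_request" in issue:
            continue
        names = {entry["name"].lower() for entry in issue.get("labels", [])}
        if label in names:
            lines.append(
                f"- [#{issue['number']}]({issue['html_url']}) {issue['title']}"
            )
    return "\n".join(lines) if lines else "_No open missions right now._"
```

Keeping the rendering separate from the API call makes the formatting testable without credentials or network access.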

## 🛠 Technical Details
- **Scripts:** All automation logic is centralized in `scripts/`, making it modular and easy to maintain.
- **Workflows:** New workflows ensure that `README.md` and the CSVs stay in sync with the latest test results and community activity.
- **Conflict Resolution:** This PR includes a recent merge from `main` to ensure compatibility with the latest codebase.

## ✅ Checklist
- [x] **Automation:** Verified that all scripts (`update_all_matrices.py`, `update_badges.py`, etc.) run without errors.
- [x] **Rendering:** Verified `README.md` renders correctly on GitHub.
- [x] **Data Integrity:** Restored and verified benchmark data rows.
- [x] **Self-Review:** Performed a full pass on the new automation logic.

---
*This PR serves as the foundational milestone for the project's automated infrastructure.*

Signed-off-by: Chao Wang <chaowan@google.com>
…eaders

…/v7x spanning headers

…for core features

…e script

…robenchmarks

… legend

# Conflicts:
#	support_matrices/multimodal_model_support_matrix.csv
#	support_matrices/nightly/flax_nnx/v6/multimodal_model_support_matrix.csv
#	support_matrices/nightly/flax_nnx/v6/text_only_model_support_matrix.csv
#	support_matrices/nightly/flax_nnx/v7/multimodal_model_support_matrix.csv
#	support_matrices/nightly/flax_nnx/v7/text_only_model_support_matrix.csv
#	support_matrices/nightly/v6/kernel_support_matrix-microbenchmarks.csv
#	support_matrices/nightly/v6/multimodal_model_support_matrix.csv
#	support_matrices/nightly/v6/text_only_model_support_matrix.csv
#	support_matrices/nightly/v7/kernel_support_matrix-microbenchmarks.csv
#	support_matrices/nightly/v7/multimodal_model_support_matrix.csv
#	support_matrices/nightly/v7/text_only_model_support_matrix.csv
#	support_matrices/nightly/vllm/v6/multimodal_model_support_matrix.csv
#	support_matrices/nightly/vllm/v6/text_only_model_support_matrix.csv
#	support_matrices/nightly/vllm/v7/multimodal_model_support_matrix.csv
#	support_matrices/nightly/vllm/v7/text_only_model_support_matrix.csv
#	support_matrices/text_only_model_support_matrix.csv

…benchmark table

…left column

…rmatting

…ies to prevent duplicate rows

…blocks to normalize CSS striped backgrounds

…ix nesting display issue

@ica-chao marked this pull request as ready for review March 5, 2026 00:30