Skip to content

Commit a0d18ba

Browse files
Yoojin-namclaude
andauthored
feat(model-lane): v5.0 Phase 2 — architecture-zoo (research-question → architecture decision tool) (#219)
The "choose" front end of the model-engineering lane, completing Phase 2 (architecture-zoo → model-scaffold → model-validation). Advisory Layer-D skill, no detector/torch. Additive: skills 47→48; detectors/probes/guidelines unchanged. - /architecture-zoo: maps a research question (task + modality/dimensionality + labelled-data scale + class imbalance) to a paper-grounded architecture shortlist via a decision tree (references/index.md), then per-architecture cards — core idea, when-to-use, medical-imaging use, reference implementation, the typical validation/experiment setup, and the matching /model-scaffold template. Seeds the classification (ResNet/DenseNet/EfficientNet/Inception/ViT/Swin/DeiT), segmentation (U-Net/3-D U-Net/V-Net/Attention & Residual U-Net/nnU-Net/SegResNet/Swin-UNETR/ Mask R-CNN), and foundation/SSL (SAM/MedSAM/MedSAM2/TotalSegmentator/SegVol/ BiomedCLIP/DINO/MAE/SimCLR/MoCo) families. Every recommendation names its source paper; archetypes, not a live SOTA leaderboard. All CI-mirror gates green locally (validate_skills, all gen_* --check, validate_catalog_consistency, frontmatter, routing-assets, locale, version, npm). Version left at 4.10.0 — release is a separate gated step. Co-authored-by: Claude Opus 4.8 <noreply@anthropic.com>
1 parent 90a6b58 commit a0d18ba

16 files changed

Lines changed: 593 additions & 4 deletions

.claude-plugin/marketplace.json

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -25,6 +25,7 @@
2525
"source": "./",
2626
"strict": false,
2727
"skills": [
28+
"./skills/architecture-zoo",
2829
"./skills/calc-sample-size",
2930
"./skills/clean-data",
3031
"./skills/define-variables",

CHANGELOG.md

Lines changed: 10 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -59,6 +59,16 @@
5959
deterministic split matches the frozen expected + is patient-disjoint (proven inline) → passes
6060
`check_training_hygiene` → a **self-skipping** torch tier (forward shape + gradients + reproducible
6161
loss when torch is installed; `SKIP`, never CI coverage of runnability, when absent).
62+
- **New skill `/architecture-zoo`** (Layer D, advisory) — the *choose* front end of the lane: maps a
63+
research question (task + modality / dimensionality + labelled-data scale + class imbalance) to a
64+
**paper-grounded** architecture shortlist via a decision tree, then per-architecture cards with core
65+
idea, when-to-use, medical-imaging use, reference implementation, the typical validation/experiment
66+
setup, and the matching `/model-scaffold` template. Seeds the classification (ResNet / DenseNet /
67+
EfficientNet / Inception / ViT / Swin / DeiT), segmentation (U-Net / 3-D U-Net / V-Net / Attention
68+
& Residual U-Net / nnU-Net / SegResNet / Swin-UNETR / Mask R-CNN), and foundation/SSL (SAM / MedSAM /
69+
MedSAM2 / TotalSegmentator / SegVol / BiomedCLIP / DINO / MAE / SimCLR / MoCo) families. Every
70+
recommendation names its source paper; it teaches archetypes, not a live SOTA leaderboard. Skills
71+
47 → 48.
6272

6373
## [4.10.0] - 2026-06-28
6474

README.md

Lines changed: 3 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -2,14 +2,14 @@
22

33
# MedSci Skills
44

5-
**47 skills that actually work.** Built by a physician-researcher, tested on real publications.
5+
**48 skills that actually work.** Built by a physician-researcher, tested on real publications.
66

77
*MedSci Skills is a submission-grade clinical manuscript workflow, not a generic biomedical skill catalog. Its moat is the compliance layer — 38 reporting guidelines and risk-of-bias tools, reference/citation verification, and deterministic integrity gates, before peer review sees the manuscript. It competes on clinical submission reliability, not skill count.*
88

99
[![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](LICENSE)
1010
[![Release](https://img.shields.io/github/v/release/Aperivue/medsci-skills?style=flat-square&color=blue)](https://github.com/Aperivue/medsci-skills/releases/latest)
1111
[![CI](https://img.shields.io/github/actions/workflow/status/Aperivue/medsci-skills/validate.yml?branch=main&style=flat-square&label=CI)](https://github.com/Aperivue/medsci-skills/actions/workflows/validate.yml)
12-
![Skills](https://img.shields.io/badge/Skills-47-brightgreen?style=flat-square)
12+
![Skills](https://img.shields.io/badge/Skills-48-brightgreen?style=flat-square)
1313
[![npm](https://img.shields.io/npm/v/medsci-skills?style=flat-square&label=npm&color=cb3837)](https://www.npmjs.com/package/medsci-skills)
1414
[![Watch the 2-min intro](https://img.shields.io/badge/▶_Watch-2--min_intro-FF0000?style=flat-square&logo=youtube&logoColor=white)](https://youtu.be/MclQ_RIofpE)
1515
[![good first issues](https://img.shields.io/github/issues/Aperivue/medsci-skills/good%20first%20issue?style=flat-square&label=good%20first%20issues&color=7057ff)](https://github.com/Aperivue/medsci-skills/contribute)
@@ -453,6 +453,7 @@ ma-scout -> search-lit -> fulltext-retrieval -> design-study ──> write-proto
453453
| **design-ai-benchmarking** | Design and validity review for benchmarking AI system(s) against a human-expert panel: evaluation-question and arm definition, decoupled multi-dimensional rubrics with anchors, planted calibration probes (positive-control / known-bad / instability / mechanism-contradiction), reviewer-panel construction with per-reviewer randomization, inter-rater reliability targets with separate control-item reliability, LLM-as-judge vs human-as-judge adjudication, construct-independence guards, and a structured JSON rating-export schema. Locks the rubric before data collection. |
454454
| **model-validation** | Design or audit the clinical-validation study for an engineer-built medical-imaging model (segmentation / classification / detection): patient-level split disjointness and the data-leakage taxonomy, tuning-on-test, internal vs genuine external validation, comparator design, single-run vs multi-seed variance, task-correct metric selection (Metrics Reloaded), test-set sizing, and CLAIM 2024 / TRIPOD+AI / STARD-AI reporting fit. Ships a deterministic split-leakage gate that proves patient disjointness by set arithmetic on the emitted split table. Integrates with MONAI / nnU-Net — does not replace them. |
455455
| **model-scaffold** | Generate a reproducible, runnable PyTorch training repo for a medical-imaging segmentation task — the missing middle link between choosing an architecture and validating a trained model. Emits a patient-level seed-locked split as an auditable artifact, a configurable U-Net, train/evaluate scripts that seed every RNG and infer under eval mode, a config, requirements, a reproducibility record, and a Methods stub with VERIFY placeholders (no fabricated numbers). Reproducibility holds by construction; ships a `check_training_hygiene` AST gate + a network-free build→validate challenge. Integrates with MONAI / nnU-Net / TorchIO — does not reimplement them. |
456+
| **architecture-zoo** | "Which architecture for which research question" decision tool: maps task (classification / segmentation / detection / transfer), modality, data scale, and class imbalance to a paper-grounded architecture shortlist. Curates the foundational curriculum (ResNet / DenseNet / EfficientNet / ViT / Swin; U-Net / 3-D U-Net / Attention & Residual U-Net / nnU-Net / Mask R-CNN; SAM/MedSAM / TotalSegmentator / BiomedCLIP / DINO / MAE / SimCLR) — each with core idea, when-to-use, medical-imaging use, reference implementation, validation setup, and the matching model-scaffold template. Advisory; teaches archetypes, not a live SOTA leaderboard. |
456457
| **intake-project** | Classifies new research projects, summarizes current state, identifies missing inputs, and recommends next steps. |
457458
| **grant-builder** | Structures grant proposals: significance, innovation, approach, milestones, and consortium roles. |
458459
| **present-paper** | Academic presentation preparation: paper analysis, supporting research, speaker scripts, slide note injection, and Q&A prep. |

docs/skills/README.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -7,6 +7,7 @@ One reference page per skill, generated from each skill's `SKILL.md` and `skill.
77
- [academic-aio](academic-aio.md) — Medical AI paper optimization for AI search engines (Perplexity, ChatGPT web, Elicit, Consensus, SciSpace) and RAG-based literature tools. _(evidence: bundled_script)_
88
- [add-journal](add-journal.md) — Add a new journal to the MedSci Skills profile database. _(evidence: manual_workflow)_
99
- [analyze-stats](analyze-stats.md) — Statistical analysis for medical research papers. _(evidence: demo)_
10+
- [architecture-zoo](architecture-zoo.md) — Choose a model architecture for a medical-imaging research question before scaffolding. _(evidence: manual_workflow)_
1011
- [author-strategy](author-strategy.md) — PubMed author profile analysis. _(evidence: manual_workflow)_
1112
- [batch-cohort](batch-cohort.md) — Generate N analysis scripts from a single methodology template × multiple exposure/outcome combinations. _(evidence: manual_workflow)_
1213
- [calc-sample-size](calc-sample-size.md) — Interactive sample size calculator for medical research. _(evidence: manual_workflow)_

docs/skills/architecture-zoo.md

Lines changed: 48 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,48 @@
1+
<!-- AUTO-GENERATED from skills/architecture-zoo/SKILL.md by scripts/gen_skill_docs.py. Do not edit by hand. -->
2+
3+
# architecture-zoo
4+
5+
> Choose a model architecture for a medical-imaging research question before scaffolding. Maps the task (classification, segmentation, detection, transfer), modality and dimensionality, labelled-data scale, and class imbalance to a shortlist of architectures, each grounded in its source paper with a when-to-use, a medical-imaging use, a reference implementation, the typical validation setup, and the matching model-scaffold template. Covers the foundational curriculum (ResNet, DenseNet, EfficientNet, ViT, Swin; U-Net, 3-D U-Net, Attention/Residual U-Net, nnU-Net, Mask R-CNN; SAM/MedSAM, TotalSegmentator, BiomedCLIP, DINO/MAE/SimCLR). It teaches archetypes and the task-to-architecture logic, not a live SOTA leaderboard.
6+
7+
**Invoke:** `/architecture-zoo` · **Tools:** Read, Write, Edit, Grep, Glob · **Model:** inherit
8+
9+
## When to use
10+
11+
`architecture-zoo` activates on requests such as: architecture zoo, which architecture, choose a model, model selection, ResNet vs ViT, U-Net vs nnU-Net, what backbone, foundation model for, transfer learning choice, MedSAM, TotalSegmentator, DINO, MAE, self-supervised, paper to architecture, reference implementation, when to use ViT, segmentation architecture, classification backbone.
12+
13+
## Quality Card
14+
15+
**Purpose** — Turn a medical-imaging research question into a paper-grounded architecture choice — so the build starts from the right archetype (and a known validation setup) rather than from what is fashionable, and the choice carries its source citation into the manuscript.
16+
17+
**Safety boundaries**
18+
19+
- Advisory only: it writes a decision note, never code or weights; the build is /model-scaffold.
20+
- Every recommendation names its source paper; benchmark numbers are cited, never invented; the zoo describes archetypes, not a live leaderboard.
21+
22+
**Known limitations**
23+
24+
- The literature moves fast; this is a curated archetype map (classification, segmentation, foundation/SSL families seeded), not an exhaustive or current SOTA ranking — additional families (detection, synthesis) land in later phases.
25+
- A sound architecture choice is necessary, not sufficient; validity still depends on the split, validation design, and metrics (/model-validation, /model-evaluation).
26+
27+
**Validation**
28+
29+
- `carry the decision note into /model-scaffold to instantiate the chosen template, then /model-validation`
30+
31+
**Evidence**`manual_workflow`
32+
33+
## Bundled resources
34+
35+
**References** (`skills/architecture-zoo/references/`):
36+
37+
- `classification.md`
38+
- `foundation_models.md`
39+
- `index.md`
40+
- `segmentation.md`
41+
42+
## Source
43+
44+
Canonical definition: [`skills/architecture-zoo/SKILL.md`](../../skills/architecture-zoo/SKILL.md)
45+
46+
---
47+
48+
*Part of [MedSci Skills](../../README.md) — Claude Code skills for the medical research lifecycle. This page is generated from the skill's `SKILL.md`; edit that file and re-run `scripts/gen_skill_docs.py`.*

metadata/catalog_counts.json

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
{
22
"_comment": "Single source of truth for catalog counts cited in public docs (README, orchestrate, check-reporting). scripts/validate_catalog_consistency.py recomputes every value from disk, asserts this file matches, and asserts the doc claims match. Do not hand-edit a value without running that script \u2014 CI fails on drift.",
3-
"skills": 47,
3+
"skills": 48,
44
"reporting_guidelines": 38,
55
"journal_profiles_find": 73,
66
"journal_profiles_write": 55,

metadata/distribution_files.json

Lines changed: 30 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -396,6 +396,36 @@
396396
"size": 1421,
397397
"sha256": "912c52e9289a7ccb014aa8a18105b6dfe04c2cc040e970c73b4bbc6b2d8a8a39"
398398
},
399+
{
400+
"path": "skills/architecture-zoo/SKILL.md",
401+
"size": 5444,
402+
"sha256": "6d8f81262a42ff24e36dca425511804b9f324d2c900f31f701c703e3d8326729"
403+
},
404+
{
405+
"path": "skills/architecture-zoo/references/classification.md",
406+
"size": 5986,
407+
"sha256": "035e0fddaccb0e19e23ffd7756b075154f847ee20f9a512cd2a010c0db4210fa"
408+
},
409+
{
410+
"path": "skills/architecture-zoo/references/foundation_models.md",
411+
"size": 5267,
412+
"sha256": "495453b025f1cb13d5ba0bac9be6d3b0d63dd958ea48d2b9e55bc222e9c26786"
413+
},
414+
{
415+
"path": "skills/architecture-zoo/references/index.md",
416+
"size": 4065,
417+
"sha256": "a1ed80efcf9a56e0286972ae9b08bd38965bb20a569e3aac5314549dfa6ad5f4"
418+
},
419+
{
420+
"path": "skills/architecture-zoo/references/segmentation.md",
421+
"size": 6508,
422+
"sha256": "17618fe3d6884cf89b034ededcd69e081a65d7bfd473495eb2ab1fb5d8b15d8b"
423+
},
424+
{
425+
"path": "skills/architecture-zoo/skill.yml",
426+
"size": 2889,
427+
"sha256": "275cfb1d0779028d79d8596879c09b5b6c714859142cb72a12b7ec901acc69e1"
428+
},
399429
{
400430
"path": "skills/author-strategy/SKILL.md",
401431
"size": 9209,

metadata/distribution_manifest.json

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -5,6 +5,7 @@
55
"academic-aio",
66
"add-journal",
77
"analyze-stats",
8+
"architecture-zoo",
89
"author-strategy",
910
"batch-cohort",
1011
"calc-sample-size",

metadata/skills_catalog.json

Lines changed: 11 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
{
22
"_comment": "AUTO-GENERATED by scripts/gen_skills_catalog_json.py from each skills/<slug>/SKILL.md + skill.yml. Machine-readable skill catalog (single source of truth) consumed by external surfaces such as the aperivue.com storefront to gate skill-list completeness. Do not hand-edit; CI gate: python3 scripts/gen_skills_catalog_json.py --check.",
3-
"skill_count": 47,
3+
"skill_count": 48,
44
"categories": [
55
{
66
"key": "literature_references",
@@ -17,6 +17,7 @@
1717
"key": "data_study_design",
1818
"label": "Data & Study Design",
1919
"slugs": [
20+
"architecture-zoo",
2021
"calc-sample-size",
2122
"clean-data",
2223
"define-variables",
@@ -126,6 +127,15 @@
126127
"maturity": "official",
127128
"description": "Statistical analysis for medical research papers."
128129
},
130+
{
131+
"slug": "architecture-zoo",
132+
"category": "data_study_design",
133+
"category_label": "Data & Study Design",
134+
"layer": "D",
135+
"owner_domain": "architecture_reference",
136+
"maturity": "official",
137+
"description": "Choose a model architecture for a medical-imaging research question before scaffolding."
138+
},
129139
{
130140
"slug": "author-strategy",
131141
"category": "project_workflow",

scripts/gen_skills_catalog_json.py

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -54,6 +54,7 @@
5454
# "Model Engineering & Validation" storefront category at the v5.0.0 major).
5555
"model_validation": ("data_study_design", "Data & Study Design"),
5656
"model_development": ("data_study_design", "Data & Study Design"),
57+
"architecture_reference": ("data_study_design", "Data & Study Design"),
5758
# Analysis & figures
5859
"statistical_analysis": ("analysis_figures", "Analysis & Figures"),
5960
"figure_generation": ("analysis_figures", "Analysis & Figures"),

0 commit comments

Comments
 (0)