hutaobo
diff --git a/‎.gitignore‎
Lines changed: 4 additions & 0 deletions b/‎.gitignore‎
Lines changed: 4 additions & 0 deletions
diff --git a/‎CITATION.cff‎
Lines changed: 2 additions & 2 deletions b/‎CITATION.cff‎
Lines changed: 2 additions & 2 deletions
diff --git a/‎README.md‎
Lines changed: 112 additions & 8 deletions b/‎README.md‎
Lines changed: 112 additions & 8 deletions
diff --git a/‎docs/_static/results/gse274058_reference/barcode_spread.png‎
36.8 KB b/‎docs/_static/results/gse274058_reference/barcode_spread.png‎
36.8 KB
diff --git a/‎docs/api.md‎
Lines changed: 73 additions & 0 deletions b/‎docs/api.md‎
Lines changed: 73 additions & 0 deletions
diff --git a/‎docs/benchmarks.md‎
Lines changed: 109 additions & 0 deletions b/‎docs/benchmarks.md‎
Lines changed: 109 additions & 0 deletions
@@ -5,3 +5,7 @@ dist/
 build/
 *.egg-info/
 .ipynb_checkpoints/
+.spatialperturb-cache/
+reports/
+artifacts/
+.tmp_sphinx_html/
@@ -4,5 +4,5 @@ message: If you use this package, please cite it.
 authors:
   - family-names: Hu
     given-names: Taobo
-version: 0.1.0
-repository-code: https://github.com/yourname/SpatialPerturb
+version: 0.3.0
+repository-code: https://github.com/hutaobo/SpatialPerturb
@@ -1,34 +1,138 @@
 # SpatialPerturb
 
-Toolkit for combining **Spatial Transcriptomics** with **Perturb-seq** workflows — signatures, label transfer, spatial scoring, and graph/structure analysis.
+SpatialPerturb is an AnnData-native framework for spatial perturbation inference across sequencing-based and imaging-based platforms.
+
+It now ships a benchmark-oriented workflow built around:
+
+- a stable `AnnData` schema,
+- `fetch -> prepare -> load` public dataset lifecycle helpers,
+- intrinsic and neighborhood differential effects with `simple` and `pseudobulk` modes,
+- ligand-receptor differential scoring with fixed fallback or custom LR resources,
+- perturbation-program and cross-platform concordance metrics,
+- paper-style figure rendering and report manifests.
 
 ## Install
 
 ```bash
 pip install SpatialPerturb
-# or with GNN extras:
-pip install 'SpatialPerturb[gnn]'
+```
+
+For heavier ecosystem interop:
+
+```bash
+pip install "SpatialPerturb[interop]"
 ```
 
 ## Quick start
 
 ```python
 import spatialperturb as sp
 
-print(sp.__version__)
+adata = sp.load_demo_dataset()
+
+intrinsic = sp.intrinsic_de(
+    adata,
+    perturbation="Lrrk2",
+    control="control",
+    method="pseudobulk",
+    sample_col="sample",
+    cell_type="neuron",
+    roi="hippocampus",
+)
+
+neighbor = sp.neighbor_de(
+    adata,
+    perturbation="Lrrk2",
+    control="control",
+    method="pseudobulk",
+    sample_col="sample",
+    aggregate="pseudobulk",
+    cell_type="neuron",
+    roi="hippocampus",
+)
+
+lr = sp.differential_lr(adata, perturbation="Lrrk2", control="control", lr_network="fallback")
+power = sp.power_curve(adata, perturbation="Lrrk2", control="control", method="pseudobulk", sample_col="sample")
+programs = sp.derive_perturbation_programs(intrinsic, top_n=10, direction="both")
+```
+
+## Public dataset lifecycle
+
+```python
+import spatialperturb as sp
+
+sp.available_datasets()
+
+sp.fetch_dataset("shen_2026_scrnaseq", cache_dir=".spatialperturb-cache")
+sp.prepare_dataset("shen_2026_scrnaseq", cache_dir=".spatialperturb-cache")
+adata = sp.load_public_dataset("shen_2026_scrnaseq", cache_dir=".spatialperturb-cache")
 ```
 
-CLI:
+Registered public tracks:
+
+- `shen_2026_stereoseq` -> `GSE274447`
+- `shen_2026_scrnaseq` -> `GSE274058`
+- `demo_spatialperturb` -> deterministic paired demo data
+
+Notes:
+
+- `shen_2026_scrnaseq` supports automatic fetch and preparation from the GEO raw archive.
+- `shen_2026_stereoseq` supports automatic fetch and extraction, but final preparation still requires a preconverted `.h5ad` or tabular export from the raw GEF files.
+
+## Paper-grade benchmark workflow
+
+```python
+import spatialperturb as sp
+
+results = sp.run_core_benchmark(
+    "demo_spatialperturb",
+    config={
+        "cache_dir": ".spatialperturb-cache",
+        "reference_dataset": "demo_spatialperturb",
+        "method": "pseudobulk",
+        "sample_col": "sample",
+        "concordance_level": "both",
+    },
+    output_dir="reports/demo_spatialperturb",
+)
+```
+
+This writes:
+
+- tidy tables under `reports/.../tables/`
+- fixed paper figures under `reports/.../figures/`
+- a machine-readable `manifest.json`
+- the exact `input.h5ad` used for the run
+
+## CLI
+
 ```bash
-SpatialPerturb version
+spatialperturb datasets
+spatialperturb fetch-dataset shen_2026_scrnaseq
+spatialperturb prepare-dataset shen_2026_scrnaseq
+spatialperturb run-benchmark demo_spatialperturb --output-dir reports/demo
+spatialperturb render-paper-figures demo_spatialperturb --output-dir reports/demo-figs
+spatialperturb validate path/to/data.h5ad
 ```
 
+## Package layout
+
+- `spatialperturb.io`: AnnData ingestion helpers.
+- `spatialperturb.pp`: perturbation assignment and QC.
+- `spatialperturb.gr`: spatial graph construction and neighbor collection.
+- `spatialperturb.tl`: intrinsic DE, neighbor DE, ligand-receptor scoring, concordance, and power.
+- `spatialperturb.pl`: plotting helpers for benchmark figures.
+- `spatialperturb.signatures`: perturbation program derivation and scoring.
+- `spatialperturb.datasets`: dataset registry plus public `fetch/prepare/load`.
+- `spatialperturb.benchmarks`: benchmark orchestration and report manifests.
+- `spatialperturb.reports`: fixed paper figure rendering.
+
 ## Development
 
 ```bash
-python -m pip install --upgrade build twine
+python -m pip install --upgrade build pytest twine
 python -m build
-twine upload --repository testpypi dist/*
+pytest -q
 ```
 
 ## Citation
 
@@ -0,0 +1,73 @@
+# API 参考
+
+## 包入口
+
+```{eval-rst}
+.. automodule:: spatialperturb
+   :members:
+```
+
+## I/O 与 Schema
+
+```{eval-rst}
+.. automodule:: spatialperturb.io
+   :members:
+```
+
+```{eval-rst}
+.. automodule:: spatialperturb.schema
+   :members:
+```
+
+## 预处理与图
+
+```{eval-rst}
+.. automodule:: spatialperturb.pp
+   :members:
+```
+
+```{eval-rst}
+.. automodule:: spatialperturb.gr
+   :members:
+```
+
+## 分析工具
+
+```{eval-rst}
+.. automodule:: spatialperturb.tl
+   :members:
+```
+
+```{eval-rst}
+.. automodule:: spatialperturb.signatures
+   :members:
+```
+
+## 数据集与 benchmark
+
+```{eval-rst}
+.. automodule:: spatialperturb.datasets
+   :members:
+```
+
+```{eval-rst}
+.. automodule:: spatialperturb.benchmarks
+   :members:
+```
+
+```{eval-rst}
+.. automodule:: spatialperturb.reports
+   :members:
+```
+
+## 绘图与 CLI
+
+```{eval-rst}
+.. automodule:: spatialperturb.pl
+   :members:
+```
+
+```{eval-rst}
+.. automodule:: spatialperturb.cli
+   :members:
+```
@@ -0,0 +1,109 @@
+# Benchmarks
+
+SpatialPerturb 当前把 benchmark 固定成两条主轨道：
+
+- `shen_2026_core`
+  目标是复现空间扰动数据上的 intrinsic / neighbor / ligand-receptor / power / figure 主链。
+- `cross_platform_concordance`
+  目标是比较 spatial 和 dissociated reference 中的 perturbation signatures 与 programs。
+
+## 查看 catalog
+
+```python
+import spatialperturb as sp
+
+sp.available_datasets()
+sp.available_benchmarks()
+```
+
+## Public benchmark backbone
+
+### `shen_2026_scrnaseq`
+
+- accession: `GSE274058`
+- role: reference / cross-platform track
+- raw format: nested `10x tar.gz`
+- status: automatic `fetch -> prepare -> load` supported
+
+### `shen_2026_stereoseq`
+
+- accession: `GSE274447`
+- role: spatial core track
+- raw format: `tar of GEF`
+- status: automatic fetch and extraction supported; final prepare still expects a preconverted `.h5ad` or tabular cell-level export
+
+## 运行 core benchmark
+
+```python
+import spatialperturb as sp
+
+results = sp.run_core_benchmark(
+    "demo_spatialperturb",
+    config={
+        "cache_dir": ".spatialperturb-cache",
+        "method": "pseudobulk",
+        "sample_col": "sample",
+        "reference_dataset": "demo_spatialperturb",
+        "concordance_level": "both",
+    },
+    output_dir="reports/demo_spatialperturb",
+)
+```
+
+这个入口会自动：
+
+- 载入 prepared dataset
+- 补 spatial graph（如果还没建）
+- 运行 `intrinsic_de`
+- 运行 `neighbor_de`
+- 运行 `differential_lr`
+- 运行 `power_curve`
+- 如果给了 reference，再运行 `platform_concordance`
+- 输出 tables、figures、`manifest.json` 和 `input.h5ad`
+
+## 运行 cross-platform benchmark
+
+```python
+spatial, reference = sp.load_demo_dataset(paired=True)
+
+spatial_de = sp.intrinsic_de(
+    spatial,
+    perturbation="Lrrk2",
+    control="control",
+    method="pseudobulk",
+    sample_col="sample",
+)
+
+reference_de = sp.intrinsic_de(
+    reference,
+    perturbation="Lrrk2",
+    control="control",
+    method="pseudobulk",
+    sample_col="sample",
+)
+
+concordance = sp.run_cross_platform_benchmark(
+    spatial_de,
+    reference_de,
+    config={"top_n": 50, "level": "both"},
+)
+```
+
+## Benchmark 输出目录
+
+`run_core_benchmark(..., output_dir=...)` 会生成固定目录结构：
+
+- `tables/intrinsic_de.tsv`
+- `tables/neighbor_de.tsv`
+- `tables/differential_lr.tsv`
+- `tables/power_curve.tsv`
+- `tables/platform_concordance.tsv`（如果提供 reference）
+- `figures/workflow_schema.png`
+- `figures/assignment_qc.png`
+- `figures/own_vs_neighbor.png`
+- `figures/lr_differential.png`
+- `figures/platform_concordance.png`
+- `figures/power_curve.png`
+- `manifest.json`
+- `config.json`
+- `input.h5ad`