quadbio
diff --git a/.github/copilot-instructions.md b/.github/copilot-instructions.md
Lines changed: 142 additions & 0 deletions
diff --git a/.github/workflows/test.yaml b/.github/workflows/test.yaml
Lines changed: 9 additions & 0 deletions
diff --git a/.gitignore b/.gitignore
Lines changed: 10 additions & 0 deletions
diff --git a/.pylintrc b/.pylintrc
Lines changed: 2 additions & 0 deletions
diff --git a/CHANGELOG.md b/CHANGELOG.md
Lines changed: 75 additions & 4 deletions
diff --git a/LICENSE b/LICENSE
Lines changed: 1 addition & 1 deletion
@@ -0,0 +1,142 @@
+# Copilot Instructions for CellAnnotator
+
+## Important Notes
+- Avoid drafting summary documents or endless markdown files. Just summarize in chat what you did, why, and any open questions.
+- Don't update Jupyter notebooks - those are managed manually.
+- When running terminal commands, activate the appropriate environment first (`mamba activate cell_annotator`).
+- Rather than making assumptions, ask for clarification when uncertain.
+- **GitHub workflows**: Use GitHub CLI (`gh`) when possible. For GitHub MCP server tools, ensure Docker Desktop is running first (`open -a "Docker Desktop"`).
+
+## Project Overview
+
+**CellAnnotator** is an scverse ecosystem package for automated cell type annotation in scRNA-seq data using Large Language Models (LLMs). It's provider-agnostic, supporting OpenAI, Google Gemini, and Anthropic Claude. The tool sends cluster marker genes (not expression values) to LLMs, which return structured cell type annotations with confidence scores.
+
+### Domain Context (Brief)
+- **AnnData**: Standard single-cell data structure. Contains `.X`, `.obs` (cell metadata), `.var` (gene metadata).
+- **Marker genes**: Differentially expressed genes that characterize cell types/clusters (computed via scanpy).
+- **LLM providers**: OpenAI (GPT), Google (Gemini), Anthropic (Claude). Uses Pydantic for structured outputs.
+- **Workflow**: 1) Compute marker genes per cluster, 2) Send to LLM with biological context, 3) Get structured annotations, 4) Harmonize across samples.
+
+### Key Dependencies
+- **Core**: scanpy, pydantic, python-dotenv, rich
+- **LLM providers**: openai, anthropic, google-genai (all optional)
+- **Optional**: rapids-singlecell (GPU), colorspacious (colors)
+
+## Architecture & Code Organization
+
+### Module Structure (follows scverse conventions)
+- Use `AnnData` objects as primary data structure
+- Type annotations use modern syntax: `str | None` instead of `Optional[str]`
+- Supports Python 3.11, 3.12, 3.13 (see `pyproject.toml`)
+- Avoid local imports unless necessary for circular import resolution
+
+### Core Components
+1. **`src/cell_annotator/model/cell_annotator.py`**: Main `CellAnnotator` class
+   - Orchestrates annotation across multiple samples
+   - `annotate_clusters()`: Main entry point for annotation
+2. **`src/cell_annotator/model/sample_annotator.py`**: `SampleAnnotator` class
+   - Handles annotation for a single sample
+   - Computes marker genes, queries LLM, stores results
+3. **`src/cell_annotator/model/base_annotator.py`**: `BaseAnnotator` abstract class
+   - Shared LLM provider logic and validation
+4. **`src/cell_annotator/_response_formats.py`**: Pydantic models for structured LLM outputs
+5. **`src/cell_annotator/_prompts.py`**: LLM prompt templates
+6. **`src/cell_annotator/utils.py`**: Helper functions (marker gene filtering, formatting)
+
+## Development Workflow
+
+### Environment Management (Hatch-based)
+```bash
+# Testing - NEVER use pytest directly
+hatch test                    # test with highest Python version
+hatch test --all              # test the full matrix: Python 3.11 & 3.13, plus pre-release deps
+
+# Documentation
+hatch run docs:build          # build Sphinx docs
+hatch run docs:open           # open in browser
+hatch run docs:clean          # clean build artifacts
+
+# Environment inspection
+hatch env show                # list environments
+```
+
+### Testing Strategy
+- Test matrix defined in `[[tool.hatch.envs.hatch-test.matrix]]` in `pyproject.toml`
+- Tests Python 3.11 & 3.13 with stable deps, 3.13 with pre-release deps
+- Tests live in `tests/`, use pytest with `@pytest.mark.real_llm_query` for actual LLM calls
+- Run via `hatch test` to ensure proper environment isolation
+- Optional dependencies tested via `features = ["test"]` which includes all providers
+
+### Code Quality Tools
+- **Ruff**: Linting and formatting (120 char line length)
+- **Biome**: JSON/JSONC formatting with trailing commas
+- **Pre-commit**: Auto-runs ruff, biome. Install with `pre-commit install`
+- Use `git pull --rebase` if pre-commit.ci commits to your branch
+
+## Key Configuration Files
+
+### `pyproject.toml`
+- **Build**: `hatchling` with `hatch-vcs` for git-based versioning
+- **Dependencies**: Minimal core (scanpy, pydantic); provider packages are optional extras
+- **Extras**: `[openai]`, `[anthropic]`, `[gemini]`, `[all-providers]`, `[test]`, `[doc]`
+- **Ruff**: 120 char line length, NumPy docstring convention
+- **Test matrix**: Python 3.11 & 3.13
+
+### Version Management
+- Version from git tags via `hatch-vcs`
+- Release: Create GitHub release with tag `vX.X.X`
+- Follows **Semantic Versioning**
+
+## Project-Specific Patterns
+
+### Basic Usage
+```python
+from cell_annotator import CellAnnotator
+
+# Annotate across multiple samples
+cell_ann = CellAnnotator(
+    adata,
+    species="human",
+    tissue="heart",
+    cluster_key="leiden",
+    sample_key="batch",
+    provider="openai",  # or "gemini", "anthropic"
+).annotate_clusters()
+
+# Results in adata.obs['cell_type_predicted']
+```
+
+### LLM Provider Selection
+- Providers: `"openai"` (default), `"gemini"`, `"anthropic"`
+- API keys via environment variables or `.env` file (loaded with python-dotenv)
+- Models: `gpt-4o-mini`, `gemini-2.5-flash-lite`, `claude-haiku-4-5` (defaults)
+- Anthropic is the most expensive provider ($1/$5 per 1M tokens), so minimize its usage in tests
+- All providers use model aliases that auto-update to latest snapshots
+
+### Structured Outputs with Pydantic
+- `CellTypeListOutput`: List of expected cell types
+- `ExpectedMarkerGeneOutput`: Dict of cell type → marker genes
+- Ensures reliable, parseable LLM responses
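To make "reliable, parseable" concrete, here is a standard-library stand-in for the kind of check a Pydantic model performs. It is not the package's `ExpectedMarkerGeneOutput`: the function name and the assumed JSON shape (an object mapping each cell type to a list of gene names) are illustrative assumptions.

```python
import json


def parse_marker_gene_response(raw: str) -> dict[str, list[str]]:
    """Validate a JSON reply mapping cell type -> list of marker gene names.

    Hypothetical helper; the reply shape is an assumption for illustration.
    """
    data = json.loads(raw)
    if not isinstance(data, dict):
        raise ValueError("expected a JSON object mapping cell types to gene lists")
    for cell_type, genes in data.items():
        if not isinstance(genes, list) or not all(isinstance(g, str) for g in genes):
            raise ValueError(f"marker genes for {cell_type!r} must be a list of strings")
    return data
```

A schema-based model replaces this hand-written checking with declared field types, so malformed replies fail at parse time rather than later in the pipeline.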
+
+### AnnData Conventions
+- Marker genes computed via `scanpy.tl.rank_genes_groups()`
+- Results stored in `adata.obs[cell_type_key]` (default: `"cell_type_predicted"`)
+- Confidence scores in `adata.obs[f"{cell_type_key}_confidence"]`
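The column naming above can be sketched as a tiny helper. `result_columns` is a hypothetical name; only the default key and the `_confidence` suffix are taken from the conventions listed here.

```python
def result_columns(cell_type_key: str = "cell_type_predicted") -> tuple[str, str]:
    """Return the `.obs` column names for annotations and confidence scores."""
    return cell_type_key, f"{cell_type_key}_confidence"
```

With the default key this yields `cell_type_predicted` and `cell_type_predicted_confidence`, so downstream code can derive both names from a single setting.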
+
+## Common Gotchas
+
+1. **Hatch for testing**: Always use `hatch test`, never standalone `pytest`. CI matches the hatch test matrix.
+2. **API keys**: Must be set as env vars or in `.env` file. Package auto-loads via python-dotenv.
+3. **Provider packages**: Install provider extras (`pip install cell-annotator[openai]`) to use specific LLMs.
+4. **Real LLM tests**: Use `@pytest.mark.real_llm_query` and skip in CI unless explicitly enabled.
+5. **Marker gene filtering**: Package automatically filters marker genes to genes present in `adata.var_names`.
+6. **Pre-commit conflicts**: Use `git pull --rebase` to integrate pre-commit.ci fixes.
+7. **Line length**: Ruff set to 120 chars, but keep docstrings readable (~80 chars per line).
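The marker gene filtering mentioned in gotcha 5 can be sketched with plain Python containers. This is not the package's implementation: `filter_marker_genes` is a hypothetical name, `var_names` stands in for `adata.var_names`, and exact, case-sensitive matching is an assumption.

```python
def filter_marker_genes(
    markers: dict[str, list[str]], var_names: list[str]
) -> dict[str, list[str]]:
    """Keep only marker genes that occur in `var_names`, preserving order."""
    present = set(var_names)  # set lookup keeps the filter linear in gene count
    return {
        cell_type: [gene for gene in genes if gene in present]
        for cell_type, genes in markers.items()
    }
```

A cell type whose markers are all missing ends up with an empty list rather than being dropped, which keeps the gap visible to the caller.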
+
+## Related Resources
+
+- **Contributing guide**: `docs/contributing.md`
+- **Tutorials**: `docs/notebooks/tutorials/`
+- **OpenAI structured outputs**: https://platform.openai.com/docs/guides/structured-outputs
+- **scanpy docs**: https://scanpy.readthedocs.io/
+- **Pydantic docs**: https://docs.pydantic.dev/
@@ -61,6 +61,13 @@ jobs:
     name: ${{ matrix.env.label }}
     runs-on: ${{ matrix.os }}
 
+    env:
+      OS: ${{ matrix.os }}
+      PYTHON: ${{ matrix.python }}
+      OPENAI_API_KEY: ${{ secrets.OPENAI_API_KEY }}
+      GEMINI_API_KEY: ${{ secrets.GEMINI_API_KEY }}
+      ANTHROPIC_API_KEY: ${{ secrets.ANTHROPIC_API_KEY }}
+
     steps:
       - uses: actions/checkout@v4
         with:
@@ -87,6 +94,8 @@ jobs:
           uvx hatch run ${{ matrix.env.name }}:coverage xml # create report for upload
       - name: Upload coverage
         uses: codecov/codecov-action@v5
+        with:
+          token: ${{ secrets.CODECOV_TOKEN }}
 
   # Check that all tests defined above pass. This makes it easy to set a single "required" test in branch
   # protection instead of having to update it frequently. See https://github.com/re-actors/alls-green#why.
 
@@ -19,3 +19,13 @@ __pycache__/
 # docs
 /docs/generated/
 /docs/_build/
+/docs/notebooks/tests/
+
+# Jupyter
+.ipynb_checkpoints
+
+# Data files
+*.h5ad
+
+# Environment files
+*.env
@@ -0,0 +1,2 @@
+[FORMAT]
+max-line-length=120
@@ -3,13 +3,84 @@
 All notable changes to this project will be documented in this file.
 
 The format is based on [Keep a Changelog][],
-and this project adheres to [Semantic Versioning][].
+and this project adheres to [Semantic Versioning][]. Full commit history is available in the [commit logs][].
 
 [keep a changelog]: https://keepachangelog.com/en/1.0.0/
 [semantic versioning]: https://semver.org/spec/v2.0.0.html
+[commit logs]: https://github.com/quadbio/cell-annotator/commits
 
-## [Unreleased]
+## Version 0.2
 
-### Added
+### Unreleased
 
--   Basic tool, preprocessing and plotting functions
+### 0.2.0 (2025-07-26)
+
+#### Added
+- Added a generic LLM backend that supports OpenAI, Claude and Gemini models {pr}`53`
+- Add the possibility to provide the current gene set when querying expected marker genes {pr}`53`
+- Add the possibility to filter expected marker genes to those present in AnnData {pr}`53`
+- Add a new tutorial on spatial data annotation {pr}`53`
+- Added and improved tests for the new classes (e.g. ObsBeautifier, LLMBackend, etc) {pr}`53`
+- For each backend, add a small `test_query` method which can be used for diagnostics {pr}`53`
+
+#### Changed
+- Moved the `reorder_and_color` utility into a new class: `ObsBeautifier` {pr}`54`
+- Improved class representations throughout the package {pr}`53`
+
+#### Fixed
+- Fix the `ObsBeautifier` modifying cluster colors when only their order should be updated {pr}`54`
+
+## Version 0.1
+
+### 0.1.5 (2025-07-24)
+
+#### Changed
+- Update tutorials to use `gpt-4.1` {pr}`51`
+
+### 0.1.4 (2025-03-28)
+
+#### Added
+
+- Use `rapids_singlecell`, `cupy` and `cuml` to accelerate cluster marker computation on GPUs {pr}`37`.
+- Add the possibility to softly enforce adherence to expected cell types {pr}`42`.
+
+#### Changed
+
+- Run cluster label harmonization also for a single sample {pr}`37`.
+- Re-format prompts into a dataclass {pr}`42`.
+
+#### Fixed
+
+- Fixed a bug with integer sample labels {pr}`37`.
+
+### 0.1.3 (2025-02-07)
+
+#### Added
+
+- Added tests for the single-sample case {pr}`29`.
+- Refer to issues and PRs with sphinx {pr}`30`.
+
+#### Removed
+
+- Removed `tenacity` for query retries {pr}`28`.
+
+#### Fixed
+
+- Fixed `_get_annotation_summary_string` for the single-sample case {pr}`29`.
+- Fixed the expected cell type marker test by adding additional marker genes {pr}`28`.
+
+### 0.1.2 (2025-01-29)
+
+#### Added
+
+- Update the documentation, in particular the installation instructions.
+
+### 0.1.1 (2025-01-29)
+
+#### Added
+
+- Initial push to PyPI
+
+### 0.1.0 (2025-01-29)
+
+Initial package release
@@ -1,6 +1,6 @@
 MIT License
 
-Copyright (c) 2024, Marius Lange
+Copyright (c) 2024, QuaDBio Lab
 
 Permission is hereby granted, free of charge, to any person obtaining a copy
 of this software and associated documentation files (the "Software"), to deal