Fix Metadata checks #60

Merged

Marius1311 merged 17 commits into main from fix/rapids-metadata-check on Jan 16, 2026

Conversation

@Marius1311 (Member)

This fixes and improves a few things:

  • metadata checks for packages: I had RAPIDS installed, but the checking module still failed
  • batched mode for all k-NN backends, so that you can run with many neighbors on memory-limited GPUs
  • some improvements to confusion matrix plotting (better NaN filtering, an option to mask cells, etc.)

- Move _batched_query helper to _knn_backend.py (works with any backend)
- Add batch_size parameter to Kernel.compute_neighbors()
- Simplify _RapidsBackend.query() to single query with cleanup
- All backends now benefit from optional batching for memory management
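The batching idea behind the new batch_size parameter can be sketched as follows. This is a minimal illustration using sklearn as a stand-in backend; the function name and signature are hypothetical, not the actual _batched_query implementation in _knn_backend.py:

```python
import numpy as np
from sklearn.neighbors import NearestNeighbors

def batched_query(index, query, n_neighbors, batch_size=None):
    """Query a fitted k-NN index in memory-bounded batches.

    Sketch only (hypothetical names): with batch_size=None the whole
    query is processed at once; otherwise the query matrix is split
    into chunks so peak memory stays bounded, e.g. on small GPUs.
    """
    if batch_size is None:
        return index.kneighbors(query, n_neighbors=n_neighbors)
    dists, idxs = [], []
    for start in range(0, query.shape[0], batch_size):
        d, i = index.kneighbors(
            query[start:start + batch_size], n_neighbors=n_neighbors
        )
        dists.append(d)
        idxs.append(i)
    return np.vstack(dists), np.vstack(idxs)

rng = np.random.default_rng(0)
ref = rng.normal(size=(1000, 8))
q = rng.normal(size=(250, 8))
nn = NearestNeighbors(n_neighbors=15).fit(ref)

d_full, i_full = batched_query(nn, q, n_neighbors=15)
d_batched, i_batched = batched_query(nn, q, n_neighbors=15, batch_size=64)
# Batching must not change the results, only the peak memory footprint.
assert np.array_equal(i_full, i_batched)
assert np.allclose(d_full, d_batched)
```

Because each batch is an independent query against the same fitted index, results are identical to the unbatched call; only peak memory changes.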
- Add _make_key() helper that omits the underscore when the postfix is empty
- Passing prediction_postfix='' now stores the result as 'key' instead of 'key_'
- Same behavior for confidence_postfix

BREAKING: prediction_postfix and confidence_postfix now default to '_pred'
and '_conf' respectively. Pass the full postfix, including any separator.

- prediction_postfix='_pred' (was 'pred')
- confidence_postfix='_conf' (was 'conf')
- Use '' for no postfix (stores directly under the original key)
- Removed the _make_key helper; keys are now built by simple concatenation
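The new naming rule reduces to plain string concatenation. A tiny sketch (the function name is illustrative, not part of the cellmapper API):

```python
def result_key(key: str, postfix: str) -> str:
    """Build the storage key for a prediction/confidence result.

    Sketch of the post-change behavior: the postfix carries its own
    separator (e.g. '_pred', '_conf'), so an empty postfix stores the
    result directly under the original key.
    """
    return f"{key}{postfix}"

assert result_key("celltype", "_pred") == "celltype_pred"
assert result_key("celltype", "_conf") == "celltype_conf"
assert result_key("celltype", "") == "celltype"  # no trailing underscore
```

This avoids the old edge case where an empty postfix produced a dangling 'key_'.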
Allow filtering cells for the confusion matrix via a boolean mask:

  cmap.plot_confusion_matrix(label_key='celltype', subset=query.obs['time'] == 'E8.5')

When y_true and y_pred have different categories (e.g., the reference has more
time points than the query), use the union of both category sets as labels.

sklearn interprets float-typed categorical data as continuous; convert
to strings to ensure proper categorical handling.
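Both fixes can be illustrated together. The data below is made up for the sketch (float-coded time points, with the query missing one reference category); the str conversion and explicit labels= union are the two changes described above:

```python
import pandas as pd
from sklearn.metrics import confusion_matrix

# Assumed toy data: float-typed time-point categories, where the
# query (y_pred) lacks the reference's earliest time point.
y_true = pd.Series([7.5, 8.0, 8.5], dtype="category")  # reference labels
y_pred = pd.Series([8.0, 8.0, 8.5], dtype="category")  # query predictions

# Convert to strings so sklearn treats the values as categorical
# rather than continuous, then pass the union of both category sets
# so the matrix is square even when one side has fewer categories.
labels = sorted(set(y_true.astype(str)) | set(y_pred.astype(str)))
cm = confusion_matrix(y_true.astype(str), y_pred.astype(str), labels=labels)
print(cm.shape)  # (3, 3): square despite y_pred having only two categories
```

Without the explicit labels=, sklearn would infer the label set from the data and drop rows/columns for categories absent from one side.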
…ibility

The pynndescent transformer is non-deterministic without a fixed seed,
causing test failures on CI (Linux) while passing locally (macOS).
Pre-release dependency tests (numba + numpy 2.0) may fail due to
compatibility issues in upstream packages. These failures are expected
and shouldn't block PRs.

sphinx-tabs 3.4.7 is incompatible with docutils 0.22 (KeyError: 'backrefs').
See executablebooks/sphinx-tabs#206

pynndescent uses SIMD instructions that produce different results on
different CPU architectures (macOS ARM vs. Linux x86), making exact
matrix comparison impossible. The sklearn test still validates the
core functionality.
@codecov

codecov bot commented Jan 16, 2026

Codecov Report

❌ Patch coverage is 51.51515% with 32 lines in your changes missing coverage. Please review.
✅ Project coverage is 85.38%. Comparing base (a935146) to head (44abfbd).
⚠️ Report is 4 commits behind head on main.

Files with missing lines              Patch %   Lines
src/cellmapper/model/evaluate.py        9.09%   20 Missing ⚠️
src/cellmapper/model/_knn_backend.py   26.66%   11 Missing ⚠️
src/cellmapper/model/cellmapper.py     92.30%    1 Missing ⚠️
Additional details and impacted files
@@            Coverage Diff             @@
##             main      #60      +/-   ##
==========================================
- Coverage   86.98%   85.38%   -1.61%     
==========================================
  Files          13       13              
  Lines        1245     1286      +41     
==========================================
+ Hits         1083     1098      +15     
- Misses        162      188      +26     
Files with missing lines              Coverage Δ
src/cellmapper/_docs.py               100.00% <ø> (ø)
src/cellmapper/check.py               100.00% <100.00%> (ø)
src/cellmapper/model/kernel.py         89.05% <100.00%> (ø)
src/cellmapper/model/cellmapper.py     84.10% <92.30%> (+0.25%) ⬆️
src/cellmapper/model/_knn_backend.py   56.14% <26.66%> (-4.47%) ⬇️
src/cellmapper/model/evaluate.py       73.88% <9.09%> (-6.72%) ⬇️

pynndescent is fundamentally approximate and uses SIMD instructions that
produce different results on different CPU architectures. Instead of
skipping the test entirely, use a correlation-based comparison (r > 0.99)
to validate the code path while accommodating minor platform differences.

Add a prominent note in the Important Notes section that testing must use
'hatch test', not 'uv run pytest'. This ensures the test matrix matches CI.

Linux x86 produces ~0.97 correlation while macOS ARM produces ~0.99.
Use a 0.95 threshold to accommodate platform-dependent SIMD differences
while still validating that the matrices are structurally similar.
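The correlation-based comparison can be sketched like this. The helper name and the simulated "platform noise" are illustrative, not the actual test code:

```python
import numpy as np

def correlation_close(a: np.ndarray, b: np.ndarray,
                      threshold: float = 0.95) -> bool:
    """Compare two matrices by Pearson correlation of their flattened
    entries rather than exact equality (hypothetical test helper).

    Tolerates small platform-dependent numerical differences (e.g. from
    SIMD) while still failing for structurally different matrices.
    """
    r = np.corrcoef(a.ravel(), b.ravel())[0, 1]
    return r > threshold

rng = np.random.default_rng(0)
exact = rng.random((50, 50))
# Simulate small platform-dependent differences on top of the same matrix.
perturbed = exact + rng.normal(scale=1e-3, size=exact.shape)
unrelated = rng.random((50, 50))

assert correlation_close(exact, perturbed)        # structurally similar
assert not correlation_close(exact, unrelated)    # genuinely different
```

The threshold trades strictness for portability: 0.99 passed on macOS ARM but not on Linux x86 (~0.97), hence the 0.95 floor chosen above.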
@Marius1311 Marius1311 merged commit e706485 into main Jan 16, 2026
7 of 8 checks passed
@Marius1311 Marius1311 deleted the fix/rapids-metadata-check branch January 16, 2026 13:28