You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
feat: add --reference-accession filter for BAM files (#421)
RSV-A/RSV-B BAM files from V-PIPE contain alignments to multiple similar
references because the viruses are similar enough that reads cross-align.
This adds a --reference-accession CLI option to filter reads by target
reference sequence.
Changes:
- Add --reference-accession CLI flag to process-from-vpipe command
- Add REFERENCE_ACCESSION config option for Snakefile workflow
- Filter reads in bam_to_fastq_handle_indels() by reference name
- Add ZeroFilteredReadsError when filtering results in zero reads
- Log filtering statistics (reads kept/filtered with percentages)
- Thread target_reference parameter through translate_align.py
- Skip filtered reads during AA enrichment
Test data:
- Add H3_16_2025_11_15 BAM with reads to both RSV-A (3086) and RSV-B (48711)
- Update test_convert.py with reference filtering tests
- Add snakemake integration test for RSV-A with reference filtering
Config:
- Add REFERENCE_ACCESSION to deployments/rsva/config.yaml
- Update pre-commit to allow large BAM files in tests/data/
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
0 commit comments