skip chromosomes not in VCFs by mfansler · Pull Request #266 · wheaton5/souporcell

mfansler · 2026-01-07T10:03:27Z

As discussed in #249, computing depth for the BAM regions is non-uniform and, for most scRNA-seq, will bottleneck on the MT chromosome if included. When using known or common variants, one can work around this by only computing depth on chromosomes that are included in the VCF(s). This PR implements that functionality.

In practice with common variants, I see this reduce the get_bam_regions step from over an hour to under 10 mins.

Added functionality to filter BAM regions based on known chromosomes from VCF files and created a new function to read chromosome names from VCF.

exclude chromosomes not in VCFs

e49f81b

Added functionality to filter BAM regions based on known chromosomes from VCF files and created a new function to read chromosome names from VCF.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

skip chromosomes not in VCFs#266

skip chromosomes not in VCFs#266
mfansler wants to merge 1 commit intowheaton5:masterfrom
mfansler:bam-vcf-filter

mfansler commented Jan 7, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

mfansler commented Jan 7, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant