You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I’m currently learning to use Signac for analysing single-cell chromatin data from histone marks, and while exploring quality metrics like fraction of reads in enhancer and promoter regions I noticed that FractionCountsInRegion() sometimes returns values greater than 1.
I initially thought this might reflect some normalization step, but when I looked into the source code I saw that the helper function CountsInRegion() uses queryHits(findOverlaps(...)) to subset the count matrix. I wonder if this might accidentally count the same bin multiple times when a single bin overlaps several regions (e.g., enhancers). Of course, it’s possible that I’m misunderstanding the intended behaviour — if this design is correct for a reason I’ve missed, I’d really appreciate clarification.
In the toy example below, all five enhancer intervals overlap the same bin. FractionCountsInRegion() seems to sum that bin five times (once per overlap), producing fractions > 1. When I collapse to unique bin indices (unique(queryHits)), the result becomes ≤ 1, which in my mind matches the idea of “fraction of reads in regions.”
Thank you very much for your time and for maintaining this great package
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
Uh oh!
There was an error while loading. Please reload this page.
-
Hello!
I’m currently learning to use Signac for analysing single-cell chromatin data from histone marks, and while exploring quality metrics like fraction of reads in enhancer and promoter regions I noticed that FractionCountsInRegion() sometimes returns values greater than 1.
I initially thought this might reflect some normalization step, but when I looked into the source code I saw that the helper function CountsInRegion() uses queryHits(findOverlaps(...)) to subset the count matrix. I wonder if this might accidentally count the same bin multiple times when a single bin overlaps several regions (e.g., enhancers). Of course, it’s possible that I’m misunderstanding the intended behaviour — if this design is correct for a reason I’ve missed, I’d really appreciate clarification.
In the toy example below, all five enhancer intervals overlap the same bin. FractionCountsInRegion() seems to sum that bin five times (once per overlap), producing fractions > 1. When I collapse to unique bin indices (unique(queryHits)), the result becomes ≤ 1, which in my mind matches the idea of “fraction of reads in regions.”
Thank you very much for your time and for maintaining this great package
sessionInfo()
Beta Was this translation helpful? Give feedback.
All reactions