-
Notifications
You must be signed in to change notification settings - Fork 16
Description
Hi
We are using VBID2 on some adaptive sampling ONT data sets and wondering how to get the actual markers (SNPs ) used for predicting / detecting contamination ? The log file only provides
[SimplePileup] Total Number Markers: 1713
[SimplePileup] Total Number Bases: 48156
As of now we are using 1000g.phase3.100k.b38 which is significantly slowing the process ( both time and memory )
The main goal is to see if a consistent set of markers used across multiple samples , and then use that to generate custom ref.vcf file following guidelines posted here: https://github.com/Griffan/VerifyBamID, to force VBID2 to only check regions provided in custom reference file instead of all 100K SNPs.
Any other suggestions is greatly appreciated.
Thanks and looking forward.