Extracting Marker Info (SNPs used for calling contamination) #67

@shubhambinf1987

Hi,

We are using VerifyBamID2 (VBID2) on some adaptive-sampling ONT data sets and would like to know how to get the actual markers (SNPs) used for detecting contamination. The log file only reports the totals:

[SimplePileup] Total Number Markers: 1713
[SimplePileup] Total Number Bases: 48156

We are currently using 1000g.phase3.100k.b38, which significantly slows the process (both time and memory).

The main goal is to see whether a consistent set of markers is used across multiple samples, and then to use that set to generate a custom reference VCF following the guidelines posted at https://github.com/Griffan/VerifyBamID, so that VBID2 checks only the regions in the custom reference file instead of all 100K SNPs.
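
To make the idea concrete, here is a rough sketch of the intersect-and-filter step we have in mind. It assumes the per-sample marker sets are already available as (chromosome, position) pairs, which is exactly the part we cannot currently get from VBID2. The function names `shared_markers` and `filter_vcf_lines` are hypothetical helpers, not part of VBID2.

```python
from typing import Iterable, Iterator

# A marker is identified by (chromosome, 1-based position).
Marker = tuple[str, int]


def shared_markers(per_sample: Iterable[set[Marker]]) -> set[Marker]:
    """Return the markers that appear in every sample's marker set."""
    sets = list(per_sample)
    if not sets:
        return set()
    return set.intersection(*sets)


def filter_vcf_lines(vcf_lines: Iterable[str], keep: set[Marker]) -> Iterator[str]:
    """Yield VCF header lines plus records whose (CHROM, POS) is in `keep`."""
    for line in vcf_lines:
        if line.startswith("#"):
            # Header and column lines are passed through unchanged.
            yield line
            continue
        fields = line.split("\t", 2)
        if len(fields) < 2:
            # Skip blank or malformed records.
            continue
        chrom, pos = fields[0], fields[1]
        if (chrom, int(pos)) in keep:
            yield line
```

The filtered lines could then be written out as the custom reference VCF. Whether a strict intersection is the right choice for low-coverage adaptive sampling data is an open question; a "present in at least N samples" rule might be more practical.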

Any other suggestions are greatly appreciated.

Thanks, and looking forward to your reply.
