Germ-line exomes, single samples #1023
marchoeppner
started this conversation in
General
Replies: 1 comment
-
|
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Hi,
apologies for using this route, but I am quite stuck with my processing and cannot find a straight answer elsewhere.
What is the issue?
I have clinical exomes, typically single samples from patients across a range of diseases (routine human genetics diagnostics). For these samples, I would like to call putative CNVs using CNVkit. All exomes are sequenced on the same instrument, using the same wet-lab pipeline and exome kit (IDT xGen). Each sequencing run holds up to 48 (unrelated) samples and generates around 100X coverage per sample. Downstream processing consists of alignment (BWA) and duplicate marking against hg38 (without ALT contigs).
BED file
The documentation is a bit unclear here. The option is called "target", but the text mentions "baits" a few times. In exome sequencing, these are two different things. "Baits" are the actual stretches used for constructing the RNA baits, whereas targets are usually the exons targetted for capture. One or more baits can map to one target (e.g. long exons). So is it baits or targets you expect here? Vedors supply BED files for both, usually.
Resolution
The documentation mentions a resolution of >1MB for exomes. But the method "hmm-germline" apparently changes this? What is a realistic lower limit here?
Expected number of CNVs
I have no idea if my results are anywhere near realistc. Depending on my exact approach, I get anything between a few dozen (reference from all samples in a sequencing run) to up to 1000 CNV calls (flat reference). What is a "typical" number for exomes from non-tumor samples?
Thanks for the help!
/Marc
Beta Was this translation helpful? Give feedback.
All reactions