-
Notifications
You must be signed in to change notification settings - Fork 15
fix: merge snv variants script #1499
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
Codecov ReportAll modified and coverable lines are covered by tests ✅
Additional details and impacted files@@ Coverage Diff @@
## develop #1499 +/- ##
===========================================
- Coverage 99.48% 99.45% -0.03%
===========================================
Files 40 40
Lines 1932 2020 +88
===========================================
+ Hits 1922 2009 +87
- Misses 10 11 +1
Flags with carried forward coverage won't be shown. Click here to find out more. ☔ View full report in Codecov by Sentry. |
|
…LSAMIC into create_tnscope_mnvs
fevac
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
🚀
|
### Added: * Added option to disable hard filter of variants in matched normal #1509 * Added check to verify sample sex for all workflows #1516 * SOR filter to WGS TN SNV quality filter #1506 * GT field to IGH-DUX4 variant #1527 * ONC field annotations from Clinvar #1527 * Added memory option #1535 * Added max SOR 3 to TNscope TGA TN workflow #1526 * Added max RPA 12 to TNscope TGA workflow #1526 ### Changed: * Reworked bcftools filters #1509 * Renamed high_normal_tumor_af_frac to in_normal #1509 * check to verify sample sex for all workflows #1516 * Merging SNVs into MNVs in TNscope TGA #1524 * Change raw delivery SNV file for TGA to before any post-processing #1524 * Changed VarDict and TNscope VCF merged method to custom script #1499 * Changed QC thresholds for WGS normal and WES #1477 * Change VarDict memory usage to fix crashes in production #1537 * Updated cluster resources for tnscope WGS TN #1535 * Disable SV calling in TNscope #1541 ### Removed: * Remove WGS-level GC-bias metric from TGA workflow #1521 * Remove parallelization of VarDict per chromosome #1544 ### Fixed: * Merged VarDict and TNscope variants now correctly show both callers in FOUND_IN info field #1499 * Fixed somalier container #1538



Description
MERGED THESE PRs INTO THIS ONE to test all new features together:
This method is replacing the current bcftools concat method.
The bcftools concat method has a couple of issues:
Also added this to header of merged vcf:
Also removed the filepath in the FOUND_IN pre-processing by
edit_vcf_info.pyBased on @khurrammaqbool suggestion I also maintained single values for the AF and DP fields from the 1st VCF in the INFO field and added a new list of AF and DP values which contains the values from both, instead of as previously transforming the AF and DP field directly into a list.
Changed
Documentation
Tests
Feature Tests
Verify that both TNscope and VarDict shows up as callers for merged variants in Scout
Verify that INFO field from VarDict and TNscope variants keep AF and DP as single-value fields for merged variants, and that separate AF and DP LIST fields are created with values from each caller. See sheet: https://docs.google.com/spreadsheets/d/1kB2vNaEBmol0tX3HUR3UY1LPQCtWZLutixGwnvbkmhY/edit?gid=1578036768#gid=1578036768
Verify that running the https://github.com/EBIvariation/vcf-validator does not show any new errors that weren't present in the original merged VCF from 16.0.0.
Errors in clinical.filtered.pass vcf from uphippo v16.0.0:
Errors in clinical.filtered.pass vcf from uphippo this PR:
Pipeline Integrity Tests
.hkfile)Clinical Genomics Stockholm
Documentation
Panel of Normal specific criteria
User Changes
Infrastructure Changes
Checklist
Important
Ensure that all checkboxes below are ticked before merging.
For Developers
For Reviewers
conditions where applicable, with satisfactory results.