4.2.3.0
Download release: gatk-4.2.3.0.zip
Docker image: https://hub.docker.com/r/broadinstitute/gatk/
Highlights of the 4.2.3.0 release:
-
Notable bug fixes for
Mutect2andFuncotator -
Support in
CombineGVCFsandGenotypeGVCFsfor "reblocked" GVCFs as produced by theReblockGVCFtool. Reblocked GVCFs have a significantly reduced storage footprint. -
More control over the Smith-Waterman parameters in
HaplotypeCallerandMutect2 -
A new Fragment Allele Depth (
FAD) variant annotation similar to theADannotation except that allele support is considered per read pair, not per individual read -
GenomicsDB bug fixes and enhancements
Full list of changes:
-
HaplotypeCaller/Mutect2
- Fixed a bug where
Mutect2failed to filter germline variants with alternate representations (#7103)- This caused variants with alternative representations in gnomAD to not be recognized as being the same as called variants in some cases. This resulted in variants that were called and not filtered, but they should have been filtered by "germline".
- Exposed Smith-Waterman parameters as tool arguments in
HaplotypeCaller,Mutect2, andFilterAlignmentArtifacts. (#6885)- Enables use of alternative parameters for different event representation (e.g. three consecutive SNPs instead of two small indels)
- Can now specify the Smith-Waterman implementation in
FilterAlignmentArtifacts(#7105) - Added a
--debug-assembly-variants-outdiagnostic option to output a side VCF with variants detected by assembly forHaplotypeCallerandMutect2(#7384) Mutect2: the--genotype-germline-sitesargument is no longer marked as experimental (#7533)
- Fixed a bug where
-
GenotypeGVCFs / CombineGVCFs
- Updated
CombineGVCFsandGenotypeGVCFsto handle "reblocked" GVCFs with diploid data that are potentially missing hom-ref genotype PLs (#7223) - Homozygous reference genotypes with no PLs and zero depth are now output as no-calls by
GenotypeGVCFs(#7471) - Bug fixes for
GenotypeGVCFs/GnarlyGenotyperwhen allele-specific annotations have empty values due to lack of informative reads or no depth (#7491) (#7186)
- Updated
-
GenomicsDB
- Added a new
--call-genotypesGenomicsDB argument, enabling output of called genotypes (i.e. not ./.) when tools likeCombineGVCFsandSelectVariantsread from a GenomicsDB workspace (#7223) - Added a
--bypass-feature-readerargument toGenomicsDBImportto allow the C-based htslib VCF reader implementation to be used instead of the Java implementation (#7393)- Using this option will reduce memory usage and potentially speed up the import process
- Updated to GenomicsDB 1.4.2 (#7520)
- This release fixes a commonly-encountered bookkeeping issue with GenomicsDB array fragments. Should fix errors of the type: "Error: Cannot read from buffer; Error: cannot load book-keeping" as reported in #7012
- Full release notes are here: https://github.com/GenomicsDB/GenomicsDB/releases/tag/v1.4.2
- Added a new
-
Funcotator
-
CNV Calling
- CNV WDLs now handle BAM/CRAM index paths explicitly, as for cases where the index is not in the same path as its file (#7518)
- gCNV in the CASE mode now fills in all hidden DenoisingModelConfig and CopyNumberCallingConfig arguments from the input model configuration (#7464)
- Exposed number of samples used for estimating denoised copy ratios in gCNV via a new
--num-samples-copy-ratio-approxargument (#7450)
-
SV Calling
JointGermlineCNVSegmentation: bug fixes and refactoring (#7243)- A number of bugs, particularly with max-clique clustering, have been fixed, as well as a parameter swap bug in
JointGermlineCNVSegmentation - Reworks classes used by
JointGermlineCNVSegmentationfor SV clustering and defragmentation. The design ofSVClusterEnginehas been overhauled to enable the implementation ofCNVDefragmenterandBinnedCNVDefragmentersubclasses. Logic for producing representative records from a collection of clustered SVs has been separated into anSVCollapserclass, which provides enhanced functionality for handling genotypes for SVs more generally.
- A number of bugs, particularly with max-clique clustering, have been fixed, as well as a parameter swap bug in
-
Notable Enhancements
- Added a new Fragment Allele Depth (
FAD) variant annotation (#7511)- This annotation is identical to the
ADannotation except that allele support is considered per read pair, not per individual read
- This annotation is identical to the
- Added a new Fragment Allele Depth (
-
Miscellaneous Changes
-
Documentation
-
Dependencies