You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: CHANGELOG.md
+16-6Lines changed: 16 additions & 6 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -28,9 +28,12 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0
28
28
- Add peddy --sites hg38 argument when running with GRCh38 [#799](https://github.com/nf-core/raredisease/pull/799)
29
29
- Saltshaker for downstream processing of mitochondrial SV calls from MitoSAlt [#775](https://github.com/nf-core/raredisease/pull/775)
30
30
- Env variable NXF_SINGULARITY_NEW_PID_NAMESPACE = false to accommodate hisat2 running with latest Nextflow and Singularity [#775](https://github.com/nf-core/raredisease/pull/775)
31
+
- Parameter `exclude_alt` to filter alignments to alt/unplaced contigs after alignment using samtools view, retaining only primary chromosomes (GRCh37: 1-22,X,Y,MT / GRCh38: chr1-chr22,chrX,chrY,chrM). Note that enabling this will restrict variant calling to these chromosomes [#803](https://github.com/nf-core/raredisease/pull/803)]
32
+
- Parameters `save_all_mapped_as_cram` and `save_noalt_mapped_as_cram` to replace `save_mapped_as_cram`, allowing independent control over publishing unfiltered and alt-filtered alignment files as CRAM [#807](https://github.com/nf-core/raredisease/pull/807)
31
33
32
34
### `Changed`
33
35
36
+
- Sort parameters of `CALL_STRUCTURAL_VARIANTS` and `CALL_SV_MANTA` alphabetically [[#](https://github.com/nf-core/raredisease/pull/)]
34
37
- Use distinct output filenames for bcfools (in call_mobile_elements subworkflow) and svdb (in call_sv_tiddit subworkflow) [#716](https://github.com/nf-core/raredisease/pull/716)
35
38
- Use nf-core's most severe consequence & pli scripts instead of local ones [#732](https://github.com/nf-core/raredisease/pull/732)
36
39
- Use nf-core's VCF_FILTER_BCFTOOLS_ENSEMBLVEP subworkflow to generate clinical set instead of a local subworkflow [#727](https://github.com/nf-core/raredisease/pull/727)
@@ -56,6 +59,9 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0
56
59
- Run MitoSAlt.pl from bin rather than within container [#775](https://github.com/nf-core/raredisease/pull/775)
57
60
- Include mitochonrdial SV calls in combined SV vcf, change call_sv output directory structure to remove mitochondria/ and genome/ [#775](https://github.com/nf-core/raredisease/pull/775)
58
61
- Remove Qualimap and Haplogrep3 as they were made redundant by Picard and VerifyBamID2 [#801](https://github.com/nf-core/raredisease/pull/801)
62
+
- Remove env variable NXF_SINGULARITY_NEW_PID_NAMESPACE from the config since this has to be set outside the subworkflow [#804](https://github.com/nf-core/raredisease/pull/804)
63
+
- Run UPD_SITES, UPD_REGIONS, and CHROMOGRAPH for UPD only when analysis type is WGS [#806](https://github.com/nf-core/raredisease/pull/806)
64
+
- Change saltshaker classification output from txt to html [#808](https://github.com/nf-core/raredisease/pull/808)
59
65
60
66
### `Fixed`
61
67
@@ -67,12 +73,16 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0
-[Alignment and variant calling](#alignment-and-variant-calling)
62
62
-[MT deletion script](#mt-deletion-script)
63
-
-[eKLIPse](#eklipse)
63
+
-[MitoSAlt](#mitosalt)
64
+
-[saltshaker](#saltshaker)
64
65
-[Annotation](#annotation)
65
66
-[vcfanno](#vcfanno-1)
66
67
-[CADD](#cadd-1)
@@ -100,26 +101,28 @@ The pipeline is built using [Nextflow](https://www.nextflow.io/) and processes d
100
101
101
102
##### Picard's MarkDuplicates
102
103
103
-
[Picard MarkDuplicates](https://broadinstitute.github.io/picard/command-line-overview.html#MarkDuplicates) is used for marking PCR duplicates that can occur during library amplification. This is essential as the presence of such duplicates results in false inflated coverages, which in turn can lead to overly-confident genotyping calls during variant calling. Only reads aligned by Bwa-mem2, bwameme and bwa are processed by this tool. By default, alignment files are published in bam format. If you would like to store cram files instead, set`--save_mapped_as_cram` to true.
104
+
[Picard MarkDuplicates](https://broadinstitute.github.io/picard/command-line-overview.html#MarkDuplicates) is used for marking PCR duplicates that can occur during library amplification. This is essential as the presence of such duplicates results in false inflated coverages, which in turn can lead to overly-confident genotyping calls during variant calling. Only reads aligned by Bwa-mem2, bwameme and bwa are processed by this tool. By default, alignment files are published in bam format. To publish cram files instead, use `--save_all_mapped_as_cram` for the full (unfiltered) alignment, or`--save_noalt_mapped_as_cram` for the alt-filtered alignment (requires `--exclude_alt`).
104
105
105
106
<detailsmarkdown="1">
106
107
<summary>Output files from Alignment</summary>
107
108
108
109
-`{outputdir}/alignment/`
109
-
-`*.bam|*.cram`: Alignment file in bam/cram format.
110
+
-`*_sorted_md.bam|*_sorted_md.cram`: Full (unfiltered) alignment file. Published as bam by default, or as cram when `--save_all_mapped_as_cram` is set.
111
+
-`*_sorted_md_primary_contigs.cram`: Alt-filtered alignment file in cram format. Published when `--save_noalt_mapped_as_cram` is set (requires `--exclude_alt`). Contains only primary chromosomes (GRCh37: 1-22,X,Y,MT / GRCh38: chr1-chr22,chrX,chrY,chrM).
110
112
-`*.bai|*.crai`: Index of the corresponding bam/cram file.
111
113
-`*.txt`: Text file containing the dedup metrics.
112
114
</details>
113
115
114
116
##### Sentieon Dedup
115
117
116
-
[Sentieon Dedup](https://support.sentieon.com/manual/DNAseq_usage/dnaseq/#remove-or-mark-duplicates) is the algorithm used by Sentieon's driver to remove duplicate reads. Only reads aligned by Sentieon's implementation of bwa are processed by this algorithm. By default, alignment files are published in bam format. If you would like to store cram files instead, set`--save_mapped_as_cram` to true.
118
+
[Sentieon Dedup](https://support.sentieon.com/manual/DNAseq_usage/dnaseq/#remove-or-mark-duplicates) is the algorithm used by Sentieon's driver to remove duplicate reads. Only reads aligned by Sentieon's implementation of bwa are processed by this algorithm. By default, alignment files are published in bam format. To publish cram files instead, use `--save_all_mapped_as_cram` for the full (unfiltered) alignment, or`--save_noalt_mapped_as_cram` for the alt-filtered alignment (requires `--exclude_alt`).
117
119
118
120
<detailsmarkdown="1">
119
121
<summary>Output files from Alignment</summary>
120
122
121
123
-`{outputdir}/alignment/`
122
-
-`*.bam|*.cram`: Alignment file in bam/cram format.
124
+
-`*_sorted_md.bam|*_sorted_md.cram`: Full (unfiltered) alignment file. Published as bam by default, or as cram when `--save_all_mapped_as_cram` is set.
125
+
-`*_sorted_md_primary_contigs.cram`: Alt-filtered alignment file in cram format. Published when `--save_noalt_mapped_as_cram` is set (requires `--exclude_alt`). Contains only primary chromosomes (GRCh37: 1-22,X,Y,MT / GRCh38: chr1-chr22,chrX,chrY,chrM).
123
126
-`*.bai|*.crai`: Index of the corresponding bam/cram file.
124
127
-`*.metrics`: Text file containing the dedup metrics.
125
128
</details>
@@ -467,7 +470,7 @@ The pipeline for mitochondrial variant discovery, using Mutect2, uses a high sen
467
470
[Saltshaker](https://github.com/aksenia/saltshaker) allows for downstream clustering and classification of mtDNA strucutral variants. Called variants are combined with structural variants called in the nuclear genome.
468
471
469
472
-`call_sv`
470
-
-`<sample_id>.saltshaker_classify.txt`: report containing case-level classification of mitochondrial deletions.
473
+
-`<sample_id>.saltshaker_classify.html`: report containing case-level classification of mitochondrial deletions.
Copy file name to clipboardExpand all lines: docs/usage.md
+2Lines changed: 2 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -210,6 +210,7 @@ The mandatory and optional parameters for each category are tabulated below.
210
210
| | min_trimmed_length<sup>6</sup> |
211
211
| | extract_alignments |
212
212
| | restrict_to_contigs<sup>7</sup> |
213
+
| | exclude_alt<sup>8</sup> |
213
214
214
215
<sup>1</sup>Default value is bwamem2. Other alternatives are bwa, bwameme and sentieon (requires valid Sentieon license ).<br />
215
216
<sup>2</sup>Analysis set reference genome in fasta format, first 25 contigs need to be chromosome 1-22, X, Y and the mitochondria.<br />
@@ -218,6 +219,7 @@ The mandatory and optional parameters for each category are tabulated below.
218
219
<sup>5</sup>Used only by Sentieon.<br />
219
220
<sup>6</sup>Default value is 40. Used only by fastp.<br />
220
221
<sup>7</sup>Used to limit your analysis to specific contigs. Can be used to remove alignments to unplaced contigs to minimize potential errors. This parameter should be used in conjunction with the `extract_alignments` parameter.<br />
222
+
<sup>8</sup>When set to true, alignments to alt/unplaced contigs are removed after alignment using samtools view, retaining only primary chromosomes (GRCh37: 1-22,X,Y,MT / GRCh38: chr1-chr22,chrX,chrY,chrM). Note that this will affect all downstream variant calling, as variants will only be called on these primary chromosomes.<br />
Copy file name to clipboardExpand all lines: nextflow_schema.json
+17-2Lines changed: 17 additions & 2 deletions
Original file line number
Diff line number
Diff line change
@@ -536,6 +536,11 @@
536
536
"description": "Specifies whether to generate a concatenated VCF file containing both nuclear & mitochondrial snv calls",
537
537
"fa_icon": "fas fa-toggle-on"
538
538
},
539
+
"exclude_alt": {
540
+
"type": "boolean",
541
+
"description": "After aligning the reads to a reference, remove alignments to alt contigs using samtools view, retaining only primary chromosomes (GRCh37: 1-22,X,Y,MT / GRCh38: chr1-chr22,chrX,chrY,chrM).",
542
+
"fa_icon": "fas fa-toggle-on"
543
+
},
539
544
"extract_alignments": {
540
545
"type": "boolean",
541
546
"description": "After aligning the reads to a reference, extract alignments from specific regions/contigs and restrict the analysis to those regions/contigs.",
@@ -576,9 +581,14 @@
576
581
"description": "Specifies whether to run rtgtools' vcfeval",
577
582
"fa_icon": "fas fa-toggle-on"
578
583
},
579
-
"save_mapped_as_cram": {
584
+
"save_all_mapped_as_cram": {
585
+
"type": "boolean",
586
+
"description": "Specifies whether to generate and publish all (unfiltered) alignment files as cram instead of bam",
587
+
"fa_icon": "fas fa-toggle-on"
588
+
},
589
+
"save_noalt_mapped_as_cram": {
580
590
"type": "boolean",
581
-
"description": "Specifies whether to generate and publish alignment files as cram instead of bam",
591
+
"description": "Specifies whether to generate and publish alt-filtered alignment files as cram instead of bam. Requires exclude_alt to be set to true.",
582
592
"fa_icon": "fas fa-toggle-on"
583
593
},
584
594
"scatter_count": {
@@ -756,6 +766,11 @@
756
766
"hisat2": {
757
767
"type": "string"
758
768
},
769
+
"hisat2_build_memory": {
770
+
"type": "string",
771
+
"description": "Minimum memory required to build HISAT2 index with splice sites and exons. If available memory is below this threshold, a simpler index is built without splice sites.",
0 commit comments