Skip to content

Commit 39eafa6

Browse files
authored
Merge pull request #141 from CDCgov/v1.6.3
v1.6.3
2 parents f01ec0a + 7a8514d commit 39eafa6

File tree

5 files changed

+11
-7
lines changed

5 files changed

+11
-7
lines changed

CHANGELOG.md

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -2,6 +2,10 @@
22

33
The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.0.0/).
44

5+
## v1.6.3 - [07/25/2025]
6+
### Fixed
7+
- [Pre-MycoSNP workflow] GC content calculation for closest GAMBIT match (`Reference_GC` field in pre-mycosnp-summary.csv) excluded lowercase (soft-masked) bases, i.e. "c" and "g", leading to underestimates for `Reference_GC`. This fix includes lowercase "g" and "c" in the GC content calculation in `bin/pre-mycosnp-stats.sh`, for both the closest GAMBIT match and the sample assembly. However, the sample assembly shouldn't contain any soft-masked bases to begin with, so this shouldn't result in any differences in `Sample_Assembly_GC`.
8+
59
## v1.6.2 - [05/18/2025]
610
### Fixed
711
- Docker permissions issue with GAMBIT container in some Linux environments (issue [#137](https://github.com/CDCgov/mycosnp-nf/issues/137))

bin/pre-mycosnp-stats.sh

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -1,11 +1,11 @@
11
#!/bin/bash
22

3-
# pre-mycosnp-stats.sh v1.1.0
3+
# pre-mycosnp-stats.sh v1.1.1
44
# Authors:
55
# Jared Johnson, [email protected]
66
# Zack Mudge, [email protected]
77

8-
version="v1.1.0"
8+
version="v1.1.1"
99

1010
#----- HELP & VERSION -----#
1111
# help message
@@ -72,7 +72,7 @@ if [[ -n "$ref" ]]; then # Check if ref is provided (not empty)
7272
ref_length=$(cat "${ref%.gz}" | grep -v ">" | tr -d '\n\t\r ' | wc -c)
7373

7474
# reference %GC, to 2 decimal places
75-
ref_gc_count=$(cat "${ref%.gz}" | grep -v ">" | grep -Eo "G|C" | wc -l)
75+
ref_gc_count=$(cat "${ref%.gz}" | grep -v ">" | grep -Eo "G|C|g|c" | wc -l)
7676
ref_gc_perc=$(echo -e "${ref_gc_count}\t${ref_length}" | awk '{printf "%.2f", 100*$1/$2}')
7777

7878
# estimated average depth of coverage, to 1 decimal place
@@ -85,7 +85,7 @@ else
8585
fi
8686

8787
# sample assembly % GC, to 2 decimal places
88-
sample_gc_count=$(cat "${assembly%.gz}" | grep -v ">" | grep -Eo "G|C" | wc -l)
88+
sample_gc_count=$(cat "${assembly%.gz}" | grep -v ">" | grep -Eo "G|C|g|c" | wc -l)
8989
sample_gc_perc=$(echo -e "${sample_gc_count}\t${sample_length}" | awk '{printf "%.2f", 100*$1/$2}')
9090

9191
#----- OUTPUT -----#

docs/output.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -99,7 +99,7 @@ Output files: `combined/pre-mycosnp_summary/pre-mycosnp-summary.csv`
9999
| Sample_Assembly_GC | GC percentage of Shovill assembly | |
100100
| Reference_Genome_Length | Length of the reference genome of the closest GAMBIT entry (if a prediction is made) | |
101101
| Avg_Depth_Coverage | (Total trimmed bases / `Reference_Genome_Length`) | |
102-
| Reference_GC | GC percentage of the reference genome of the closest GAMBIT entry (if a prediction is made) | |
102+
| Reference_GC | GC percentage of the reference genome of the closest GAMBIT entry (if a prediction is made). Includes soft-masked bases i.e. lowercase "g" and "c". | |
103103

104104
### MultiQC
105105
- Outputs follow the same [pattern described for the main MycoSNP workflow](#multiqc-1).

docs/params.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -7,7 +7,7 @@
77
|\ | |__ __ / ` / \ |__) |__ } {
88
| \| | \__, \__/ | \ |___ \`-._,-`-,
99
`._,._,'
10-
CDCgov/mycosnp-nf v1.6.2
10+
CDCgov/mycosnp-nf v1.6.3
1111
------------------------------------------------------
1212
Typical pipeline command:
1313

nextflow.config

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -230,7 +230,7 @@ manifest {
230230
description = 'MycoSNP is a portable workflow for performing whole genome sequencing analysis of fungal organisms, including Candida auris. This method prepares the reference, performs quality control, and calls variants using a reference. MycoSNP generates several output files that are compatible with downstream analytic tools, such as those for used for phylogenetic tree-building and gene variant annotations.'
231231
mainScript = 'main.nf'
232232
nextflowVersion = '!>=21.10.3'
233-
version = '1.6.2'
233+
version = '1.6.3'
234234
}
235235

236236
// Load modules.config for DSL2 module specific options

0 commit comments

Comments
 (0)