Release v2.3.0 by charles-plessy · Pull Request #117 · nf-core/pairgenomealign

charles-plessy · 2026-06-03T07:31:17Z

Modules will be updated to new versions in release 3.0.0, together with strict syntax conversion.

v2.3.0 "Umi budou" - [June 3rd 2026]

`Added`

New --multi_cram option to produce a multi-query CRAM file combining all the alignments (#60).
New --multiqc_thumbs option to produce alignment thumbnails in the MultiQC report (#93).
New --strand option to index only one strand of the genome, which reduces memory usage at the expense of speed, and suppresses -/+ alignments (#97).
New --query and --queryName convenience options to skip samplesheet creation when there is only one query genome to align (#112).
In the GFF export format, the target genome sequence lengths are now exported in ##sequence-region fields (#70).

`Fixed`

Using the nf-core version of the FASTA_BGZIP_INDEX_DICT_SAMTOOLS subworkflow that we just contributed.
Check for input file existence in the parameter schema #73).

`Parameters`

Old parameter	New parameter
	`--multi_cram`
	`--multiqc_thumbs`
	`--query`
	`--queryName`
	`--strand`

`Dependencies`

Dependency	Old version	New version
`SAMTOOLS_BGZIP`	1.21
`SAMTOOLS_DICT`	1.21	1.23.1
`SAMTOOLS_FAIDX`	1.21	1.23.1
`SAMTOOLS_MERGE`		1.23.1
`HTSLIB_BGZIPTABIX`		1.23.1

PR checklist

Closes #97 To speed up alignment, both strands of the target genome are indexed. This doubles memory usage and may produce output files containing `-/+` alignments, which are not supported by some downstream pipelines. To disable this behavior, the `--strand forward` option is given.

Adds a new option `--multiqc_thumb` that defines a pixel size for alignment thumbnails to be displayed in the MultiQC report. Defaults to zero for no plots. Closes #93

@piplus2

The option `-w` is not available on Macintosh. Thanks @piplus2 for catching this issue.

Optional alignment thumbnails in the MultiQC report.

Allow single strand indexing.

Closes #112 This is inspired by nf-co.re/demultiplex, which also allows to bypass --input and provide single files directly.

@piplus2

Thansk @piplus2 for the suggestion.

Add a `--query` option for when there is only one query

…ence.

The merged CRAM file is neither a pangenome nor a multiple sequence alignment, but I find it very useful. Temporarly CRAM files are produced but not exported. Their header indicates only the name of the query genomes in the read group fields. The files are merged in a single CRAM file, where each read group represents one genome. Each target-query alignment is a one-to-one relationship so a base in the target is aligned at most once to each query. Care is taken to ensure that the path to the reference genome is relative to the current directory. The multi-query CRAM file is output in the same directory as its index and the BGZIpped genome, indexed too. Thus the multi-query CRAM file can be loaded and visualised in the IGV. The coverage plot shows how many query genomes align to the target at a given location. Expanded track view allows to visualise all the sequence differences. You can stabilise the order of the genomes, but IGV enforces alphanumeric sorting. You can work around this limitation by prefixing the sample IDs with numbers in the sample sheet. Custom scripts can (and have) be written to slice a pieces of the multi-query CRAM file and turn these pieces into real MSAs…

Will change to CRAM 3.1 in pairgenomealign 3.0.0.

Co-authored-by: Joon Klaps <joon.klaps@kuleuven.be>

…which I submitted recently based on the local version.

New `--multi_cram` option to produce a multi-query CRAM file combining all the alignments

…n GFF format. Closes #70

Co-authored-by: Mateus de Oliveira Lopes <lopes3137@gmail.com>

Co-authored-by: James A. Fellows Yates <jfy133@gmail.com>

Prepare 2.3.0

charles-plessy · 2026-06-04T00:35:17Z

Hi @muffato , as you have interest in genomics and CRAM files, I was wondering if you would be interested in reviewing this PR, where I use the new FASTA_BGZIP_INDEX_DICT_SAMTOOLS subworkflow and do many more exciting things such as representing pairwise genome alignments of multiple queries to a reference in a single CRAM file!

charles-plessy and others added 30 commits May 25, 2026 11:11

Prepare dev branch for new developments

3f23bf3

Optional alignment thumbnails in the MultiQC report.

60ad72d

Adds a new option `--multiqc_thumb` that defines a pixel size for alignment thumbnails to be displayed in the MultiQC report. Defaults to zero for no plots. Closes #93

Use base64 in a portable way.

f765267

The option `-w` is not available on Macintosh. Thanks @piplus2 for catching this issue.

Add a changelog entry.

5749ecb

Merge pull request #111 from nf-core/multiqc-thumbs-issue-93

715fd61

Optional alignment thumbnails in the MultiQC report.

Merge branch 'dev' into single-strand-indexing-issue-97

99683a5

Add a changelog entry.

a533634

Merge pull request #110 from nf-core/single-strand-indexing-issue-97

afd978c

Allow single strand indexing.

Add a --query option for when there is only one query

caf2432

Closes #112 This is inspired by nf-co.re/demultiplex, which also allows to bypass --input and provide single files directly.

Import samtools/merge module

db8002b

Error with clear message when both --input and --query are given.

5a2ffb7

Thansk @piplus2 for the suggestion.

Add a multi_cram option.

0ab27ac

Merge the fasta_bgzip_index_dict_samtools outputs in a single channel.

8a99fc1

Merge pull request #113 from nf-core/single-query-option-issue-112

63e7b75

Add a `--query` option for when there is only one query

Also output the dictionary file.

48fcd5a

Patch samtools/merge to preserve local paths to the reference.

f37dcf9

Correct default value of params.multi_cram, for use in if statements.

dcfa35f

Properly handle the case when maf-convert does not need a genome sequ…

e20e2ca

…ence.

Document the changes.

4a19cd1

Also update the subworkflow's snapshot.

0fcb2dc

Merge branch 'dev' into multi-cram-issue-60

c93da93

Fix changelog borken by merge

c870d97

prek run --show-diff-on-failure --color=always --all-files

c6cbffe

Use CRAM 3.0 to be consistent with maf-convert.

9f136d4

Will change to CRAM 3.1 in pairgenomealign 3.0.0.

Generate 4 channels at once.

cc1fd26

Co-authored-by: Joon Klaps <joon.klaps@kuleuven.be>

Use the 4 channels generated with multiMap.

220e3c2

Co-authored-by: Joon Klaps <joon.klaps@kuleuven.be>

Use the 4 channels generated with multiMap.

fbec929

Co-authored-by: Joon Klaps <joon.klaps@kuleuven.be>

Use the same bgzipped genome channel everywhere

2bfdc19

Co-authored-by: Joon Klaps <joon.klaps@kuleuven.be>

charles-plessy and others added 12 commits June 2, 2026 09:59

prek run --show-diff-on-failure --color=always --all-files

1ad1902

Simplify one if/else statement in just one if.

0e58188

Co-authored-by: Joon Klaps <joon.klaps@kuleuven.be>

Use nf-core's version of FASTA_BGZIP_INDEX_DICT_SAMTOOLS…

15df6b6

…which I submitted recently based on the local version.

Merge pull request #114 from nf-core/multi-cram-issue-60

8fac8f7

New `--multi_cram` option to produce a multi-query CRAM file combining all the alignments

Tubemap for version 2.3.0

5e80dd3

Export target genome sequence lenghts as ##sequence-region fields i…

354a1c6

…n GFF format. Closes #70

Fix 2.2.2 changelog.

7ddf1d1

Check for file existence in more inputs.

b0104dc

Co-authored-by: Mateus de Oliveira Lopes <lopes3137@gmail.com>

Fix changelog and prepare for 2.3.0 release

59bcec2

Fix typo.

e37b1e4

Co-authored-by: James A. Fellows Yates <jfy133@gmail.com>

Merge pull request #116 from nf-core/prepare-2.3.0

b66bfb4

Prepare 2.3.0

Release v2.3.0

7b43756

charles-plessy changed the title ~~Dev~~ Release v2.3.0 Jun 3, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Release v2.3.0#117

Release v2.3.0#117
charles-plessy wants to merge 42 commits into
masterfrom
dev

charles-plessy commented Jun 3, 2026 •

edited

Loading

Uh oh!

charles-plessy commented Jun 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

charles-plessy commented Jun 3, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

v2.3.0 "Umi budou" - [June 3rd 2026]

Added

Fixed

Parameters

Dependencies

PR checklist

Uh oh!

charles-plessy commented Jun 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

charles-plessy commented Jun 3, 2026 •

edited

Loading

`Added`

`Fixed`

`Parameters`

`Dependencies`