Skip to content

Commit 3de7cbf

Browse files
committed
chore: update meta
1 parent 0711c02 commit 3de7cbf

10 files changed

Lines changed: 202 additions & 131 deletions

File tree

modules/local/align_bam/meta.yml

Lines changed: 50 additions & 39 deletions
Original file line numberDiff line numberDiff line change
@@ -1,84 +1,91 @@
1-
name: fgbio_fastqtobam
1+
name: align_bam
22
description: |
3-
Generates an unmapped BAM (or SAM or CRAM) file from fastq files.
3+
Aligns reads from an unmapped BAM: streams reads with samtools fastq, aligns
4+
with bwa mem, and merges alignment information back into the original records
5+
with fgbio ZipperBams, optionally sorting the output with samtools.
46
keywords:
7+
- bwa
58
- fgbio
69
- samtools
710
- zipperbams
8-
- mem
9-
- bwa
1011
- alignment
11-
- map
12-
- fastq
1312
- bam
14-
- sam
15-
-
1613
tools:
1714
- bwa:
1815
description: |
1916
BWA is a software package for mapping DNA sequences against
2017
a large reference genome, such as the human genome.
2118
homepage: http://bio-bwa.sourceforge.net/
22-
documentation: http://www.htslib.org/doc/samtools.html
19+
documentation: http://bio-bwa.sourceforge.net/bwa.shtml
2320
arxiv: arXiv:1303.3997
2421
licence: ["GPL-3.0-or-later"]
2522
- fgbio:
2623
description: A set of tools for working with genomic and high throughput sequencing data, including UMIs
2724
homepage: http://fulcrumgenomics.github.io/fgbio/
2825
documentation: http://fulcrumgenomics.github.io/fgbio/tools/latest/
2926
tool_dev_url: https://github.com/fulcrumgenomics/fgbio
30-
doi: ""
3127
licence: ["MIT"]
3228
- samtools:
3329
description: |
3430
SAMtools is a set of utilities for interacting with and post-processing
35-
short DNA sequence read alignments in the SAM, BAM and CRAM formats, written by Heng Li.
36-
These files are generated as output by short read aligners like BWA.
31+
short DNA sequence read alignments in the SAM, BAM and CRAM formats.
3732
homepage: http://www.htslib.org/
38-
documentation: hhttp://www.htslib.org/doc/samtools.html
33+
documentation: http://www.htslib.org/doc/samtools.html
3934
doi: 10.1093/bioinformatics/btp352
4035
licence: ["MIT"]
4136
input:
4237
- meta:
4338
type: map
44-
description: |
45-
Groovy Map containing sample information
46-
e.g. [ id:'test', single_end:false ]
39+
description: Groovy Map containing sample information, e.g. `[ id:'sample1' ]`
4740
- unmapped_bam:
4841
type: file
49-
description: the unmapped BAM/SAM/CRAM file.
50-
pattern: "*.{bam,cram,sam}"
42+
description: Unmapped BAM whose reads will be aligned and zipped back together
43+
pattern: "*.bam"
44+
- meta2:
45+
type: map
46+
description: Groovy Map containing reference information
5147
- fasta:
5248
type: file
5349
description: FASTA reference file
5450
pattern: "*.{fasta,fa}"
55-
- index:
51+
- meta3:
52+
type: map
53+
description: Groovy Map containing reference information
54+
- fasta_fai:
5655
type: file
57-
description: BWA genome index files
58-
pattern: "Directory containing BWA index *.{amb,ann,bwt,pac,sa}"
59-
- sort:
60-
type: bool
61-
description: True to sort the output in template-coordinate order, otherwise don't sort
62-
pattern: "{true,false}"
63-
56+
description: FASTA index (.fai) for the reference
57+
pattern: "*.fai"
58+
- meta4:
59+
type: map
60+
description: Groovy Map containing reference information
61+
- dict:
62+
type: file
63+
description: Sequence dictionary (.dict) for the reference
64+
pattern: "*.dict"
65+
- meta5:
66+
type: map
67+
description: Groovy Map containing reference information
68+
- bwa_dir:
69+
type: directory
70+
description: Directory containing the BWA index files (*.amb, *.ann, *.bwt, *.pac, *.sa)
71+
- sort_type:
72+
type: string
73+
description: |
74+
How to sort the aligned output. One of `none` (leave in input order),
75+
`coordinate` (coordinate sort and index), or `template-coordinate`.
76+
pattern: "{none,coordinate,template-coordinate}"
6477
output:
6578
- meta:
6679
type: map
67-
description: |
68-
Groovy Map containing sample information
69-
e.g. [ id:'test', single_end:false ]
70-
- version:
71-
type: file
72-
description: File containing software version
73-
pattern: "*.{version.yml}"
80+
description: Groovy Map containing sample information
7481
- bam:
7582
type: file
76-
description: Mapped bam (unsorted)
77-
pattern: "*.{bam}"
78-
79-
authors:
80-
- "@nh13"
81-
83+
description: Aligned BAM with consensus tags reversed/reverse-complemented, optionally sorted
84+
pattern: "*.mapped.bam"
85+
- bai:
86+
type: file
87+
description: BAM index, emitted when a coordinate sort with indexing is requested
88+
pattern: "*.mapped.bam.bai"
8289
containers:
8390
docker:
8491
linux/amd64:
@@ -90,3 +97,7 @@ containers:
9097
name: oras://community.wave.seqera.io/library/fgbio_bwa_samtools:ee569c458f161e6b
9198
linux/arm64:
9299
name: oras://community.wave.seqera.io/library/fgbio_bwa_samtools:b612d5cd7a652862
100+
authors:
101+
- "@nh13"
102+
maintainers:
103+
- "@nh13"

modules/local/fgbio/callandfilterduplexconsensusreads/meta.yml

Lines changed: 19 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -17,15 +17,32 @@ input:
1717
description: Groovy Map containing sample information, e.g. `[ id:'sample1' ]`
1818
- bam:
1919
type: file
20-
description: Input BAM file
20+
description: Input BAM file with UMI grouped reads, from fgbio's GroupReadsByUmi paired mode
2121
pattern: "*.bam"
22+
- fasta:
23+
type: file
24+
description: FASTA reference file, used by the filtering step
25+
pattern: "*.{fasta,fa}"
26+
- fasta_fai:
27+
type: file
28+
description: FASTA index (.fai) for the reference
29+
pattern: "*.fai"
30+
- min_reads:
31+
type: integer
32+
description: The minimum number of reads supporting a consensus base/read
33+
- min_baseq:
34+
type: integer
35+
description: The minimum base quality (used for both calling input and filtering)
36+
- max_base_error_rate:
37+
type: float
38+
description: The maximum raw-read error rate at a consensus base
2239
output:
2340
- meta:
2441
type: map
2542
description: Groovy Map containing sample information
2643
- bam:
2744
type: file
28-
description: Output BAM file
45+
description: Output BAM file containing filtered duplex consensus reads
2946
pattern: "*.bam"
3047
containers:
3148
docker:
@@ -40,7 +57,5 @@ containers:
4057
name: oras://community.wave.seqera.io/library/fgbio:2.5.21--61a21b7736fed99a
4158
authors:
4259
- "@nh13"
43-
- "@znorgaard"
4460
maintainers:
4561
- "@nh13"
46-
- "@znorgaard"

modules/local/fgbio/callandfiltermolecularconsensusreads/meta.yml

Lines changed: 19 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -17,15 +17,32 @@ input:
1717
description: Groovy Map containing sample information, e.g. `[ id:'sample1' ]`
1818
- bam:
1919
type: file
20-
description: Input BAM file
20+
description: Input BAM file with UMI grouped reads, from fgbio's GroupReadsByUmi
2121
pattern: "*.bam"
22+
- fasta:
23+
type: file
24+
description: FASTA reference file, used by the filtering step
25+
pattern: "*.{fasta,fa}"
26+
- fasta_fai:
27+
type: file
28+
description: FASTA index (.fai) for the reference
29+
pattern: "*.fai"
30+
- min_reads:
31+
type: integer
32+
description: The minimum number of reads supporting a consensus base/read
33+
- min_baseq:
34+
type: integer
35+
description: The minimum base quality (used for both calling input and filtering)
36+
- max_base_error_rate:
37+
type: float
38+
description: The maximum raw-read error rate at a consensus base
2239
output:
2340
- meta:
2441
type: map
2542
description: Groovy Map containing sample information
2643
- bam:
2744
type: file
28-
description: Output BAM file
45+
description: Output BAM file containing filtered single strand consensus reads
2946
pattern: "*.bam"
3047
containers:
3148
docker:
@@ -40,7 +57,5 @@ containers:
4057
name: oras://community.wave.seqera.io/library/fgbio:2.5.21--61a21b7736fed99a
4158
authors:
4259
- "@nh13"
43-
- "@znorgaard"
4460
maintainers:
4561
- "@nh13"
46-
- "@znorgaard"

modules/local/fgbio/callduplexconsensusreads/meta.yml

Lines changed: 8 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -17,15 +17,21 @@ input:
1717
description: Groovy Map containing sample information, e.g. `[ id:'sample1' ]`
1818
- bam:
1919
type: file
20-
description: Input BAM file
20+
description: Input BAM file with UMI grouped reads, from fgbio's GroupReadsByUmi paired mode
2121
pattern: "*.bam"
22+
- min_reads:
23+
type: integer
24+
description: The minimum number of input reads to a consensus read
25+
- min_baseq:
26+
type: integer
27+
description: The minimum input base quality for a base to contribute to a consensus
2228
output:
2329
- meta:
2430
type: map
2531
description: Groovy Map containing sample information
2632
- bam:
2733
type: file
28-
description: Output BAM file
34+
description: Output BAM file containing duplex consensus reads
2935
pattern: "*.bam"
3036
containers:
3137
docker:
@@ -40,7 +46,5 @@ containers:
4046
name: oras://community.wave.seqera.io/library/fgbio:2.5.21--61a21b7736fed99a
4147
authors:
4248
- "@nh13"
43-
- "@znorgaard"
4449
maintainers:
4550
- "@nh13"
46-
- "@znorgaard"

modules/local/fgbio/callmolecularconsensusreads/meta.yml

Lines changed: 8 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -17,15 +17,21 @@ input:
1717
description: Groovy Map containing sample information, e.g. `[ id:'sample1' ]`
1818
- bam:
1919
type: file
20-
description: Input BAM file
20+
description: Input BAM file with UMI grouped reads, from fgbio's GroupReadsByUmi
2121
pattern: "*.bam"
22+
- min_reads:
23+
type: integer
24+
description: The minimum number of input reads to a consensus read
25+
- min_baseq:
26+
type: integer
27+
description: The minimum input base quality for a base to contribute to a consensus
2228
output:
2329
- meta:
2430
type: map
2531
description: Groovy Map containing sample information
2632
- bam:
2733
type: file
28-
description: Output BAM file
34+
description: Output BAM file containing single strand consensus reads
2935
pattern: "*.bam"
3036
containers:
3137
docker:
@@ -40,7 +46,5 @@ containers:
4046
name: oras://community.wave.seqera.io/library/fgbio:2.5.21--61a21b7736fed99a
4147
authors:
4248
- "@nh13"
43-
- "@znorgaard"
4449
maintainers:
4550
- "@nh13"
46-
- "@znorgaard"

modules/local/fgbio/collectduplexseqmetrics/meta.yml

Lines changed: 14 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -17,16 +17,26 @@ input:
1717
description: Groovy Map containing sample information, e.g. `[ id:'sample1' ]`
1818
- bam:
1919
type: file
20-
description: Input BAM file
20+
description: BAM file with MI tags present and in Template Coordinate sort order, typically the output of fgbio GroupReadsByUMI in paired mode
2121
pattern: "*.bam"
2222
output:
2323
- meta:
2424
type: map
2525
description: Groovy Map containing sample information
26-
- bam:
26+
- metrics:
2727
type: file
28-
description: Output BAM file
29-
pattern: "*.bam"
28+
description: |
29+
Duplex sequencing QC metric files, sharing the output prefix:
30+
`*.family_sizes.txt` (frequency of tag families by size),
31+
`*.duplex_family_sizes.txt` (frequency of duplex tag families by observations from each strand),
32+
`*.duplex_yield_metrics.txt` (summary QC metrics computed across 5%..100% subsamples of the data),
33+
`*.umi_counts.txt` (frequency of UMI observations within reads and tag families), and
34+
`*.duplex_umi_counts.txt` (frequency of duplex UMI observations; emitted because `--duplex-umi-counts=true`).
35+
pattern: "*duplex_seq_metrics*.txt"
36+
- pdf:
37+
type: file
38+
description: PDF of plots generated from the duplex sequencing QC metrics files
39+
pattern: "*duplex_qc.pdf"
3040
containers:
3141
docker:
3242
linux/amd64:
@@ -40,7 +50,5 @@ containers:
4050
name: oras://community.wave.seqera.io/library/fgbio:2.5.21--61a21b7736fed99a
4151
authors:
4252
- "@nh13"
43-
- "@znorgaard"
4453
maintainers:
4554
- "@nh13"
46-
- "@znorgaard"

modules/local/fgbio/correctumis/meta.yml

Lines changed: 21 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -17,16 +17,34 @@ input:
1717
description: Groovy Map containing sample information, e.g. `[ id:'sample1' ]`
1818
- bam:
1919
type: file
20-
description: Input BAM file
20+
description: Input BAM file with UMIs in the RX tag
2121
pattern: "*.bam"
22+
- umi_file:
23+
type: file
24+
description: File listing the expected UMI sequences, one per line
25+
pattern: "*.{txt,fasta,fa}"
26+
- max_mismatches:
27+
type: integer
28+
description: The maximum number of mismatches between a read's UMI and an expected UMI
29+
- min_distance:
30+
type: integer
31+
description: The minimum edit distance to the second-best expected UMI to accept a correction
2232
output:
2333
- meta:
2434
type: map
2535
description: Groovy Map containing sample information
2636
- bam:
2737
type: file
28-
description: Output BAM file
29-
pattern: "*.bam"
38+
description: BAM with corrected UMIs (reads whose UMIs were matched/corrected)
39+
pattern: "*.corrected.bam"
40+
- rejects:
41+
type: file
42+
description: BAM containing reads whose UMIs could not be matched to an expected UMI
43+
pattern: "*.rejected.bam"
44+
- metrics:
45+
type: file
46+
description: Metrics describing the frequency with which each expected UMI was observed
47+
pattern: "*.correct-umis-metrics.txt"
3048
containers:
3149
docker:
3250
linux/amd64:
@@ -40,7 +58,5 @@ containers:
4058
name: oras://community.wave.seqera.io/library/fgbio:2.5.21--61a21b7736fed99a
4159
authors:
4260
- "@nh13"
43-
- "@znorgaard"
4461
maintainers:
4562
- "@nh13"
46-
- "@znorgaard"

0 commit comments

Comments
 (0)