Skip to content

Fix/fix preprocessing 20211119#9

Open
cschu wants to merge 36 commits intomasterfrom
fix/fix_preprocessing_20211119
Open

Fix/fix preprocessing 20211119#9
cschu wants to merge 36 commits intomasterfrom
fix/fix_preprocessing_20211119

Conversation

@cschu
Copy link
Copy Markdown
Member

@cschu cschu commented Nov 19, 2021

No description provided.

cschu and others added 30 commits November 10, 2021 12:46
* R1/R2 lengths are now assessed and homogenised separately (i.e. they can have different lengths as supported by Figaro)
* initial read length distribution assessment is now performed by fastqc
* fastq filenames are normalised to R1/R2 naming scheme
* removed pair_id from sample meta information
added gaga2 workflow diagram
added workflow diagram
* paired-end reads are now filtered by whether they're spanning the amplicon length (figaro-requirement)
* single-end reads are not filtered
* fixed issue with homogeneous length trimming
* removed check for preprocessed reads that would prevent preprocessing for 'garbage' data
* read length distributions are now obtained from bbduk histograms (non-binned)
* experimental: allow shorter reads instead of forcing to completely cover amplicon
* version 0.5
* if reads are not covering the full amplicon size, a shortened amplicon size is provided to figaro (experimental)
* bbduk length histograms (instead of fastqc) are now provided to read length assessment
* sample classification was (temporarily?) moved into the fastq-collection Channel (due to issues with nf 21.10+)
* resolved a data flow issue that would allow dada2 processes to start before the preprocessing is finished
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant