Wildtype sequencing-based count error correction by MaximilianStammnitz · Pull Request #44 · nf-core/deepmutscan

MaximilianStammnitz · 2026-03-21T17:58:16Z

PR checklist

…templates/environment.yml

MaximilianStammnitz · 2026-03-21T18:35:07Z

@BenjaminWehnert1008 here there'll be quite a few things still to discuss / edit:

How should the WT sequencing sample(s) be specified in the sample sheet .csv?
Should we run this seq error correction function (only takes a few seconds) as a loop over all samples in a single process, or rather as one process per sample?
Add this option to the nf-core input command (--wt_count_error_correction) and in that case there needs to be an extra check if the corresponding sequencing fastqs are actually specified in the sample sheet .csv
Need to generate the corresponding process / main.nf, potentially update config file(s), etc.
Need to adapt this R script so that the input path / output path / threshold parameter are correct
All the downstream tasks (e.g. heatmaps, fitness calculation, etc.) need to use variantCounts_filtered_by_library_err_corrected.csv instead of variantCounts_filtered_by_library.csv and variantCounts_for_heatmaps_err_corrected.csv instead of variantCounts_for_heatmaps.csv

MaximilianStammnitz added 4 commits March 21, 2026 18:54

Create wt_based_seq_error_correction.R

19461e9

Add files via upload

2a1585d

Add files via upload

496b9f8

Delete modules/local/dmsanalysis/wildtype_based_seq_error_correction/…

eb7d738

…templates/environment.yml

MaximilianStammnitz requested a review from BenjaminWehnert1008 March 21, 2026 17:58