Skip to content

Commit 343a0c8

Browse files
prototaxitesjfy133
andauthored
Apply suggestions from code review
Co-authored-by: James A. Fellows Yates <jfy133@gmail.com>
1 parent 03e5f05 commit 343a0c8

1 file changed

Lines changed: 8 additions & 6 deletions

File tree

docs/usage.md

Lines changed: 8 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -420,12 +420,14 @@ nf-core/mag integrates an additional subworkflow to validate ancient DNA _de nov
420420
## A note on coverage estimation
421421

422422
In order to run the binning tools included in the pipeline, MAG must first align reads back to the assemblies, and estimate the coverage of each contig.
423-
During the coverage estimation step, these alignments are by default filtered to retain alignments that have a percentage identity of 97% (i.e., of
424-
the base pairs that match between the read and the contig, 97% are identical). This value is a good default for short read Illumina data, however for
425-
certain long read technologies, the error rates in the reads can be much higher. For example, older Oxford Nanopore chemistries can have error rates approaching
426-
15% - 20%. If you are having trouble with the coverage estimation steps (for example, the output depths for each bin are all at or near zero), it may be worth
427-
manually adjusting this parameter, if it is appropriate for your data. You can do this by adjusting the `longread_percentidentity` and `shortread_percentidentity`
428-
parameters for long reads and short reads, respectively - for older ONT data, you may wish to look at values of around 85% to improve coverage estimation.
423+
424+
During the coverage estimation step, these alignments are by default filtered to retain alignments that have a percentage identity of 97% (i.e., of the base pairs that match between the read and the contig, 97% are identical). This value is a good default for short read Illumina data, however for certain long read technologies, the error rates in the reads can be much higher.
425+
For example, older Oxford Nanopore chemistries can have error rates approaching
426+
15% - 20%.
427+
428+
If you are having trouble with the coverage estimation steps (for example, the output depths for each bin are all at or near zero), it may be worth manually adjusting this parameter, if it is appropriate for your data.
429+
You can do this by adjusting the `--longread_percentidentity` and `--shortread_percentidentity` parameters for long reads and short reads, respectively.
430+
For older ONT data, you may wish to look at values of around 85% to improve coverage estimation.
429431

430432
## A note on bin refinement
431433

0 commit comments

Comments
 (0)