You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/usage.md
+8-6Lines changed: 8 additions & 6 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -420,12 +420,14 @@ nf-core/mag integrates an additional subworkflow to validate ancient DNA _de nov
420
420
## A note on coverage estimation
421
421
422
422
In order to run the binning tools included in the pipeline, MAG must first align reads back to the assemblies, and estimate the coverage of each contig.
423
-
During the coverage estimation step, these alignments are by default filtered to retain alignments that have a percentage identity of 97% (i.e., of
424
-
the base pairs that match between the read and the contig, 97% are identical). This value is a good default for short read Illumina data, however for
425
-
certain long read technologies, the error rates in the reads can be much higher. For example, older Oxford Nanopore chemistries can have error rates approaching
426
-
15% - 20%. If you are having trouble with the coverage estimation steps (for example, the output depths for each bin are all at or near zero), it may be worth
427
-
manually adjusting this parameter, if it is appropriate for your data. You can do this by adjusting the `longread_percentidentity` and `shortread_percentidentity`
428
-
parameters for long reads and short reads, respectively - for older ONT data, you may wish to look at values of around 85% to improve coverage estimation.
423
+
424
+
During the coverage estimation step, these alignments are by default filtered to retain alignments that have a percentage identity of 97% (i.e., of the base pairs that match between the read and the contig, 97% are identical). This value is a good default for short read Illumina data, however for certain long read technologies, the error rates in the reads can be much higher.
425
+
For example, older Oxford Nanopore chemistries can have error rates approaching
426
+
15% - 20%.
427
+
428
+
If you are having trouble with the coverage estimation steps (for example, the output depths for each bin are all at or near zero), it may be worth manually adjusting this parameter, if it is appropriate for your data.
429
+
You can do this by adjusting the `--longread_percentidentity` and `--shortread_percentidentity` parameters for long reads and short reads, respectively.
430
+
For older ONT data, you may wish to look at values of around 85% to improve coverage estimation.
0 commit comments