You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
1.[Classify viruses into species and genera using the ICTV standards](#61-classify-viruses-into-species-and-genera-using-the-ictv-standards)
28
25
2.[Assign viral contigs into vOTUs using the MIUViG standards](#62-assign-viral-contigs-into-votus-using-the-miuvig-standards)
29
26
3.[Dereplicate genomes](#63-dereplicate-genomes)
30
27
4.[Calculate pairwise similarities between all-versus-all genomes](#64-calculate-pairwise-similarities-between-all-versus-all-genomes)
31
28
5.[Process large datasets](#65-process-large-datasets)
32
-
7.[Tests](#7-test)
33
-
8.[Cite](#8-cite)
34
-
9.[License](#9-license)
29
+
7.[Limitations](#7-limitations)
30
+
8.[Tests](#8-test)
31
+
9.[Cite](#9-cite)
32
+
10.[License](#10-license)
35
33
36
34
37
35
## 1. Features
@@ -410,21 +408,24 @@ The following commands help reduce RAM usage and hard disk storage, making them
410
408
```
411
409
412
410
> [!NOTE]
413
-
> Vclust is efficient for comparing genome sequences of diverse viruses across a wide range of sequence identities. However, its computational performance may decline with very large datasets of highly similar or nearly identical genomes (e.g., tens of thousands from the same species). After prefiltering, numerous sequence pairs may still require alignment and clustering, leading to increased RAM usage and longer run times.
411
+
> Please see: 7. Limitations
414
412
413
+
## 7. Limitations
415
414
416
-
## 7. Test
415
+
Vclust is efficient for comparing genome sequences of diverse viruses across a wide range of sequence identities. However, RAM usage and running time may increase drastically in case of very large datasets of highly similar or nearly identical genomes (e.g., hundreds of thousands from the same species).
416
+
417
+
## 8. Test
417
418
418
419
To ensure that Vclust works as expected, you can run tests using [pytest](https://docs.pytest.org/).
419
420
420
421
```bash
421
422
pytest test.py
422
423
```
423
424
424
-
## 8. Cite
425
+
## 9. Cite
425
426
426
-
Zielezinski A, Gudyś A, Barylski J, Siminski K, Rozwalak P, Dutilh BE, Deorowicz S. *Ultrafast and accurate sequence alignment and clustering of viral genomes*. bioRxiv [[doi](https://google.pl)][[pubmed](https://google.pl)].
427
+
Zielezinski A, Gudyś A, Barylski J, Siminski K, Rozwalak P, Dutilh BE, Deorowicz S. *Ultrafast and accurate sequence alignment and clustering of viral genomes*. bioRxiv [[doi:10.1101/2024.06.27.601020](https://www.biorxiv.org/content/10.1101/2024.06.27.601020)].
427
428
428
-
## 9. License
429
+
## 10. License
429
430
430
431
[GNU General Public License, version 3](https://www.gnu.org/licenses/gpl-3.0.html)
0 commit comments