You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: CHANGELOG.md
+15Lines changed: 15 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -3,6 +3,21 @@
3
3
The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.0.0/)
4
4
and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html).
5
5
6
+
## dev - unreleased
7
+
8
+
### `Added`
9
+
10
+
-[#799](https://github.com/nf-core/mag/pull/799) - Add `--cat_classify_unbinned`, to enable taxonomic classification of unbinned contigs using CAT (by @dialvarezs)
11
+
12
+
### `Changed`
13
+
-[#799](https://github.comf/nf-core/mag/pull/799) - Migrate from local CAT modules to nf-core ones, updating version (by @dialvarezs)
Copy file name to clipboardExpand all lines: docs/usage.md
+10-5Lines changed: 10 additions & 5 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -127,21 +127,26 @@ The lineage directory (e.g., `bacteria_odb12`) should contain files such as `dat
127
127
Then, you must provide `--busco_db <your_db>/` and `--busco_db_lineage <downloaded_lineage>` to the pipeline.
128
128
You can also pass to `--busco_db` a URL pointing to a lineage tarball, or the tarball itself if stored locally.
129
129
130
-
> [!WARNING]
130
+
> [!WARNING]
131
131
> When any kind of parameter is provided to `--busco_db`, BUSCO will run in offline mode.
132
132
> If the lineage specified via `--busco_db_lineage` is not found locally, or if you attempt automatic lineage selection without having a complete lineage dataset pre-downloaded, BUSCO will fail.
133
133
134
134
### CAT
135
135
136
-
> [!WARNING]
137
-
> This database is very large at ~180 GB!
136
+
> [!WARNING]
137
+
> This database (CAT_nr) is very large at ~200 GB!
138
138
> This can take a long time, so we strongly recommend downloading and unzipping prior the pipeline run.
139
139
140
-
This database can be downloaded from the [CAT developers' website](https://tbb.bio.uu.nl/bastiaan/CAT_prepare/), which is based in the Netherlands (and could be slow for other regions of the world).
140
+
This database can be downloaded from the [CAT developers' website](https://tbb.bio.uu.nl/tina/CAT_pack_prepare/), which is based in the Netherlands (and could be slow for other regions of the world).
141
+
142
+
> [!NOTE]
143
+
Only the `CAT_nr` database is supported by the pipeline.
144
+
145
+
Enabling the `--cat_db_generate` option will create a new database using the latest version of the NCBI nr database. This requires a large download (over 200 GB) and several hours of subsequent processing. If you enable the `--save_cat_db` option, the database will be saved in the `Taxonomy/CAT/db` directory and can be reused in future runs by specifying its path with `--cat_db`.
141
146
142
147
### GTDB
143
148
144
-
> [!WARNING]
149
+
> [!WARNING]
145
150
> This database is very large at ~110 GB!
146
151
> This can take a long time, so we strongly recommend downloading and unzipping prior the pipeline run.
Copy file name to clipboardExpand all lines: nextflow_schema.json
+5Lines changed: 5 additions & 0 deletions
Original file line number
Diff line number
Diff line change
@@ -520,6 +520,11 @@
520
520
"type": "boolean",
521
521
"description": "Only return official taxonomic ranks (Kingdom, Phylum, etc.) when running CAT."
522
522
},
523
+
"cat_classify_unbinned": {
524
+
"type": "boolean",
525
+
"description": "Classify unbinned contigs with CAT (contig mode).",
526
+
"help_text": "This will run CAT classification on all unbinned contigs, which can be useful to get an idea of the taxonomic composition of the unbinned contigs."
527
+
},
523
528
"skip_gtdbtk": {
524
529
"type": "boolean",
525
530
"description": "Skip the running of GTDB, as well as the automatic download of the database"
0 commit comments