Skip to content

Commit a29bc67

Browse files
committed
precom busco documentation
1 parent 3652256 commit a29bc67

File tree

4 files changed

+38
-2
lines changed

4 files changed

+38
-2
lines changed

.github/workflows/ci.yml

Lines changed: 7 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -41,3 +41,10 @@ jobs:
4141
# Remember that you can parallelise this by using strategy.matrix
4242
run: |
4343
nextflow run ${GITHUB_WORKSPACE} -profile test,docker --outdir ./results
44+
45+
- name: Run pipeline with test data and precomputed BUSCOs
46+
# You can customise CI pipeline run tests as required
47+
# For example: adding multiple test runs with different parameters
48+
# Remember that you can parallelise this by using strategy.matrix
49+
run: |
50+
nextflow run ${GITHUB_WORKSPACE} -profile test_nobusco,docker --outdir ./results

CHANGELOG.md

Lines changed: 9 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -14,6 +14,7 @@ The pipeline is now considered to be a complete and suitable replacement for the
1414
- Updated the Blastn settings to allow 7 days runtime at most, since that
1515
covers 99.7% of the jobs.
1616
- Allow database inputs to be optionally compressed (`.tar.gz`)
17+
- Allow `BUSCO` run outputs to be optionally pre-computed and provided with `--busco_output`
1718

1819
### Software dependencies
1920

@@ -27,6 +28,14 @@ Note, since the pipeline is using Nextflow DSL2, each process will be run with i
2728
| multiqc | 1.20 and 1.21 | 1.20 and 1.25.1 |
2829
| samtools | 1.18 and 1.19.2 | 1.20 and 1.21 |
2930

31+
### Parameters
32+
33+
| Old parameter | New parameter |
34+
| ------------- | ---------------------- |
35+
| | --busco-output |
36+
37+
> **NB:** Parameter has been **updated** if both old and new parameter information is present. </br> **NB:** Parameter has been **added** if just the new parameter information is present. </br> **NB:** Parameter has been **removed** if new parameter information isn't present.
38+
3039
## [[0.6.0](https://github.com/sanger-tol/blobtoolkit/releases/tag/0.6.0)] – Bellsprout – [2024-09-13]
3140

3241
The pipeline has now been validated for draft (unpublished) assemblies.

conf/test_nobusco.config

Lines changed: 4 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -37,8 +37,10 @@ params {
3737
blastn = "https://tolit.cog.sanger.ac.uk/test-data/Meles_meles/resources/nt_mMelMel3.1.tar.gz"
3838

3939
// Precomputed BUSCO outputs
40-
// Note we deliberately left out archaea_odb10 to test the addition of missing lineages
41-
busco_output = "busco_output"
40+
// busco_output_noArchaea.tar.gz deliberately leaves out archaea_odb10 to test the pipeline's detection and filling of missing lineages
41+
// Switch to *_busco_output.tar.gz for fully precomputed BUSCOs
42+
busco_output = "https://tolit.cog.sanger.ac.uk/test-data/Meles_meles/resources/GCA_922984935.2_busco_output_noArchaea.tar.gz"
43+
//busco_output = "https://tolit.cog.sanger.ac.uk/test-data/Meles_meles/resources/GCA_922984935.2_busco_output.tar.gz"
4244

4345
// Need to be set to avoid overfilling /tmp
4446
use_work_dir_as_temp = true

docs/usage.md

Lines changed: 18 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -54,6 +54,24 @@ An [example samplesheet](assets/test/samplesheet.csv) has been provided with the
5454
The pipeline can also accept a samplesheet generated by the [nf-core/fetchngs](https://nf-co.re/fetchngs) pipeline (tested with version 1.11.0).
5555
The pipeline then needs the `--fetchngs_samplesheet true` option _and_ `--align true`, since the data files would all be unaligned.
5656

57+
### Support for pre-computed `BUSCO` outputs
58+
59+
The pipeline may be optionally run with a set of pre-computed [`BUSCO`](https://busco.ezlab.org) runs, provided using the `--busco_output` parameter. These can be provided as either a directory path, or a `.tar.gz` compressed archive. The contents should be each `run_` output directory (directly from `BUSCO`) named as `run_[odb_dabasase_name]`:
60+
61+
```
62+
GCA_922984935.2_busco_output/
63+
├── run_archaea_odb10
64+
├── run_bacteria_odb10
65+
├── run_carnivora_odb10
66+
├── run_eukaryota_odb10
67+
├── run_eutheria_odb10
68+
├── run_laurasiatheria_odb10
69+
├── run_mammalia_odb10
70+
├── run_metazoa_odb10
71+
├── run_tetrapoda_odb10
72+
└── run_vertebrata_odb10
73+
```
74+
5775
## Database parameters
5876

5977
Configure access to your local databases with the `--busco`, `--blastp`, `--blastx`, `--blastn`, and `--taxdump` parameters.

0 commit comments

Comments
 (0)