Skip to content

Commit 8bb0082

Browse files
committed
Document legacy STAR iGenomes guidance
1 parent 806b52f commit 8bb0082

2 files changed

Lines changed: 27 additions & 0 deletions

File tree

docs/usage.md

Lines changed: 22 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -378,6 +378,28 @@ sample,multiplexed_sample_id,probe_barcode_ids,cmo_ids,ocm_ids,description
378378

379379
> You must provide the barcodes CSV with the `--cellranger_multi_barcodes` parameter.
380380
381+
## Reference genome options
382+
383+
The pipeline can resolve reference files from `conf/igenomes.config` when you provide `--genome`, for example `--genome GRCh38`. These entries may include pre-built aligner indices such as STAR indices, depending on the selected genome.
384+
385+
Some AWS iGenomes STAR indices were generated with older STAR versions and contain legacy metadata. nf-core/scrnaseq includes a compatibility step for these configured iGenomes entries so that legacy STAR indices can run with the STAR version shipped by the pipeline. This support is intended to keep existing iGenomes usage working, not to make legacy indices the preferred reference for new analyses.
386+
387+
> [!WARNING]
388+
> For production runs, we recommend building fresh indices from current reference files instead of relying on legacy AWS iGenomes indices. The nf-core [reference genome documentation](https://nf-co.re/docs/running/reference-genomes) warns that AWS iGenomes annotations are significantly outdated, for example human annotations from Ensembl release 75, and that GRCh38 iGenomes uses the NCBI assembly rather than the masked Ensembl assembly.
389+
390+
To generate and keep a STAR index for future runs, provide current FASTA and GTF files and set `--save_reference`:
391+
392+
```bash
393+
nextflow run nf-core/scrnaseq \
394+
--input samplesheet.csv \
395+
--outdir results \
396+
--aligner star \
397+
--fasta reference.fa.gz \
398+
--gtf annotation.gtf.gz \
399+
--save_reference \
400+
-profile docker
401+
```
402+
381403
## Running the pipeline
382404

383405
The minimum typical command for running the pipeline is as follows:

subworkflows/local/starsolo/main.nf

Lines changed: 5 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -44,6 +44,11 @@ workflow STARSOLO {
4444
// STAR 2.7.4a+ requires. Modern indices skip the adapter entirely.
4545
def ch_star_raw = channel.value([ [:], file(star_index, checkIfExists: true) ])
4646
if (star_index_legacy) {
47+
log.warn(
48+
"Using a legacy AWS iGenomes STAR index. nf-core/scrnaseq will update the STAR metadata for " +
49+
"compatibility, but we recommend regenerating the index for production runs. See " +
50+
"https://nf-co.re/scrnaseq/dev/docs/usage#reference-genome-options"
51+
)
4752
STAR_GENOMEPARAMS_UPGRADE(ch_star_raw)
4853
ch_versions = ch_versions.mix(STAR_GENOMEPARAMS_UPGRADE.out.versions_gawk)
4954
ch_star_index = STAR_GENOMEPARAMS_UPGRADE.out.index

0 commit comments

Comments
 (0)