Skip to content

Error running cellrangerarc workflow with 10x Multiome data #463

Open
@maggiebr0wn

Description

@maggiebr0wn

Description of the bug

Title: Error running cellrangerarc workflow with 10x Multiome data

Description

I'm encountering persistent errors when trying to run the nf-core/scrnaseq pipeline with 10x Genomics Multiome (RNA + ATAC) data using the cellrangerarc aligner option.

Expected Behavior

According to the nf-core/scrnaseq documentation, the pipeline should support 10x Multiome data analysis with cellrangerarc when providing a CSV with the proper format including the fastq_barcode column.

Actual Behavior

The pipeline consistently fails with the error "Path value cannot be null" in the FASTQC step, despite using the exact format specified in the documentation. The pipeline also shows a warning about the fastq_barcode header being unidentified:

WARN: Found the following unidentified headers in /path/to/sample_sheet.csv:

  • fastq_barcode

Additional Information

I've verified all file paths exist and are readable. The issue appears to be with how the pipeline handles the fastq_barcode column, despite it being documented as required for cellrangerarc workflows.

Command used and terminal output

## Steps to Reproduce
1. Prepared a CSV file with the following columns (as specified in documentation):
   `sample,fastq_1,fastq_2,fastq_barcode,sample_type`

2. Included both ATAC (with fastq_barcode populated with I2 files) and GEX data (with empty fastq_barcode field)

3. Ran the pipeline with:

nextflow run nf-core/scrnaseq -r 4.0.0 
-profile apptainer 
--input /path/to/sample_sheet.csv 
--fasta /path/to/refdata-cellranger-arc-GRCh38-2020-A-2.0.0/fasta/genome.fa 
--gtf /path/to/refdata-cellranger-arc-GRCh38-2020-A-2.0.0/genes/genes.gtf 
--aligner cellrangerarc 
--save_align_intermeds true

4. Even tried with `--schema_ignore_params fastq_barcode` but still encountered the same error

## Environment
- nextflow version: 24.10.5
- nf-core/scrnaseq version: 4.0.0
- Containerization: Apptainer/Singularity
- Operating system: Linux

## Sample Input File (first few lines)
sample,fastq_1,fastq_2,fastq_barcode,sample_type
LB_brain25_9428_1_HPC,/path/to/LB_brain25_9428_1_HPC_MultiOmeATAC_R1_001.fastq.gz,/path/to/LB_brain25_9428_1_HPC_MultiOmeATAC_R2_001.fastq.gz,/path/to/LB_brain25_9428_1_HPC_MultiOmeATAC_I2_001.fastq.gz,atac
LB_brain25_9428_1_HPC,/path/to/LB_brain25_9428_1_HPC_MultiOmeGEX_R1_001.fastq.gz,/path/to/LB_brain25_9428_1_HPC_MultiOmeGEX_R2_001.fastq.gz,,gex

## Error Message

ERROR ~ Error executing process > 'NFCORE_SCRNASEQ:SCRNASEQ:FASTQC_CHECK (23)'
Caused by:
Path value cannot be null
Container:
/path/to/singularity/quay.io-biocontainers-fastqc-0.12.1--hdfd78af_0.img

Relevant files

No response

System information

No response

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions