Home

Welcome to the FastqToGeneCounts wiki!

This section is partially completed, please check back later

This is a snakemake workflow that aims to do several things, using as much parallelization as possible:

Given a CSV file containing: SRR Codes, a target output name, and Paired End or Single End reads
Generate genome files using STAR
Download each SRR code in parallel using prefetch
Unpack the .sra files using parallel-fastq-dump, generating .fastq.gz files
Optionally trim the resulting .fastq.gz files (using Trim Galore)
Perform FastQC on the parallel-fastq-dump files, and optionally on the resulting trimmed files
Perform STAR align on files from parallel-fastq-dump (or trim) files to the generated genome files
Perform MultiQC, using the files from parallel-fastq-dump, FastQC, and STAR algner

Ultimately, the results of this project will be a series of .fastq.gz files available, along with reports from FastQC and MultiQC. The .fastq.gz files may be used in further analysis

Sections

(Getting started)[Getting-Started]
Downloading
Installing
Running

Created by Josh Loecker and Brandt Bessell

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Home

Welcome to the FastqToGeneCounts wiki!

Sections

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Clone this wiki locally