Bacterial Genome Assembly and Annotation Pipeline

This Nextflow pipeline automates assembly, polishing, annotation, and quality assessment of bacterial genomes using both long and short read data. The workflow integrates raw data quality control, genome assembly, read alignment, polishing, annotation, and evaluation against a reference genome.

Modules Used

The pipeline utilizes the following modules, found in the modules folder:

FASTQC: Quality control for short read data
FILTLONGER: Filtering long reads
FLYE: Long read genome assembly
BOWTIE2_INDEX: Indexing for short read alignment
BOWTIE2_ALIGN: Aligning short reads to assemblies
SAMTOOLS_SORT: Sorting and indexing alignments
PILON: Genome polishing using aligned short reads
PROKKA: Genome annotation
BUSCO: Genome completeness assessment
BUSCO_PLOT: Visualization of BUSCO results
NCBI_DATASETS: Downloading reference genomes
QUAST & QUAST_UNPOLISHED: Assembly quality evaluation (for polished and unpolished assemblies)

Folder Structure

main.nf: The central Nextflow pipeline script.
modules/: Contains all process definitions as separate modules.

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
envs		envs
modules		modules
.gitignore		.gitignore
Project1Report.html		Project1Report.html
Project1Report.ipynb		Project1Report.ipynb
Project1ReportLaTex.pdf		Project1ReportLaTex.pdf
README.md		README.md
bac_samples.csv		bac_samples.csv
busco_figure.png		busco_figure.png
circos_plot.png		circos_plot.png
full_pipeline.png		full_pipeline.png
main.nf		main.nf
nextflow.config		nextflow.config
report-20250921-59829220.html		report-20250921-59829220.html
week1.nf		week1.nf
week2.nf		week2.nf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Bacterial Genome Assembly and Annotation Pipeline

Modules Used

Folder Structure

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Bacterial Genome Assembly and Annotation Pipeline

Modules Used

Folder Structure

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages