This pipeline is a bash script that automates quality control, variant calling, and drug resistance gene annotation for Klebsiella pneumoniae whole-genome sequencing data. It leverages tools such as FastQC, MultiQC, and Snippy to perform SNP calling and identify mutations associated with antibiotic resistance.
- Performs quality checks using FastQC
- Generates combined quality reports using MultiQC
- Calls SNPs using Snippy with a reference genome
- Detects and annotates drug resistance mutations
- Outputs drug-specific resistance summaries per sample
-
A
list.txtfile with one sample name per line (without file extensions). -
Paired-end FASTQ files for each sample named as:
sample_1.fastq.gzsample_2.fastq.gz
-
A GenBank reference file for Klebsiella pneumoniae (update the path in the script).
- Make the script executable:
chmod +x klebsiella_pipeline.sh- Run the script from within your working directory:
./klebsiella_pipeline.shEnsure the script is located in the same directory as your FASTQ files and list.txt.
For each sample, the script generates:
sample.snps/directory containing SNP calling outputsample.Drug-Resistance-Details.txt: Detected resistance mutations- Drug-specific summaries such as:
sample.Aminoglycoside.Resistance.txtsample.Beta-lactam.Resistance.txtsample.Fluoroquinolone.Resistance.txt, etc.
Empty files are automatically removed to declutter the output.
| Drug class | Genes detected |
|---|---|
| Aminoglycosides | aacA4, rmtC |
| Beta-lactams | bla |
| Chloramphenicol | ramR |
| Fluoroquinolones | ramR, oqxA, oqxB, gyrA, parC |
| Fosfomycin | fosA |
| Polymyxins | mgrB |
| Sulfonamides | sul1 |
| Tetracyclines | tet |
| Macrolides | mph |
"Bioinformatics is not something you are taught, it’s a way of life."
📧 Email:
This script is provided under the MIT License.