This repository contains a Snakemake pipeline and a corresponding conda environment file for performing metagenomic assembly, binning, and phylogenomic analysis, with a focus on intracellular parasites and other organisms with reduced genomes.
- Snakemake
- Python 3.8+
- Conda (optional, but recommended)
Clone this repository:
git clone https://github.com/mikeraiko/ParaMAG.git
cd ParaMAG
Prepare your input files and create a minimal config file config.yaml like this:
reads1: data/sample_R1.fastq.gz
reads2: data/sample_R2.fastq.gz
adapter_file: adapters/TruSeq3-PE.faThen run the wrapper script:
python paramag.py --run --config config.yaml
To test the pipeline:
python paramag.py --example
This will automatically run the full workflow on a small test dataset.
For help and additional options:
python paramag.py --help
maxbin_output/: binned genomesbusco_output/: completeness metricssuper_tree_output.tre: consensus treereport/summary_report.md: summary report