This repository contains exploratory metagenomic and metatranscriptomic analyses performed on a methane-oxidizing bacterial community enriched in low iron nitrate mineral salts medium under methane gas from a Lake Washington sediment inoculum.
The analyses in this repository were conducted during the early stages of a project that ultimately resulted in the publication:
Robes JMD, Liebergesell TCE, Beals DG, Yu X, Brazelton WJ, Puri AW. 2025. Inverse stable isotope probing–metabolomics (InverSIP) identifies an iron acquisition system in a methane-oxidizing bacterial community. PNAS.
Unlike the code associated with the final publication, this repository primarily contains exploratory workflows used to evaluate metagenomic assemblies, metatranscriptomic mapping strategies, metagenome-assembled genomes (MAGs), biosynthetic gene clusters (BGCs), and antiSMASH-derived secondary metabolite predictions.
-
Metatranscriptome_processing/
- Bash scripts for adapter trimming, quality control, read mapping, and transcriptome processing.
-
Additional scripts and notebooks
- Exploratory analyses investigating MAGs, BGC expression, antiSMASH predictions, and BiG-MAP outputs.
This repository contains exploratory analyses that informed later project development but does not represent the final computational workflow used in the associated publication. Raw sequencing data are not included. Scripts, file paths, and software environments reflect the original analysis environment and may require modification before reuse. For additional information regarding the biological questions, experimental methods, underlying datasets, or final published results, please contact Aaron Puri at the University of Utah or see the associated publication.
Starting with 3 biological replicates and raw DNA and RNA sequencing data:
- Process the DNA to generate metagenomes
- Process the RNA to generate metatranscriptomes
- Run the bash scripts in Metatranscriptome_processing.md to trim adapters, check quality, map reads to the metagenome, and optionally merge biological replicates
Once you have processed all 3 samples, each with a corresponding metagenome (21142X#) and metatranscriptome (21113X#), here are the avenues for analysis:
- Co-assembly: Mapping merged reads to the co-assembly
- Individual MAGs: Mapping reads to individual MAGs of interest
- antiSMASH map: Mapping reads to antiSMASH clusters of interest
- All-by-all: Mapping all reads sets to all replicates + co-assembly and focusing on antiSMASH clusters of interest
- BiG-MAP: BiG-MAP analysis to get RPKM values