Meta-analytic microbiome target discovery for immune checkpoint inhibitor response in advanced melanoma
This repository includes a set of analyses and visualization for a project entitled "Meta-analytic microbiome target discovery for immune checkpoint inhibitor response in advanced melanoma".
This repository contains a reproducible computational pipeline for conducting a meta-analysis of metagenomic samples in melanoma immunotherapy. Our study integrates microbial species, biosynthetic gene clusters (BGCs), and functional pathways across multiple datasets to identify key microbial signatures associated with immune checkpoint inhibitor (ICI) response.
If you use this pipeline in your research, please cite our study:
Zhang, X., Mallick, H. & Rahnavard, A. Meta-analytic microbiome target discovery for immune checkpoint inhibitor response in advanced melanoma. bioRxiv 2025, 2025.03.21.644637 (2025). https://doi.org/10.1101/2025.03.21.644637.
- Comprehensive Meta-Analysis: The comprehensive meta-analysis of metagenomic samples in melanoma immunotherapy(N=678).
- Identification of Secondary Metabolites: Novel discovery of secondary metabolites linked to immunotherapy response.
- Biomarker Discovery: Faecalibacterium SGB15346 identified as a potential biomarker for ICI response.
- Functional Insights: RiPP biosynthetic gene clusters and Enterobacteriaceae-associated BGCs enriched in responders.
- Reproducible Analysis Pipeline: A standardized and reproducible workflow for future research applications.
We compiled publicly available whole metagenome shotgun sequencing (MGS) datasets from melanoma patients receiving immunotherapy. The following studies were included:
- GopalakrishnanV_2018 (PRJEB228939)
- MatsonV_2018 (PRJNA3997428)
- FrankelAE_2017 (PRJNA39790610)
- SpencerCN_2021 (PRJNA77029513)
- BaruchEN_2021 (PRJNA67873712)
- DavarD_2021 (PRJNA67286711)
- LeeKA_2022 (PRJEB431196)
-
Removal of host DNA sequences using KneadData.
-
Quality filtering and trimming with fastp.
-
Taxonomic profiling via MetaPhlAn 4.
-
Pathway Analysis: Conducted using HUMAnN3 to identify enriched pathways.
-
Biosynthetic Gene Clusters (BGCs): Identified using antiSMASH 7.0 and BGCLens.
-
Gene Families: Annotated using omePath.
-
Bray-Curtis Dissimilarity: Used to assess inter-study variability.
-
Statistical associations analyzed using MaAsLin2 within
MMUPHin.R
. -
Compound Poisson Linear Model (CPLM) applied to identify significant microbiome features.
-
PERMANOVA used to quantify variance explained by batch effects.
-
Volcano Plot: Run
volcano_plot.R
to visualize differentially abundant taxa or pathways. -
Heatmap: Use
heatmap.R
to generate heatmaps for significant associations.
-
Faecalibacterium SGB15346 is significantly enriched in responders across multiple studies.
-
RiPP biosynthetic gene clusters exhibit increased abundance in responders.
-
dTDP-sugar biosynthesis pathways correlate with non-response.
-
Batch effect correction successfully reduced inter-study variability while preserving biological signals.
For questions, please open an issue or contact Ali Rahnavard, Himel Mallick or Xinyang Zhang.
This project is licensed under the MIT License - see the LICENSE file for details.