PERADIGM

Phenotype Embedding Similarity-based Rare Disease Gene Mapping

This repository contains the R code supporting the analysis described in the paper:
PERADIGM: Phenotype Embedding Similarity-based Rare Disease Gene Mapping

Overview

PERADIGM is a framework that integrates phenotype embedding and patient similarity to identify rare disease-associated genes using large-scale biobank data. This repository includes code to replicate the key analyses and figures from the study.

main.R: Main script for the analysis, including:
- Data loading and preprocessing
- Running phenotype-gene association tests
- Generating similarity matrices and embeddings
- Outputting statistical results
function.R: Contains all helper functions for:
- Embedding computation
- Similarity scoring
- Regression-based testing
- Carrier/control selection

📁 Repository Structure

Place your data files using the following directory structure:

data/
├── R_doc/
│   ├── hesin_diag_all_new.RData
│   ├── eid_all.RData
│   ├── cov_adjust.RData
│   └── IC_hesin_500k.csv
├── icd_related/
│   └── ICD10_mapping.csv
├── generate_all_gene_pos/
│   └── gene_info.RData
├── embedding/
│   └── hesin_icd10_descrip_embed.txt
└── hesin_diag.txt   # Optional/redundant diagnosis file

🔧 Getting Started

To reproduce the analysis:

Ensure R and required packages are installed.
Place the data files in the correct subfolders as shown above.
Run main.R to initiate the pipeline.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
README.md		README.md
function.R		function.R
main.R		main.R

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

PERADIGM

Overview

Contents

📁 Repository Structure

🔧 Getting Started

About

Uh oh!

Releases

Packages

Languages

YCSGP/PERADIGM

Folders and files

Latest commit

History

Repository files navigation

PERADIGM

Overview

Contents

📁 Repository Structure

🔧 Getting Started

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages