consensusClustR

Consensus clustering is a resampling-based method for discovering robust sample or feature clusters, such as stable patient subtypes and their molecular signatures. It addresses two common difficulties in traditional clustering: choosing the number of clusters and assessing how stable those clusters are.

Consensus Matrix, CDF, and Delta Area Plots: To identify the optimal number of clusters K, consensus clustering evaluates the stability of sample groupings across repeated subsampling. For each K, the consensus matrix records how often each pair of samples/features is clustered together, as a fraction of the iterations in which both were drawn; values close to 1 indicate pairs that almost always co-cluster, and values close to 0 indicate pairs that almost never do. From this matrix, a Cumulative Distribution Function (CDF) plot is generated, summarizing the distribution of consensus values for each K. The delta area plot shows the relative change in the area under the CDF curve as K increases. Together, these plots help find the “elbow point”, the value of K beyond which adding more clusters yields diminishing returns, indicating that further clusters mostly capture noise rather than meaningful structure.
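
To make the three quantities concrete, here is a minimal base-R sketch that computes a consensus matrix, the area under its CDF, and the delta area for a range of K. It is an illustration under stated assumptions, not the code this repo or ConsensusClusterPlus runs: the toy data, the helper names `consensus_matrix` and `cdf_area`, and the choice of average-linkage hierarchical clustering on Euclidean distance are all hypothetical.

```r
# Minimal consensus-clustering sketch using only base R (stats/utils).
set.seed(1)

# Toy data (assumption): 30 samples x 5 features, three separated groups of 10.
x <- rbind(
  matrix(rnorm(50, mean = 0),  ncol = 5),
  matrix(rnorm(50, mean = 8),  ncol = 5),
  matrix(rnorm(50, mean = 16), ncol = 5)
)

# Consensus matrix for a given k: fraction of subsamples in which a pair of
# samples lands in the same cluster, among subsamples containing both.
consensus_matrix <- function(x, k, reps = 50, p_item = 0.8) {
  n <- nrow(x)
  together <- matrix(0, n, n)  # times a pair was clustered together
  sampled  <- matrix(0, n, n)  # times a pair was drawn in the same subsample
  for (r in seq_len(reps)) {
    idx  <- sort(sample(n, floor(p_item * n)))
    tree <- hclust(dist(x[idx, , drop = FALSE]), method = "average")
    cl   <- cutree(tree, k)
    together[idx, idx] <- together[idx, idx] + outer(cl, cl, "==")
    sampled[idx, idx]  <- sampled[idx, idx] + 1
  }
  m <- together / pmax(sampled, 1)  # pmax avoids 0/0 for never co-sampled pairs
  diag(m) <- 1
  m
}

# Area under the empirical CDF of the pairwise consensus values
# (lower triangle only), using the trapezoidal rule on a grid over [0, 1].
cdf_area <- function(m) {
  grid <- seq(0, 1, by = 0.01)
  cdf  <- ecdf(m[lower.tri(m)])(grid)
  sum(diff(grid) * (head(cdf, -1) + tail(cdf, -1)) / 2)
}

ks    <- 2:6
areas <- sapply(ks, function(k) cdf_area(consensus_matrix(x, k)))

# Delta area: the area itself for the smallest K, then the relative change
# in area from K - 1 to K.
delta <- c(areas[1], diff(areas) / head(areas, -1))
names(delta) <- ks
print(round(delta, 3))
```

Plotting `delta` against `ks` gives a rough delta area curve; the K after which the values flatten toward zero is the candidate elbow point.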

This repo is an R workflow for performing consensus clustering with ConsensusClusterPlus, either
- locally, for smaller expression matrices, or
- on an HPC cluster for large (genomic) data.
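
For orientation, a local run on a small matrix might look roughly like the sketch below. This is not the repo's workflow (see consensusWorkflow.md for that); the toy matrix `expr`, the parameter values, and writing the plots to `tempdir()` are assumptions chosen for illustration. The call is skipped if the Bioconductor package ConsensusClusterPlus is not installed.

```r
# Toy expression matrix (assumption): 200 features (rows) x 40 samples (columns).
set.seed(42)
expr <- matrix(
  rnorm(200 * 40), nrow = 200,
  dimnames = list(paste0("gene", 1:200), paste0("sample", 1:40))
)
# Shift the first 50 features in half of the samples to create two groups.
expr[1:50, 21:40] <- expr[1:50, 21:40] + 3

if (requireNamespace("ConsensusClusterPlus", quietly = TRUE)) {
  results <- ConsensusClusterPlus::ConsensusClusterPlus(
    d          = expr,       # features in rows, samples in columns
    maxK       = 6,          # evaluate K = 2..6
    reps       = 100,        # number of subsampling iterations
    pItem      = 0.8,        # fraction of samples drawn per iteration
    pFeature   = 1,          # use all features in every iteration
    clusterAlg = "hc",       # hierarchical clustering
    distance   = "pearson",  # 1 - Pearson correlation
    seed       = 42,
    title      = tempdir(),  # output directory for the plots
    plot       = "png"
  )
  # Cluster assignments for K = 2.
  print(table(results[[2]]$consensusClass))
}
```

On an HPC cluster the same call would typically be wrapped in a script and submitted through the scheduler, with `reps` and `maxK` scaled up as resources allow.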

See consensusWorkflow.md (or, if you prefer, consensusWorkflow.rmd) for instructions on running the analysis.

Right now the workflow is optimized for clustering expression (numeric) data. If you’ve got a mix of categorical and continuous features, you can plug in a custom distance function.
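
As one possible way to do this, the sketch below hand-rolls a Gower-style distance for a data frame with numeric and categorical columns, then passes the precomputed distance object to ConsensusClusterPlus. The function name `gower_dist` and the toy data frame `pheno` are hypothetical, missing values are not handled, and how a distance plugs into this repo's workflow may differ. Because a distance object is passed instead of a data matrix, only samples can be resampled, so `pFeature` stays at 1.

```r
# Gower-style distance (sketch): per-column dissimilarities in [0, 1],
# averaged over columns. Assumes no missing values.
gower_dist <- function(df) {
  n <- nrow(df)
  total <- matrix(0, n, n)
  for (col in names(df)) {
    v <- df[[col]]
    if (is.numeric(v)) {
      rng <- diff(range(v))
      # Absolute difference scaled by the column range.
      d <- abs(outer(v, v, "-")) / (if (rng > 0) rng else 1)
    } else {
      # Simple mismatch for categorical values: 0 if equal, 1 otherwise.
      d <- outer(as.character(v), as.character(v), "!=") * 1
    }
    total <- total + d
  }
  m <- total / ncol(df)
  dimnames(m) <- list(rownames(df), rownames(df))
  as.dist(m)
}

# Toy mixed-type data (assumption): 20 samples, one numeric and one
# categorical feature.
set.seed(7)
pheno <- data.frame(
  age   = c(rnorm(10, mean = 30, sd = 3), rnorm(10, mean = 65, sd = 3)),
  stage = rep(c("early", "late"), each = 10),
  row.names = paste0("sample", 1:20)
)
d_mixed <- gower_dist(pheno)

if (requireNamespace("ConsensusClusterPlus", quietly = TRUE)) {
  res <- ConsensusClusterPlus::ConsensusClusterPlus(
    d = d_mixed, maxK = 4, reps = 50, pItem = 0.8, pFeature = 1,
    clusterAlg = "hc", seed = 7, title = tempdir(), plot = "png"
  )
  print(table(res[[2]]$consensusClass))
}
```

For numeric-only data, ConsensusClusterPlus also accepts the name of a user-defined distance function through its `distance` argument, which avoids precomputing the distances.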

rskanchi/consensusClustR
