All notable changes to this project will be documented in this file.
The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.
- "assembly" argument is optional. Assembly is inferred from Cooler if possible, with fallback to optional argument.
- Add: support for NCBI or UCSC genomes by name, based off of: https://github.com/open2c/bioframe/blob/main/bioframe/io/data/_assemblies.yml
- Add Hoang Tran as a package author
- Python 3.10 or higher is required. Update pyproject.toml & documentation to reflect this.
immune-joint-analysis: Demonstrates the use of thejointly-hictoolkit for joint analysis of CD4+ and CD8+ T cell Hi-C and ChIP-seq data available via the ENCODE portal.breast-tissue-joint-analysis: Demonstrates the use of thejointly-hictoolkit for joint analysis of human breast tissue Hi-C data (Choppavarapu et al.), along with breast tissue and MCF-7 cell ChIP-seq, RNA-seq, DNase-seq, and ATAC-seq data from the ENCODE portal.
- Command-line interface (
jointly) with six core subcommands:embed: Joint decomposition of Hi-C matrices using PCA, NMF, or SVDpost-process: UMAP dimensionality reduction and k-means clusteringtrajectory: Trajectory inference using k-means and UMAP projectionsembedding2yaml: Extract metadata from embeddings to generate experiment YAMLtracks2yaml: Convert CSV metadata into YAML for signal track ingestionhdf5db: Build a compressed HDF5 database (JointDb) integrating Hi-C embeddings and epigenetic signal data
- Support for multi-resolution
.mcoolHi-C input files - Outputs include
.parquet,.csv.gz,.pkl.gz,.png, and.h5formats - Compatibility with ENCODE bigwig tracks
- Configurable percentile-based filtering, chromosome restrictions, and mini-batch sizing
- Generates UMAP visualizations with color-coded cluster and sample metadata
- Supports multi-neighbor UMAP and multi-k clustering in a single run
hdf5dboutput for joint Hi-C + epigenomics database format (JointDb)
- Out-of-core, fixed-memory matrix decomposition
- Designed for time courses, tissue atlases, and multi-condition comparisons
- Published on PyPI
- Available as a Docker image on GitHub Container Registry (GHCR)
- Continuous integration with test and linting checks on every PR and push to
main - Automatic GitHub release tagging, PyPI publication, and Docker publishing via GitHub Actions
TBD: Next milestone features will be added here as development progresses.