@@ -62,17 +62,6 @@ The target audience is computational biologists who work with single-cell ATAC-s
6262bulk ATAC-seq data and need to systematically prioritize peaks for experimental
6363follow-up, particularly for enhancer discovery or adeno-associated virus (AAV) tool design.
6464
65- Existing tools address related but distinct problems: peak callers such as
66- MACS2 [ @Zhang2008 ] identify open chromatin regions but rank peaks only by
67- fold change or p-value, which reflects signal strength rather than cell-type
68- specificity. A peak with high MACS2 fold change may be active across many cell
69- types (a housekeeping element) and therefore a poor candidate for cell-type
70- targeted AAV tools. Differential accessibility tools such as ArchR [ @Corces2018 ]
71- test for cell type enrichment but operate within their own data model. Annotation
72- tools such as GREAT [ @McLean2010 ] link peaks to genes. None provide a unified,
73- flexible framework for assembling a standardized feature matrix across
74- heterogeneous input tracks — which is precisely what PyPeakRankR addresses.
75-
7665# State of the field
7766
7867Several tools perform individual aspects of peak level feature computation.
@@ -84,6 +73,17 @@ but does not produce portable, tool agnostic feature tables. `pyfaidx`
8473[ @Shirley2015 ] enables FASTA sequence access but provides no genomics feature
8574pipeline.
8675
76+ Existing tools address related but distinct problems: peak callers such as
77+ MACS2 [ @Zhang2008 ] identify open chromatin regions but rank peaks only by
78+ fold change or p-value, which reflects signal strength rather than cell-type
79+ specificity. A peak with high MACS2 fold change may be active across many cell
80+ types (a housekeeping element) and therefore a poor candidate for cell-type
81+ targeted AAV tools. Differential accessibility tools such as ArchR [ @Corces2018 ]
82+ test for cell type enrichment but operate within their own data model. Annotation
83+ tools such as GREAT [ @McLean2010 ] link peaks to genes. None provide a unified,
84+ flexible framework for assembling a standardized feature matrix across
85+ heterogeneous input tracks — which is precisely what PyPeakRankR addresses.
86+
8787PyPeakRankR fills this gap by combining ` pyBigWig ` , ` pyfaidx ` , and
8888` scipy ` [ @Virtanen2020 ] into a CLI pipeline that assembles heterogeneous
8989features into a single reproducible TSV table.
0 commit comments