README.md: 37 additions & 8 deletions
@@ -1,7 +1,7 @@
 # P6
 **Peter's Parse and Processing of Prenatal Particulars via Pandas**
 
-A simple, extensible CLI for downloading the Human Phenotype Ontology, parsing genotype/phenotype Excel workbooks, and producing [GA4GH Phenopackets](https://phenopacket-schema.readthedocs.io/en/latest/schema.html#version-2-0) as specified [here](https://phenopacket-schema.readthedocs.io/_/downloads/en/stable/pdf/).
+A simple, extensible CLI for downloading the Human Phenotype Ontology, parsing genotype/phenotype Excel workbooks, and producing [GA4GH Phenopackets](https://phenopacket-schema.readthedocs.io/en/latest/schema.html#version-2-0) as specified [here](https://phenopacket-schema.readthedocs.io/_/downloads/en/stable/pdf/). This project enables downloading the latest or specified Human Phenotype Ontology (HPO) JSON release, auto-classifying Excel sheets as genotype or phenotype data, normalizing column names and HPO IDs, and writing one Phenopacket per record. Additional commands provide quick auditing of workbooks for header normalization, sheet classification, and required variant columns. Built for easy integration and reproducibility, P6 supports rapid phenotypic data preparation for research and clinical workflows, and runs locally with simple installation via pip. The end usage of this project is to convert an existing digital record of phenotypic data into phenopackets, such that they may be linked to their corresponding VCFs and used to integrate with a larger federated repository system.
 
 ## Table of Contents
 
@@ -10,10 +10,12 @@ A simple, extensible CLI for downloading the Human Phenotype Ontology, parsing g
 3. [Installation](#installation)
 4. [Quickstart](#quickstart)
    - [Download HPO JSON](#download-hpo-json)
-   - [Parse Excel to Phenopackets](#parse-excel-to-phenopackets)
+   - [Parse Excel to Phenopackets](#parse-excel-to-phenopackets)
+   - [Audit Excel Workbooks](#audit-excel-workbooks)
 5. [CLI Reference](#cli-reference)
    - [`p6 download`](#p6-download)
    - [`p6 parse-excel`](#p6-parse-excel)
+   - [`p6 audit-excel`](#p6-audit-excel)
 6. [Development & Testing](#development--testing)
 7. [Contributing](#contributing)
 8. [License](#license)
@@ -94,18 +96,32 @@ Resulting phenopacket files will be under:
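The paragraph added to the README mentions normalizing column names and HPO IDs but does not show what that involves. The following is a minimal sketch of one way such normalization could work, not P6's actual implementation: the function names `normalize_header` and `normalize_hpo_id` and the accepted input spellings are assumptions made for illustration. The only fact relied on is that HPO term identifiers have the form `HP:` followed by seven digits.

```python
import re


def normalize_header(name: str) -> str:
    """Hypothetical helper: turn a spreadsheet header into snake_case.

    Trims and lowercases the header, collapses every run of characters
    other than a-z and 0-9 into one underscore, then drops any leading
    or trailing underscores.
    """
    return re.sub(r"[^0-9a-z]+", "_", name.strip().lower()).strip("_")


def normalize_hpo_id(raw: object) -> str:
    """Hypothetical helper: coerce a cell value to the 'HP:0000000' form.

    Accepts an optional 'HP' prefix (any case) followed by an optional
    ':' or '_', then 1 to 7 digits. Anything else raises ValueError.
    """
    match = re.fullmatch(
        r"\s*(?:HP[:_]?)?\s*(\d{1,7})\s*", str(raw), flags=re.IGNORECASE
    )
    if match is None:
        raise ValueError(f"not an HPO identifier: {raw!r}")
    # Zero-pad to the seven digits used by HPO term identifiers.
    return f"HP:{int(match.group(1)):07d}"
```

Under these assumptions, `normalize_header("  HPO ID (observed) ")` returns `hpo_id_observed` and `normalize_hpo_id("hp_1250")` returns `HP:0001250`. Rejecting unparseable values with an exception, rather than passing them through, is a design choice that suits an audit-style command where bad cells should be reported.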