Skip to content

Commit 329f429

Browse files
authored
Merge pull request #19 from lh3/readme-male
README: remove PARs for males
2 parents 04dd6f0 + e8d87c4 commit 329f429

1 file changed

Lines changed: 3 additions & 3 deletions

File tree

README.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -7,7 +7,7 @@ cd hickit-0.1_x64-linux
77
# Map Dip-C reads and extract contacts (skip if you use your own pipeline)
88
./seqtk mergepe read1.fq.gz read2.fq.gz | ./pre-dip-c - | bwa mem -5SP -p hs37d5.fa - | gzip > aln.sam.gz
99
./k8 hickit.js vcf2tsv phased.vcf > phased_SNP.tsv # extract phased SNPs from VCF
10-
./k8 hickit.js sam2seg -v phased_SNP.tsv aln.sam.gz | ./k8 hickit.js chronly - | gzip > contacts.seg.gz # for male
10+
./k8 hickit.js sam2seg -v phased_SNP.tsv aln.sam.gz | ./k8 hickit.js chronly - | ./k8 hickit.js bedflt par.bed - | gzip > contacts.seg.gz # for male
1111
#./k8 hickit.js sam2seg -v phased_SNP.tsv aln.sam.gz | ./k8 hickit.js chronly -y - | gzip > contacts.seg.gz # for female
1212
./hickit -i contacts.seg.gz -o - | bgzip > contacts.pairs.gz # optional
1313

@@ -174,7 +174,7 @@ hickit -i contacts.seg.gz -o - | bgzip > contacts.pairs.gz
174174
When you have phased SNPs in VCF, you can generate contact pairs with the phase columns
175175
```sh
176176
hickit.js vcf2tsv NA12878_phased.vcf.gz > phased_SNP.tsv
177-
hickit.js sam2seg -v phased_SNP.tsv aln.sam.gz | hickit.js chronly - | gzip > contacts.seg.gz
177+
hickit.js sam2seg -v phased_SNP.tsv aln.sam.gz | hickit.js chronly - | hickit.js bedflt par.bed - | gzip > contacts.seg.gz
178178
hickit -i contacts.seg.gz -o - | bgzip > contacts.pairs.gz
179179
```
180180
where `hickit.js chronly` filters out non-chromosomal contigs and
@@ -184,7 +184,7 @@ chr1 1010717 C T
184184
chr1 1011531 T C
185185
chr1 1013136 C G
186186
```
187-
Note that the above is for **male** samples. For **female** samples, the part `hickit.js chronly -` should be replaced by `hickit.js chronly -y -` to remove the Y chromosome.
187+
Note that the above is for **male** samples. Here the pseudoautosomal regions (PARs, coordinates supplied in `par.bed`) are excluded from analysis. For **female** samples, the part `hickit.js chronly - | hickit.js bedflt par.bed -` should be replaced by `hickit.js chronly -y -` to remove the Y chromosome instead.
188188

189189
### <a name="impute"></a>Imputing missing phases (diploid single-cell Hi-C only)
190190

0 commit comments

Comments
 (0)