Kivvi produces a VCF file for each region for each sample. The Kivvi VCF follows the standard VCF format specifications. The QUAL field contains a placeholder value. We have repurposed the sample column so that each column represents a completely assembled allele. In addition, we report the RU INFO field to represent the repeat units where a variant is found (repeat units are separated by comma). The four values for each repeat unit are:
- Repeat unit ID
- Where to find this repeat unit, represented as
x.y, wherexis the index of the allele andyis the index of the repeat unit on the allele. - Read depth at this position for this repeat unit
- Number of reads supporting the variant
| CHROM | POS | ID | REF | ALT | QUAL | FILTER | INFO | FORMAT | allele1 | allele2 |
|---|---|---|---|---|---|---|---|---|---|---|
| chr6 | 160613775 | . | G | C | . | PASS | RU=31:2.3:31:31,30:2.4:30:30 | GT | 0 | 1 |
This variant is found on two repeat units:
- Repeat unit index #31, which is the third unit on the second allele (
2.3). There are 31 reads covering this position for this repeat unit and 31 of those support the variant. - Repeat unit index #30, which is the fourth unit on the second allele (
2.4). There are 30 reads covering this position for this repeat unit and 30 of those support the variant.
This variant is only found on the second allele, so its GT is 0 for allele1 and 1 for allele2.
| CHROM | POS | ID | REF | ALT | QUAL | FILTER | INFO | FORMAT | allele1 | allele2 | allele3 |
|---|---|---|---|---|---|---|---|---|---|---|---|
| d4z4_ref | 1611 | . | C | G | . | PASS | RU=73:2.3:19:19,44:2.6:21:21 | GT | 0 | 1 | 0 |
D4Z4 reference can be found here.