Skip to content

Missing value for SVCLAIM in DEL/DUP in VCFv4.4: clarification and addition #849

@khetherin

Description

@khetherin

We are seeking clarification on the validity of a missing value ('.') in SVCLAIM for DEL/DUP.
The reason we ask this: Our use case is that we have legacy data sets in GVF format that we are converting to VCF and we would like to use VCFv4.4 given its improved support for structural variants. With legacy data we may not necessarily have the information to determine the SVCLAIM value. In such a case, the lack of an option for a missing value in SVCLAIM is not very practical. A missing value ('.') for SVCLAIM for DEL and DUP would allow us to represent unknown evidence for SVCLAIM but still make use of the improved support for structural variants in VCFv4.4.

The rule in the VCF specification v4.4 (Section 3, page 16) states "DEL/DUP: SVCLAIM must be specified and can [be] D, J, or DJ. J and DJ claims indicate a breakpoint between the start and end of the DEL/DUP".
However, the specification does not state explicitly if a missing value is accepted for SVCLAIM in DEL and DUP.

Question 1: Is a missing value ('.') valid for SVCLAIM in DEL and DUP? Please could we kindly request that the specification is amended to explicitly state whether or not this is accepted.

Question 2: If it is not currently accepted, please could we add a missing value of ('.') to the accepted values for SVCLAIM for DEL and DUP?

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions