Skip to content

tooling to auto-generate Defining Location Constraint (bulk Cat-VRS construction) #211

@Hui-zju

Description

@Hui-zju

Feature description

I’m working on batch construction of Cat-VRS CategoricalVariants (potentially at scale for a curated variant set), and I’m still unsure about best practices around building Constraints, especially Defining Location Constraint.

Question / Pain Point

When I decide to use a Defining Location Constraint, I need to construct the Location object(s) precisely (e.g., reference sequence identifiers, coordinate system, normalization behavior, transcript vs genomic context, etc.). In practice this is error-prone for bulk generation.

Is there an existing tool or recommended workflow to automatically generate the Defining Location Constraint content, given higher-level inputs like:

a gene symbol (e.g., PTEN / EGFR),

a transcript accession (optional),

a region description (e.g., “exon 19”, “kinase domain”, “coding region”, “promoter”, “whole gene”, etc.),

or a HGVS-like description / coordinates?

If not currently available:

Feature Request

Would the project consider adding (or recommending) a helper utility / API in the Python reference implementation to generate a valid Defining Location Constraint automatically, for example:

Input: gene symbol + transcript ID + region spec (exon/domain/interval)
Output: a normalized VRS Location (SequenceLocation / ChromosomeLocation) suitable for Defining Location Constraint.

Or Input: HGVS / VCF-like coordinates + assembly
Output: normalized Location object + any required identifiers.

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions