trgt-instability divergence outputs one tab-delimited record per allele with 6
fields:
trid: repeat identifier from the repeat catalogallele_index: allele index reported by TRGTallele_seq: allele consensus sequencepurity: allele purity value carried through from TRGTlengths: comma-separated repeat lengths, one per readdistances: comma-separated edit distances, one per read
The lengths and distances lists must have the same number of elements.
Downstream commands ignore blank lines and lines starting with #.
These per-read fields are the raw ingredients used to construct read divergence rates and, downstream, the allele instability profile for the allele.
model requires divergence records to be grouped by trid. If the same
trid appears in multiple non-contiguous blocks, training fails.
trgt-instability model emits a tab-delimited record per trid with fields:
trid: tandem repeat identifierbin_edges: comma-separated bin edge valuesalpha: comma-separated Dirichlet-multinomial parameterstraining_counts:--separated count vectors retained after outlier trimming, with comma-separated counts within each vector
By default, trgt-instability test emits one tab-delimited record per tested
allele with 3 fields:
tridallele_seqp_value: one-sided empirical p-value for excess instability from the parametric-bootstrap test
When --report-effect-size is enabled, test emits 6 fields instead:
tridallele_seqp_valued_mediand_ci_lowerd_ci_upper
The d_* fields summarize the posterior Wasserstein distance between the
allele-specific latent instability profile and the fitted baseline mean
instability profile.