-
Notifications
You must be signed in to change notification settings - Fork 2
Open
Description
In a DeepIndexFile that is currently sitting on Cori at $CSCRATCH/exabiome/deep-index/input/gtdb/r95/ar122_r95.input.h5, the vocabulary attribute on seq_table/sequence was [A T C G N], when it should be [A C Y W S K D V N T G R W S M H B N].
This attribute should get set during conversion here:
https://github.com/exabiome/deep-taxon/blob/d4ddbf3f33ab0906558389bc543b7f577585ac64/src/exabiome/gtdb/prepare_data.py#L306
According to this line, the correct vocabulary should be returned:
https://github.com/exabiome/deep-taxon/blob/d7f54ddc0b424fc7ee045b1a4fa9a57facfe28ec/src/exabiome/sequence/convert.py#L354-L355
Metadata
Metadata
Assignees
Labels
No labels