vocab not being written correctly #27

@ajtritt

In a DeepIndexFile currently on Cori at $CSCRATCH/exabiome/deep-index/input/gtdb/r95/ar122_r95.input.h5, the vocabulary attribute on seq_table/sequence is [A T C G N], but it should be [A C Y W S K D V N T G R W S M H B N].

This attribute should get set during conversion here:
https://github.com/exabiome/deep-taxon/blob/d4ddbf3f33ab0906558389bc543b7f577585ac64/src/exabiome/gtdb/prepare_data.py#L306

According to this line, the correct vocabulary should be returned:
https://github.com/exabiome/deep-taxon/blob/d7f54ddc0b424fc7ee045b1a4fa9a57facfe28ec/src/exabiome/sequence/convert.py#L354-L355
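To check whether a given file is affected, something like the following sketch could be used. It assumes the attribute is stored under the name "vocabulary" directly on the seq_table/sequence dataset as an array of strings, which may not match how the file is actually laid out. The helper names read_vocab and compare_vocab are hypothetical and not part of deep-taxon.

```python
# Vocabularies as quoted in this issue.
EXPECTED_VOCAB = "A C Y W S K D V N T G R W S M H B N".split()
OBSERVED_VOCAB = "A T C G N".split()


def read_vocab(path, dataset="seq_table/sequence", attr="vocabulary"):
    """Read a vocabulary attribute from an HDF5 dataset as a list of str.

    Assumption: the attribute is a plain array of strings (or bytes)
    attached to the dataset, not a reference to another dataset.
    """
    import h5py  # imported lazily so compare_vocab works without h5py

    with h5py.File(path, "r") as f:
        raw = f[dataset].attrs[attr]
    return [v.decode() if isinstance(v, bytes) else str(v) for v in raw]


def compare_vocab(found, expected=EXPECTED_VOCAB):
    """Return (matches, missing, unexpected) for a found vocabulary.

    matches is True only if the symbols and their order are identical.
    missing lists expected symbols absent from found; unexpected lists
    symbols in found that are not expected.
    """
    found = list(found)
    expected = list(expected)
    missing = sorted(set(expected) - set(found))
    unexpected = sorted(set(found) - set(expected))
    return found == expected, missing, unexpected


# Example with the values from this issue (no file access needed):
ok, missing, unexpected = compare_vocab(OBSERVED_VOCAB)
print(ok, missing, unexpected)
```

On a real file, compare_vocab(read_vocab(path)) would give the same kind of report, provided the layout assumption above holds.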
