Commit cd15009

Fixup: Add date annotations for rare genotypes
Six of the samples that are force-included in the Nextclade dataset tree have empty collection date fields in the metadata output from NCBI Datasets. This results in the samples being removed downstream by the TreeTime clock filter. This commit adds collection dates (which were manually extracted from the strain names in the NCBI metadata) for these samples so that they will be included in the Nextclade dataset tree.
1 parent dc3cd4b commit cd15009

1 file changed

Lines changed: 10 additions & 0 deletions

ingest/defaults/annotations.tsv

@@ -146,3 +146,13 @@ U64582 date 1988-XX-XX
 X84865 date 1994-XX-XX
 X84872 date 1990-XX-XX
 X84879 date 1971-XX-XX
+#
+# Strains with rare genotypes
+# Dates are retrieved from epi-weeks reported within strain names on NCBI
+# These are force-included in the nextclade tree to boost representation of rare genotypes
+AF410989 date 1987-03-09 # genotype E
+AY037009 date 2000-06-12 # genotype G2
+AY037043 date 2000-04-17 # genotype H2
+AY037026 date 1997-03-24 # genotype H2
+AY037028 date 2000-03-13 # genotype D2
+FJ668380 date 2003-02-10 # genotype D10
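
To illustrate how annotation rows like these fill in the empty collection dates, here is a minimal Python sketch. It is not the ingest workflow's actual implementation: the helper names `parse_annotations` and `apply_annotations`, the `accession` key, and the record `ZZ000000` are all hypothetical, and the sketch assumes that `#` only ever starts a comment and that fields are separated by whitespace.

```python
def parse_annotations(text):
    """Yield (accession, field, value) triples from annotation text.

    Assumption: '#' always starts a comment (full-line or trailing),
    and the three fields are separated by whitespace.
    """
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments and padding
        if not line:
            continue  # blank line or comment-only line
        parts = line.split(None, 2)
        if len(parts) != 3:
            raise ValueError(f"expected 3 fields, got: {raw!r}")
        yield tuple(parts)


def apply_annotations(records, annotations, id_field="accession"):
    """Return copies of records with annotated fields overriding existing values."""
    overrides = {}
    for accession, field, value in annotations:
        overrides.setdefault(accession, {})[field] = value
    return [{**rec, **overrides.get(rec[id_field], {})} for rec in records]


# A few rows copied from the diff above.
ANNOTATIONS = """\
X84879 date 1971-XX-XX
#
# Strains with rare genotypes
AF410989 date 1987-03-09 # genotype E
FJ668380 date 2003-02-10 # genotype D10
"""

records = [
    {"accession": "AF410989", "date": ""},            # empty date from NCBI
    {"accession": "FJ668380", "date": ""},            # empty date from NCBI
    {"accession": "ZZ000000", "date": "2001-01-01"},  # hypothetical, not annotated
]

patched = apply_annotations(records, parse_annotations(ANNOTATIONS))
for rec in patched:
    print(rec["accession"], rec["date"])
```

With dates filled in this way, the two force-included samples would no longer reach a downstream clock filter with an empty date, while records without a matching annotation pass through unchanged.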
