Current Behavior
There appears to be missing lineages from the https://github.com/corneliusroemer/pango-sequences/blob/main/data/pango-consensus-sequences_genome-nuc.fasta.zst file that are present in the JSON.
A total of 1222 appear to be missing.
Expected behavior
Is there supposed to be one representative for each lineage?
How to reproduce
Steps to reproduce the current behavior:
Compare the JSON summary file to the genome.zst
Possible solution
Are these supposed to be missing? If so we will accept, but it would be nice if they could be added.
Your environment: if browsing Nextstrain online
Downloading and using data file from Github
Let me know if you would like a complete list.