Description
Hi there,
I created a PanGraph of 200 SARS-CoV-2 sequences from FASTA input, and it seems that eleven of them aren't represented correctly in the JSON file. I have uploaded the data here. The original FASTA file is sars_200_orig.fa, the sequences as represented in the graph (determined by me) are in sars_200_pangraph.fa, and the PanGraph JSON file is sars_200.json. The sequences that we believe don't match are England/BRBR-2B7C38D/2021|OV263009.1|2021-11-22, IMS-10178-CVDP-0E892CAB-4101-45AD-A5AB-82C23A77B85B|OX112182.1|2021-10-14, Denmark/DCGC-179132/2021|OW435830.1|2021-10-02, SouthAfrica/NHLS-UCT-GS-AD95/2021|OM739820.1|2021-08-30, IMS-10150-CVDP-7250DCF0-8B47-40DA-89AF-8E56669A8CB5|OU964784.1|2021-10-12, USA/CA-CDC-FG-175698/2021|OL666921.1|2021-11-18, Denmark/DCGC-196557/2021|OW446795.1|2021-10-24, Denmark/DCGC-151767/2021|OV830941.1|2021-08-12, USA/MA-CDCBI-CRSP_4TOCNN2I3HYX32WD/2021|MZ752955.1|2021-08-02, England/LOND-12FD57B/2021|OU391062.1|2021-05-23 and RNA|OX380648.1|2022-10-22.
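A check of this kind can be sketched as below. This is only an illustrative sketch, not necessarily the exact script behind the list above. It assumes both FASTA files use identical header lines for the same sequence and that case differences should be ignored; the helper names `read_fasta`, `mismatched` and `compare_files` are hypothetical.

```python
def read_fasta(lines):
    """Parse an iterable of FASTA text lines into {header: sequence}."""
    records = {}
    name = None
    chunks = []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # Store the previous record before starting a new one.
            if name is not None:
                records[name] = "".join(chunks)
            name = line[1:]
            chunks = []
        elif name is not None:
            # Assumption: compare case-insensitively, so normalise to upper case.
            chunks.append(line.upper())
    if name is not None:
        records[name] = "".join(chunks)
    return records


def mismatched(original, rebuilt):
    """Return sorted headers whose sequence is missing from or differs in `rebuilt`."""
    return sorted(n for n, seq in original.items() if rebuilt.get(n) != seq)


def compare_files(original_path, rebuilt_path):
    """Read two FASTA files and list the headers that do not match."""
    with open(original_path) as f:
        original = read_fasta(f)
    with open(rebuilt_path) as f:
        rebuilt = read_fasta(f)
    return mismatched(original, rebuilt)
```

Calling `compare_files("sars_200_orig.fa", "sars_200_pangraph.fa")` would then return the headers of the sequences that differ between the two files.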
Can you please look into it?
Best,
Harsh