Skip to content

Mismatching SARS-CoV-2 Sequences #65

@TheHarshShow

Description

@TheHarshShow

Hi there,

I created a PanGraph of 200 SARS-CoV-2 sequences using FASTA sequences as input, and it seems that eleven of them aren't represented incorrectly in the JSON file. I have uploaded the data here. The original FASTA file is denoted by sars_200_orig.fa. The represented sequences (determined by me) are represented by sars_200_pangraph.fa, and the PanGraph JSON file is denoted by sars_200.json. The sequences that we believe aren't matching are England/BRBR-2B7C38D/2021|OV263009.1|2021-11-22, IMS-10178-CVDP-0E892CAB-4101-45AD-A5AB-82C23A77B85B|OX112182.1|2021-10-14, Denmark/DCGC-179132/2021|OW435830.1|2021-10-02, SouthAfrica/NHLS-UCT-GS-AD95/2021|OM739820.1|2021-08-30, IMS-10150-CVDP-7250DCF0-8B47-40DA-89AF-8E56669A8CB5|OU964784.1|2021-10-12, USA/CA-CDC-FG-175698/2021|OL666921.1|2021-11-18, Denmark/DCGC-196557/2021|OW446795.1|2021-10-24, Denmark/DCGC-151767/2021|OV830941.1|2021-08-12, USA/MA-CDCBI-CRSP_4TOCNN2I3HYX32WD/2021|MZ752955.1|2021-08-02, England/LOND-12FD57B/2021|OU391062.1|2021-05-23 and RNA|OX380648.1|2022-10-22.
Can you please look into it?

Best,
Harsh

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions