Skip to content

BUG: ENSEMBL ids not mapped correctly to Uniprot on import #574

@henninggaertner

Description

@henninggaertner

What is the issue?

Ensembl IDs like "ENSP12345678901" are not mapped to their uniprot counterparts ("P54321") because the regex used for identifying uniprot IDs also matched the Ensembl ID. this needs to fixed in the file ms_data_import.py.

How to reproduce the issue?

Try to import data where one of the Protein IDs is the aforementioned ensembl ID. It will not map that to the respective uniprot Id.

What is the expected behavior?

The Uniprot Regex should no longer match / the code correctly uses the mapped uniprot ID instead of the old ensembl ID.

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions