Skip to content

Mapping Yeast Uniprot/GeneSymbol/ORF issues #259

Open
@alexandre239

Description

@alexandre239

Hi,

I have used pypath to download the proteome uniprot accession IDs associated to yeast reference strain (tax id = 559292), done as follows:

from pypath.inputs import uniprot
from pypath.utils import mapping

yeast_proteome = list(set(uniprot._all_uniprots("559292", True)))
print(yeast_proteome)

output:
['P89102', 'Q00764', 'P38350', 'P00549', 'Q8TGQ1', 'P06242', 'Q08900',...'Q0614']

Afterwards, I have attempted to convert each uniprot ID to the corresponding gene symbol, done like this:

gene_symbols_uniprot = []

for uniprot_ID in yeast_proteome:

    geneID = mapping.map_name(uniprot_ID, 'uniprot', 'genesymbol')
      gene_symbols_uniprot.append(geneID)

print(gene_symbols_uniprot)

output:
[set(), set(), set(), set(), set(),...set()]

However, as shown above, the obtained list (geneID) is a list of empty sets, meaning that the conversion failed for all uniprot IDs, which seems weird to me. Would you be able to help me if there is anything that I am doing wrong or if I should be expecting another result and why?

Additionally, I wonder if it is possible to match systematic ORF names (for instance, I am working with Yeast ORFs) to UniProt IDs. If so, how? Your help is much appreciated, thanks! (For instance, ORF systematic names would be YOL030W, YHR052C-B...)

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugProblem in the code

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions