Description
Hi,
I have used pypath to download the proteome uniprot accession IDs associated to yeast reference strain (tax id = 559292), done as follows:
from pypath.inputs import uniprot
from pypath.utils import mapping
yeast_proteome = list(set(uniprot._all_uniprots("559292", True)))
print(yeast_proteome)
output:
['P89102', 'Q00764', 'P38350', 'P00549', 'Q8TGQ1', 'P06242', 'Q08900',...'Q0614']
Afterwards, I have attempted to convert each uniprot ID to the corresponding gene symbol, done like this:
gene_symbols_uniprot = []
for uniprot_ID in yeast_proteome:
geneID = mapping.map_name(uniprot_ID, 'uniprot', 'genesymbol')
gene_symbols_uniprot.append(geneID)
print(gene_symbols_uniprot)
output:
[set(), set(), set(), set(), set(),...set()]
However, as shown above, the obtained list (geneID) is a list of empty sets, meaning that the conversion failed for all uniprot IDs, which seems weird to me. Would you be able to help me if there is anything that I am doing wrong or if I should be expecting another result and why?
Additionally, I wonder if it is possible to match systematic ORF names (for instance, I am working with Yeast ORFs) to UniProt IDs. If so, how? Your help is much appreciated, thanks! (For instance, ORF systematic names would be YOL030W, YHR052C-B...)