Hi @pascalnotin
This related to my previous issue (#61 ), but I have a more specific request.
Could you possibly provide a 1:1 mapping between all of your RefSeq protein IDs (NP_xxxxx.y) and Ensembl protein IDs (ENSPxxxxx.y)?
I've tried doing the mapping myself but I'm afraid there's simply too many possible many:many mappings that make this process ambiguous. The end result is that the sequence I'm getting from Ensembl API is not matching up with the sequence in the preprocessed clinical sub/indel mutation provided by ProteinGym. It's a little hard for me to pinpoint exactly where the issue is stemming from without the ID mappings.
Thanks in advance,
Brian
Hi @pascalnotin
This related to my previous issue (#61 ), but I have a more specific request.
Could you possibly provide a 1:1 mapping between all of your RefSeq protein IDs (NP_xxxxx.y) and Ensembl protein IDs (ENSPxxxxx.y)?
I've tried doing the mapping myself but I'm afraid there's simply too many possible many:many mappings that make this process ambiguous. The end result is that the sequence I'm getting from Ensembl API is not matching up with the sequence in the preprocessed clinical sub/indel mutation provided by ProteinGym. It's a little hard for me to pinpoint exactly where the issue is stemming from without the ID mappings.
Thanks in advance,
Brian