
[New Model] PoET-2 for DMS Zero-Shot Benchmarks #88

Closed
timt51 wants to merge 1 commit into OATML-Markslab:main from OpenProteinAI:poet-2-dms-unsupervised

Contributor

timt51 commented Aug 25, 2025

This PR adds a new baseline model, PoET-2, for the following benchmarks: DMS substitutions and DMS indels.

  1. To download the model weights and MSAs used by the model to make predictions, run the following:

    cd proteingym/baselines/PoET-2
    make download
    

    This will save the model weights in the directory ~/.cache/ProteinGym/baselines/PoET-2 and MSAs in the directory ~/.cache/ProteinGym/baselines/PoET. Note that the MSAs are the same as those used for PoET(-1).

  2. To make predictions, run

    • scripts/scoring_DMS_zero_shot/scoring_PoET_2_substitutions.sh for the substitutions benchmark or
    • scripts/scoring_DMS_zero_shot/scoring_PoET_2_indels.sh for the indels benchmark

    Predictions will be saved to the directories ${DMS_output_score_folder_subs}PoET-2 and ${DMS_output_score_folder_indels}PoET-2 for the substitutions and indels benchmarks, respectively.

    These scripts will download ~12 GB of predicted structures from AlphaFoldDB and save them in the directory ${PROTEINGYM_CACHE}/baselines/PoET-2/DMS_AF2_structures_cache. The download may take a while. To perform the download step first without running model inference, run the scripts with the environment variable SAMPLE_PROMPTS_ONLY=1. When this variable is set, GPUs will not be utilized, but an NVIDIA GPU must still be present on the system for the imports to succeed.

    Lastly, note that the scripts will try to utilize all GPUs that are available to them.

  3. Using the provided scripts for computing model performance on the DMS benchmarks, I obtained an average Spearman correlation of 0.500 for the DMS substitutions benchmark and 0.573 for the DMS indels benchmark.

  4. Precomputed PoET-2 predictions can be downloaded at the following links:
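
Referring back to step 2, the download-first workflow can be sketched as a small shell helper. This is only an illustration under stated assumptions: `run_poet2_two_phase` is a hypothetical function that is not part of this PR, and restricting GPUs through `CUDA_VISIBLE_DEVICES` relies on the standard CUDA runtime behavior, assuming the scoring scripts do not override that variable themselves.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of the PR): run a PoET-2 scoring script in two phases.
run_poet2_two_phase() {
  local script="$1"    # path to a scoring script, e.g. scoring_PoET_2_substitutions.sh
  local gpus="${2:-}"  # optional GPU list such as "0,1"; empty means all visible GPUs

  # Phase 1: download structures and sample prompts only, with no model inference.
  # An NVIDIA GPU must still be present for the imports to succeed.
  SAMPLE_PROMPTS_ONLY=1 bash "$script" || return 1

  # Phase 2: full inference. By default the scripts try to use every available GPU;
  # setting CUDA_VISIBLE_DEVICES is assumed to limit which GPUs they can see.
  if [ -n "$gpus" ]; then
    CUDA_VISIBLE_DEVICES="$gpus" bash "$script"
  else
    bash "$script"
  fi
}
```

For example, `run_poet2_two_phase scripts/scoring_DMS_zero_shot/scoring_PoET_2_substitutions.sh 0,1` would first populate the structure cache and then score the substitutions benchmark on GPUs 0 and 1 only.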

timt51 force-pushed the poet-2-dms-unsupervised branch from 23d3a2d to 430955f on August 26, 2025 15:49
Contributor Author

timt51 commented Sep 2, 2025

Closing in favor of #89, which includes code for both DMS and clinical unsupervised benchmarks.

timt51 closed this on Sep 2, 2025