Skip to content

19 samples failed to reproduce in PDB timesplit set #41

@BlacksheepJane

Description

@BlacksheepJane

Hi, thank you for your great work on SurfDock!

While reproducing the evaluation on the PDB timesplit (363) dataset using eval_samples.sh, I encountered failures in docking for 19 molecular complexes.

The PDB IDs along with the failure reasons are as follows:

PDB ID Error Message
6qlp index 284 is out of bounds for axis 0 with size 284
6d08 index 272 is out of bounds for axis 0 with size 272
6qtr Sizes of tensors must match except in dimension 1
6cf7 Sizes of tensors must match except in dimension 1
6jam index 165 is out of bounds for axis 0 with size 165
6pnn Sizes of tensors must match except in dimension 1
6oim Sizes of tensors must match except in dimension 1
6k2n index 257 is out of bounds for axis 0 with size 257
6qi7 Sizes of tensors must match except in dimension 1
6olx Sizes of tensors must match except in dimension 1
6pya Sizes of tensors must match except in dimension 1
6jag Sizes of tensors must match except in dimension 1
6qr7 Sizes of tensors must match except in dimension 1
6cjj Sizes of tensors must match except in dimension 1
6h14 index 168 is out of bounds for axis 0 with size 168
6i78 index 105 is out of bounds for axis 0 with size 105
6jao index 160 is out of bounds for axis 0 with size 160
6nd3 Sizes of tensors must match except in dimension 1
6kjf Sizes of tensors must match except in dimension 1

From the above, there appear to be two main categories of errors:

  1. index XXX is out of bounds for axis 0 with size XXX
  2. Sizes of tensors must match except in dimension 1.

After preliminary investigation, I found that both issues originate in the get_complex function within SurfDock/score_in_place_dataset/score_dataset.py.

  • The first issue occurs at the line:
    mda_rec_model = mda.Universe(pure_pocket_path)
  • The second issue occurs during the call:
    get_rec_graph(mda_rec_model, rec_coords, c_alpha_coords, n_coords, c_coords, complex_graph, rec_radius=self.receptor_radius,
                c_alpha_max_neighbors=self.c_alpha_max_neighbors, all_atoms=self.all_atoms,
                atom_radius=self.atom_radius, atom_max_neighbors=self.atom_max_neighbors, remove_hs=self.remove_hs, lm_embeddings=lm_embeddings)

May I ask if you have encountered similar issues when testing on this dataset? If so, could you share how you addressed them?

I would really appreciate any help or guidance you can provide. Thanks again for the great work!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions