-
Notifications
You must be signed in to change notification settings - Fork 769
Implementation of SugarSelection #4790
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: develop
Are you sure you want to change the base?
Conversation
Implementation of SugarSelection with the known abbreviations and aglycans obtained from the glycam webserver
|
Hello @talagayev! Thanks for updating this PR. We checked the lines you've touched for PEP 8 issues, and found: There are currently no PEP 8 issues detected in this Pull Request. Cheers! 🍻 Comment last updated at 2025-01-11 17:54:06 UTC |
Codecov Report✅ All modified and coverable lines are covered by tests. Additional details and impacted files@@ Coverage Diff @@
## develop #4790 +/- ##
========================================
Coverage 92.72% 92.73%
========================================
Files 180 180
Lines 22475 22483 +8
Branches 3190 3190
========================================
+ Hits 20841 20849 +8
Misses 1177 1177
Partials 457 457 ☔ View full report in Codecov by Sentry. 🚀 New features to boost your workflow:
|
pep fix 1
Added test files
Added tests for sugar selection
pep fix
|
@hmacdope I will ping you here since you were one of the participants in the discussion that initiated the Issue that this PR is covering. The main problem that I see currently, would be that the GLYCAM Abbreviations for the sugars due to the combinations have quite a lot of different names/abbreviations. Some of them could lead to tricky cases, with the Allose Nomenclature having I checked the RCSB-PDB and found only one case, where a unique ligand was called As for the coverage, while looking at the PDBs in RCSB-PDB it covers |
added changelog
added docs
Adds a minimal sugar selection that matches common GLYCAM06 hexose residue names (Glc, Gal, Man). This mirrors existing residue-based selectors in the codebase and intentionally limits the initial scope to well-documented cases. Closes MDAnalysis#4790.
RMeli
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
IAlibay
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Needs updating against main and a couple of small things, otherwise this is good.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
gzip this file?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
compressed the file to 6kya.pdb.gz
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is small enough to be ok non-gzipped.
| # DSSP testing: from https://github.com/ShintaroMinami/PyDSSP | ||
| DSSP = (_data_ref / "dssp").as_posix() | ||
|
|
||
| GLYCAM = (_data_ref / 'GLYCAM_sugars.pdb').as_posix() |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Add some comments to describe these files?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Added the description :)
package/CHANGELOG
Outdated
| directly passing them. (Issue #3520, PR #5006) | ||
|
|
||
| Enhancements | ||
| * Addition of 'sugar' token for GLYCAM, PDB and CHARMM sugar selection (Issue #4790) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Sorry my bad - it should go in 2.11! (I forget what we have and haven't released)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
All good, will adjust it :)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
adjusted it t 2.11
|
I also wanted to get your quick opinion on that PR @IAlibay, since when we implemented the Here I have a small worry with this PR, that with the big amount of possible of abbrevations with 3 letters some may overlap with co-crystalized ligands. So for example something like What would be your opinion on that, how likely something like that could happen, espeically again with here the token covering every possible GLYCAM nomenclature, which leads to many ids falling under it 🤔 |
Co-authored-by: Irfan Alibay <IAlibay@users.noreply.github.com>
moved to 2.11.0
Co-authored-by: Irfan Alibay <IAlibay@users.noreply.github.com>
Fixes #4563
Changes made in this Pull Request:
SugarSelectionclass that allows the selection of sugars throughthe access of known PDB, CHARMM and GLYCAM abbreviations
GLYCAMandSUGAR_PDBfiles inMDAnalysisTest.datatest_sugar_glycam_selection()andtest_sugar_pdb_selectionin
test_atomselections.pyCurrently I used the following abbreviations:
https://glycam.org/docs/othertoolsservice/2016/06/09/3d-snfg-list-of-residue-names/index.html
In addition of using the GLYCAM Webserver to obtain the known Sugar abbreviations and also
included the aglycans that I obtained from the GLYCAM Weberserver.
The Pytest Files were retrieved from RCSB-PDB and the GLYCAM-Webserver:
https://glycam.org/
PR Checklist
Developers certificate of origin
📚 Documentation preview 📚: https://mdanalysis--4790.org.readthedocs.build/en/4790/