Add KLIFS kinase-ligand database processor #1488
Open
This adds a new source module for KLIFS (Kinase-Ligand Interaction Fingerprints and Structures), a database of curated structural data on kinase-inhibitor binding.
Changes:
- New `indra.sources.klifs` module with API client and processor
- Generates Inhibition statements from kinase-ligand co-crystallization data
- Kinases grounded to HGNC/UniProt
- Ligands grounded to ChEMBL/ChEBI where available
- Unit tests for processor logic and API integration included
- Added source to `source_info.json`, `default_belief_probs.json`, `statement_presentation.py`, `setup.py`, and `README.md`
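For reviewers, here is a rough sketch of the record-to-statement mapping described above. It is not the code in this PR: the `Agent` and `Inhibition` dataclasses are simplified stand-ins for the INDRA classes of the same name, and the record keys (`kinase_name`, `hgnc_id`, `uniprot_id`, `ligand_name`, `chembl_id`, `chebi_id`, `pdb_id`) are hypothetical field names chosen for illustration, not the KLIFS API schema.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class Agent:
    """Simplified stand-in for an INDRA Agent (name plus groundings)."""
    name: str
    db_refs: Dict[str, str] = field(default_factory=dict)


@dataclass
class Inhibition:
    """Simplified stand-in for an INDRA Inhibition: subj inhibits obj."""
    subj: Agent
    obj: Agent
    evidence: List[dict] = field(default_factory=list)


def _refs(pairs) -> Dict[str, str]:
    """Keep only the groundings that are actually present."""
    return {ns: str(val) for ns, val in pairs if val}


def record_to_statement(record: dict) -> Optional[Inhibition]:
    """Map one co-crystal record (hypothetical keys) to an Inhibition."""
    ligand_name = record.get("ligand_name")
    kinase_name = record.get("kinase_name")
    # Without both a ligand and a kinase there is nothing to assert.
    if not ligand_name or not kinase_name:
        return None
    kinase = Agent(kinase_name, _refs([
        ("HGNC", record.get("hgnc_id")),
        ("UP", record.get("uniprot_id")),
    ]))
    # Ligand groundings are optional ("where available").
    ligand = Agent(ligand_name, _refs([
        ("CHEMBL", record.get("chembl_id")),
        ("CHEBI", record.get("chebi_id")),
    ]))
    evidence = {"source_api": "klifs", "source_id": record.get("pdb_id")}
    # Assumption carried over from the PR: binding is treated as inhibitory.
    return Inhibition(subj=ligand, obj=kinase, evidence=[evidence])


# Illustrative record only; the values are examples, not KLIFS output.
example = {
    "kinase_name": "EGFR", "hgnc_id": "3236", "uniprot_id": "P00533",
    "ligand_name": "erlotinib", "chembl_id": "CHEMBL553", "pdb_id": "1M17",
}
stmt = record_to_statement(example)
```

The ligand is the subject and the kinase the object, following the usual drug-target direction for Inhibition statements. A ligand with no ChEMBL or ChEBI ID simply ends up with empty `db_refs` in this sketch.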
Belief priors: set to match those of similar drug-target databases (syst: 0.01, rand: 0.1). Feedback on these values is welcome.
Note: The processor assumes that every kinase-ligand interaction is inhibitory. This holds for most KLIFS entries but not all of them, and the rand prior is meant to account for that uncertainty.
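To make the proposed priors easier to judge, here is a small sketch of what they imply for statement belief. It assumes the single-source form of INDRA's default scoring, where belief is `1 - (syst + rand ** n)` for `n` evidences from the same source. That formula is my reading of the default scorer, not something this PR changes, and `prior_belief` is a hypothetical helper name.

```python
def prior_belief(n_evidence: int, syst: float = 0.01, rand: float = 0.1) -> float:
    """Belief for a statement backed by n evidences from a single source.

    Assumes the single-source scoring form 1 - (syst + rand ** n):
    random error shrinks with each added evidence, systematic error does not.
    """
    if n_evidence < 1:
        raise ValueError("need at least one evidence")
    return 1.0 - (syst + rand ** n_evidence)


# One co-crystal structure: roughly 0.89.
single = prior_belief(1)
# Two structures of the same kinase-ligand pair: roughly 0.98.
double = prior_belief(2)
# With many structures the belief approaches 1 - syst = 0.99.
many = prior_belief(10)
```

Under this assumption, a pair seen in a single structure gets a belief of about 0.89, and the ceiling for any KLIFS-only statement is about 0.99, set by the syst prior.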