Link Prediction

Study bias, topological imbalance, and biased predictive models are cyclically connected, creating a vicious cycle that hinders our ability to make accurate predictions about less-studied genes, diseases, and drugs.

Lucchetta et al. (2023) recently showed that study bias in protein binding experiments creates topologically imbalanced networks. Contemporaneous work by Bonner et al. (2022) showed that knowledge graph embedding (KGE) link prediction (LP) models, when trained on topologically imbalanced networks, become heavily biased towards recommending high-degree nodes. We show that these over-prioritized nodes are the ones that have already been extensively studied; when such predictions are used to generate hypotheses and direct experimental studies, they create more study bias. The cycle repeats, producing a system of preferential attachment in which well-studied (and consequently well-connected) genes gain more connections while less-studied nodes receive few or no new connections.
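The feedback loop described above can be illustrated with a toy simulation. This is only a minimal sketch, not the method used in this repository: it assumes a "predictor" that proposes each new link with probability proportional to a node's current degree, and the names `simulate_feedback_loop` and `top_share` are hypothetical helpers introduced for illustration.

```python
import random


def simulate_feedback_loop(degrees, rounds, seed=0):
    """Toy model of the study-bias cycle (an assumption, not the repo's method).

    Each round, one new link is attached to a node chosen with probability
    proportional to its current degree, mimicking a degree-biased predictor
    whose suggestions are then "studied" and added back to the graph.
    """
    rng = random.Random(seed)
    degrees = list(degrees)  # copy so the caller's list is not mutated
    nodes = list(range(len(degrees)))
    for _ in range(rounds):
        chosen = rng.choices(nodes, weights=degrees, k=1)[0]
        degrees[chosen] += 1
    return degrees


def top_share(degrees, fraction=0.1):
    """Fraction of all link endpoints held by the top `fraction` of nodes."""
    k = max(1, int(len(degrees) * fraction))
    top = sorted(degrees, reverse=True)[:k]
    return sum(top) / sum(degrees)


if __name__ == "__main__":
    start = [20, 5, 1, 1, 0]  # made-up starting degrees
    end = simulate_feedback_loop(start, rounds=100, seed=1)
    print("start:", start, "top share:", top_share(start, 0.2))
    print("end:  ", end, "top share:", top_share(end, 0.2))
```

In this toy setting a node that starts with degree 0 is never selected, which mirrors the point that unstudied entities receive no new connections under a purely degree-driven recommender.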

