This project provides a Streamlit web application to analyze biological pathways from a list of genes. It uses a combination of public biological databases and a local Large Language Model (LLM) via Ollama to build, reconcile, and visualize a knowledge graph of genes and their associated pathways.
- Easy Gene Input: Paste a list of genes separated by newlines or commas.
- 1-Click Analysis: Automatically fetches data, builds a knowledge graph, and runs analysis.
- LLM-Powered Insights: Generates hypotheses and biological insights from the network structure.
- Interactive Visualization: Displays the resulting gene-pathway graph.
- Data Fetching: Gathers pathway and interaction data from KEGG, Reactome, UniProt, and STRING-DB.
- LLM Reconciliation: Uses a local LLM (via Ollama) to clean, merge, and reconcile the data from the different sources into a coherent set of pathways.
- Graph Analysis: Builds a network graph and analyzes it to find central nodes and communities.
- Insight Generation: Feeds the analysis results back into the LLM to generate plain-language hypotheses and summaries.
- Streamlit UI: Provides a simple web interface for users to input genes and see the results.
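The pipeline above can be sketched in miniature. Everything here is illustrative, not the project's actual code: the function names, the toy gene→pathway edges, and the use of plain degree counting as a stand-in for the real graph analysis are all assumptions.

```python
from collections import Counter

def parse_genes(raw: str) -> list[str]:
    """Split a pasted gene list on newlines and commas,
    dropping blanks and duplicates (first occurrence wins)."""
    tokens = [t.strip().upper()
              for chunk in raw.splitlines()
              for t in chunk.split(",")]
    seen: list[str] = []
    for t in tokens:
        if t and t not in seen:
            seen.append(t)
    return seen

def central_genes(edges: list[tuple[str, str]], top: int = 3) -> list[str]:
    """Rank genes by degree (how many pathways mention them) --
    a simplified stand-in for the centrality step of the analysis."""
    degree = Counter(gene for gene, _pathway in edges)
    return [g for g, _ in degree.most_common(top)]

genes = parse_genes("TP53, BRCA1\nEGFR,TP53")
# Hypothetical gene -> pathway edges, shaped like what the fetch +
# reconciliation steps might produce:
edges = [("TP53", "p53 signaling"),
         ("TP53", "Apoptosis"),
         ("BRCA1", "DNA repair")]
print(central_genes(edges, top=1))  # the most connected gene in the toy data
```

In the real application, the edge list comes from KEGG/Reactome/UniProt/STRING-DB after LLM reconciliation, and the ranking feeds the insight-generation prompt.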
These instructions will help you get a copy of the project up and running on your local machine for development and testing purposes.
Before you begin, ensure you have the following installed on your system:
- Git: To clone the repository.
- Python 3.9+: To run the application.
- Ollama: To run the local Large Language Model.
- Download and install Ollama for your operating system (macOS, Linux, or Windows).
- After installing, you must pull at least one model. Open your terminal and run:
ollama pull gemma3:1b
- Default Model Note: `gemma3:1b` is a relatively small and manageable model, suitable for most modern laptops.
- Optional Powerful Model: For more detailed and nuanced insights, you can also pull a larger model:
ollama pull gpt-oss:120b-cloud
`gpt-oss:120b-cloud` is a very large model and requires significant computational resources (e.g., a powerful CPU and ample RAM, or a GPU) to run effectively. If you choose to use this model, you will need to update the `src/adapters/ollama_adapter.py` file to specify `gpt-oss:120b-cloud` as the model name.
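The exact contents of `src/adapters/ollama_adapter.py` depend on the repository, but an adapter around Ollama's local REST API (`POST /api/generate` on port 11434) typically centres on a single model-name constant. A minimal sketch — the constant and function names are illustrative assumptions, not the project's actual code:

```python
import json
import urllib.request

# Change this to the model you pulled (e.g. "gpt-oss:120b-cloud");
# the name must match the output of `ollama list` exactly.
MODEL_NAME = "gemma3:1b"
OLLAMA_URL = "http://localhost:11434/api/generate"

def build_payload(prompt: str, model: str = MODEL_NAME) -> dict:
    """Build the JSON body that Ollama's /api/generate endpoint expects."""
    return {"model": model, "prompt": prompt, "stream": False}

def generate(prompt: str, model: str = MODEL_NAME) -> str:
    """Send a prompt to the local Ollama server and return the response text."""
    req = urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(build_payload(prompt, model)).encode(),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["response"]
```

Calling `generate("Summarize the role of TP53")` requires the Ollama server to be running and the named model to be pulled; otherwise the request will fail with a connection or model-not-found error.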
Follow these steps to set up the project environment.
1. Clone the Repository
Open your terminal or command prompt and clone the repository to your local machine:
git clone https://github.com/Hami0095/bio-pathway-mapper.git
cd bio-pathway-mapper/biological_kg
2. Create and Activate a Python Virtual Environment
It's highly recommended to use a virtual environment to manage project dependencies.
- On macOS/Linux:
python3 -m venv .venv
source .venv/bin/activate
- On Windows:
python -m venv .venv
.venv\Scripts\activate
3. Install Dependencies
With your virtual environment activated, install the required Python packages:
pip install -r requirements.txt
Once the setup is complete, you can run the Streamlit application.
1. Ensure Ollama is Serving the Model
Make sure the Ollama application is running on your machine. You can check this by looking for the Ollama icon in your system's menu bar or taskbar.
2. Launch the Streamlit App
In your terminal (with the virtual environment still activated), run the following command:
streamlit run streamlit_app.py
This will open the application in a new tab in your default web browser. You can now start using the tool!
- Abdur Rehman - LinkedIn
We plan to create an "Automated Setup Agent" that will download Ollama, pull the model, and then launch both the Ollama server and the Streamlit app in the user's browser automatically. You can track the progress of this feature in Issue #1 (this is a placeholder link).