HRS

Health Recommendation System

A Flask-based web application that provides AI-powered health recommendations based on patient symptoms. The application supports multiple LLM providers (Google Gemini and OpenAI GPT) and uses RAG (Retrieval-Augmented Generation) to provide more accurate, context-aware medical recommendations by leveraging a curated medical knowledge base. It runs using Gunicorn as the WSGI server and can be deployed as a Docker container.

Features

  • Clean and responsive web interface
  • Patient symptoms input form with multiple fields (symptoms, duration, severity, additional info)
  • RAG (Retrieval-Augmented Generation): Enhanced recommendations using a medical knowledge base
  • Multiple LLM provider support: Choose between Google Gemini (default) or OpenAI GPT
  • Flexible provider configuration through environment variables
  • AI-powered health recommendations with configurable providers
  • Context-aware responses using vector-based document retrieval
  • Detailed output screen showing symptoms summary and AI-generated recommendations
  • Medical disclaimer for user safety
  • Production-ready with Gunicorn WSGI server
  • Dockerized for easy deployment

Prerequisites

  • Python 3.x, managed with Conda (recommended) or pip
  • An API key for Google Gemini or OpenAI (optional; without one the app runs in demo mode)
  • Docker (optional, for containerized deployment)

Running Locally

Using Conda (Recommended)

  1. Create and activate the conda environment:
conda env create -f environment.yml
conda activate hrs
  2. Set up environment variables:
cp .env.example .env
# Edit .env and configure your preferred LLM provider:
# - Set LLM_PROVIDER to either 'gemini' (default) or 'openai'
# - Add the corresponding API key (GEMINI_API_KEY or OPENAI_API_KEY)
  3. Run with the Flask development server:
python -m src.app
  4. Or run with Gunicorn (Linux only):
gunicorn --bind 0.0.0.0:5000 src.app:app
  5. Open your browser and navigate to http://localhost:5000

Using pip

  1. Install dependencies:
pip install -r requirements.txt
  2. Set up environment variables:
cp .env.example .env
# Edit .env and configure your preferred LLM provider:
# - Set LLM_PROVIDER to either 'gemini' (default) or 'openai'
# - Add the corresponding API key (GEMINI_API_KEY or OPENAI_API_KEY)
  3. Run with the Flask development server:
python -m src.app
  4. Or run with Gunicorn (Linux only):
gunicorn --bind 0.0.0.0:5000 src.app:app
  5. Open your browser and navigate to http://localhost:5000

Running with Docker

Using Conda-based Docker Image (Recommended)

  1. Build the Docker image:
docker build -f Dockerfile.conda -t hrs-flask-app .
  2. Run the container with your LLM provider configuration:
# Using Gemini (default)
docker run -d -p 8080:5000 -e GEMINI_API_KEY=your_api_key_here --name hrs hrs-flask-app

# Or using OpenAI
docker run -d -p 8080:5000 -e LLM_PROVIDER=openai -e OPENAI_API_KEY=your_api_key_here --name hrs hrs-flask-app
  3. Open your browser and navigate to http://localhost:8080
  4. Stop the container when finished:
docker stop hrs
docker rm hrs

Using pip-based Docker Image

  1. Build the Docker image:
docker build -t hrs-flask-app .
  2. Run the container with your LLM provider configuration:
# Using Gemini (default)
docker run -d -p 8080:5000 -e GEMINI_API_KEY=your_api_key_here --name hrs hrs-flask-app

# Or using OpenAI
docker run -d -p 8080:5000 -e LLM_PROVIDER=openai -e OPENAI_API_KEY=your_api_key_here --name hrs hrs-flask-app
  3. Open your browser and navigate to http://localhost:8080
  4. Stop the container when finished:
docker stop hrs
docker rm hrs

Project Structure

HRS/
├── src/                   # Source directory containing core modules
│   ├── app.py             # Flask web application (routes and web logic)
│   ├── LLM/                # LLM module containing all LLM-related functionality
│   │   ├── __init__.py         # Module initialization and exports
│   │   ├── llm_service.py     # LLM service module (provider factory, integration, prompts)
│   │   ├── llm_provider.py    # Abstract base class for LLM providers
│   │   ├── openai_provider.py # OpenAI provider implementation
│   │   ├── gemini_provider.py # Google Gemini provider implementation
│   │   └── llm_constants.py   # Shared constants and message templates
│   ├── EHR/               # Electronic Health Records module
│   ├── P-CAFE/            # P-CAFE module
│   └── RAG/               # RAG module for Retrieval-Augmented Generation
│       ├── __init__.py        # RAG module initialization and exports
│       ├── rag_service.py     # RAG service for document retrieval and context augmentation
│       └── medical_knowledge.py # Medical knowledge base for RAG
├── templates/
│   ├── index.html         # Patient symptoms input form
│   └── output.html        # AI recommendations display page
├── tests/                 # Unit tests
│   ├── __init__.py        # Tests module initialization
│   ├── test_llm_service.py # Unit tests for LLM service
│   ├── test_rag_service.py # Unit tests for RAG service
│   └── test_medical_knowledge.py # Unit tests for medical knowledge base
├── requirements.txt       # Python dependencies (pip)
├── environment.yml        # Conda environment specification
├── .env.example           # Environment variables template
├── Dockerfile             # Docker configuration (pip-based)
├── Dockerfile.conda       # Docker configuration (conda-based)
├── .dockerignore          # Docker ignore file
├── .gitignore             # Git ignore file
└── README.md              # This file

LLM Provider Configuration

The application supports multiple LLM providers, selected through a single environment variable.

Supported Providers

  1. Google Gemini (Default)

    • Model: gemini-2.5-flash
    • Configuration: Set LLM_PROVIDER=gemini (or omit, as it's the default)
    • API Key: GEMINI_API_KEY
  2. OpenAI GPT

    • Model: gpt-3.5-turbo
    • Configuration: Set LLM_PROVIDER=openai
    • API Key: OPENAI_API_KEY

Configuration Steps

  1. Copy the example environment file:

    cp .env.example .env
  2. Edit .env and set your preferred provider:

    # For Gemini (default)
    LLM_PROVIDER=gemini
    GEMINI_API_KEY=your_gemini_api_key_here
    
    # For OpenAI
    LLM_PROVIDER=openai
    OPENAI_API_KEY=your_openai_api_key_here
  3. The application automatically uses the configured provider; a minimal sketch of this selection logic follows.
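
A minimal sketch of environment-based provider selection, using stand-in classes (the real implementations live in src/LLM/gemini_provider.py and src/LLM/openai_provider.py; the names below are illustrative):

import os

class GeminiProvider:
    # Stand-in for the Gemini provider implementation.
    def __init__(self, api_key: str):
        self.api_key = api_key

class OpenAIProvider:
    # Stand-in for the OpenAI provider implementation.
    def __init__(self, api_key: str):
        self.api_key = api_key

def create_provider():
    # LLM_PROVIDER defaults to 'gemini', matching the defaults above.
    name = os.getenv("LLM_PROVIDER", "gemini").lower()
    if name == "openai":
        return OpenAIProvider(os.getenv("OPENAI_API_KEY", ""))
    return GeminiProvider(os.getenv("GEMINI_API_KEY", ""))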

Demo Mode

If no API key is configured, the application runs in demo mode, showing example responses instead of real AI-generated recommendations.
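
A minimal sketch of how such a check can look (illustrative names, not the repository's actual code):

import os

DEMO_RESPONSE = "Example recommendation (demo mode: no API key configured)."

def demo_mode_enabled() -> bool:
    # Demo mode is active when neither provider key is present.
    return not (os.getenv("GEMINI_API_KEY") or os.getenv("OPENAI_API_KEY"))

if demo_mode_enabled():
    print(DEMO_RESPONSE)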

RAG (Retrieval-Augmented Generation) Configuration

The application uses RAG to enhance health recommendations with relevant medical knowledge from a curated knowledge base.

How RAG Works

  1. Medical Knowledge Base: Contains curated medical information about common conditions, symptoms, diagnostic tests, and treatment approaches
  2. Vector Embeddings: Documents are converted to numerical vectors using sentence-transformers
  3. Semantic Search: When a user submits symptoms, the system retrieves the most relevant medical documents
  4. Context Augmentation: Retrieved information is added to the LLM prompt for more accurate recommendations
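
The sketch below walks through steps 2-4 with ChromaDB's Python client, which embeds documents with a sentence-transformers model by default; the collection name and sample documents are made up for this example:

import chromadb

# Persistent client stores vectors on disk (path configurable, see below).
client = chromadb.PersistentClient(path="./chromadb_data")
collection = client.get_or_create_collection("medical_knowledge")

# Index two toy documents; the real entries come from the curated knowledge base.
collection.add(
    ids=["doc1", "doc2"],
    documents=[
        "Influenza commonly presents with fever, cough, and body aches.",
        "Migraines are often one-sided and aggravated by light and noise.",
    ],
)

# Semantic search: retrieve the document most relevant to the user's symptoms.
results = collection.query(query_texts=["fever and persistent cough"], n_results=1)
context = "\n".join(results["documents"][0])

# Context augmentation: prepend the retrieved text to the LLM prompt.
prompt = f"Relevant medical context:\n{context}\n\nPatient symptoms: fever and persistent cough"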

RAG Configuration

RAG is enabled by default. You can configure it in your .env file:

# Enable/disable RAG functionality (default: true)
RAG_ENABLED=true

# ChromaDB storage path (default: ./chromadb_data)
CHROMADB_PATH=./chromadb_data

# Embedding model (default: all-MiniLM-L6-v2)
EMBEDDING_MODEL=all-MiniLM-L6-v2
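
Reading these settings in Python could look like this (a sketch, not the repository's actual loading code):

import os

RAG_ENABLED = os.getenv("RAG_ENABLED", "true").lower() == "true"
CHROMADB_PATH = os.getenv("CHROMADB_PATH", "./chromadb_data")
EMBEDDING_MODEL = os.getenv("EMBEDDING_MODEL", "all-MiniLM-L6-v2")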

RAG Features

  • Automatic Initialization: Medical knowledge base is automatically indexed on first run
  • Graceful Fallback: If RAG fails to initialize, the system continues to work normally without RAG (see the sketch after this list)
  • Semantic Retrieval: Uses vector similarity to find the most relevant medical information
  • Context-Aware Responses: LLM receives both the user query and relevant medical context
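
The graceful fallback amounts to treating RAG initialization as optional; a minimal sketch with a deliberately failing initializer (names are illustrative):

def init_rag():
    # Hypothetical initializer; in practice this loads the embedding model
    # and builds the ChromaDB index, either of which can fail offline.
    raise RuntimeError("embedding model download failed (example)")

rag_service = None
try:
    rag_service = init_rag()
except Exception as exc:
    # Any failure disables RAG only; the rest of the app keeps working.
    print(f"RAG unavailable, continuing without it: {exc}")

def build_prompt(symptoms: str) -> str:
    # Hypothetical helper: augment with retrieved context only when RAG is up.
    if rag_service is not None:
        return rag_service.augment(symptoms)
    return symptoms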

Embedding Models

The system supports various sentence-transformer models:

  • all-MiniLM-L6-v2 (default): Fast and efficient, good for most use cases
  • all-mpnet-base-v2: More accurate but slower
  • Any model from the sentence-transformers library
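
Swapping models is a one-line change with the sentence-transformers library, for example:

from sentence_transformers import SentenceTransformer

# Default model: small and fast. Use "all-mpnet-base-v2" for higher
# accuracy at the cost of speed.
model = SentenceTransformer("all-MiniLM-L6-v2")
embeddings = model.encode(["fever and persistent cough"])
print(embeddings.shape)  # (1, 384) for all-MiniLM-L6-v2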

Note: The first time RAG initializes, it will download the embedding model from HuggingFace. This requires internet access. In offline environments, RAG will gracefully disable itself and the system will function normally.

Testing

The project includes comprehensive unit tests using Python's unittest framework.

Running Tests

Run all tests:

python -m unittest discover -s tests -p "test_*.py" -v

Run a specific test file:

python -m unittest tests.test_llm_service -v

Run a specific test class:

python -m unittest tests.test_rag_service.TestRAGService -v

Test Coverage

The test suite includes:

  • test_llm_service.py: Tests for input sanitization, severity validation, and provider factory
  • test_rag_service.py: Tests for RAG service initialization, context retrieval, and prompt augmentation
  • test_medical_knowledge.py: Tests for medical knowledge base structure and content

All tests use mocks where appropriate to avoid external dependencies (ChromaDB, LLM APIs, etc.).
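
As a flavor of that mocking style, here is a small self-contained example in the same unittest idiom; the function under test is defined inline because the repository's internal names may differ:

import unittest
from unittest.mock import MagicMock

def recommend(provider, symptoms: str) -> str:
    # Toy function standing in for the real service logic.
    return provider.generate(f"Patient symptoms: {symptoms}")

class TestRecommend(unittest.TestCase):
    def test_recommend_calls_provider(self):
        provider = MagicMock()
        provider.generate.return_value = "Rest and hydrate."
        result = recommend(provider, "fever")
        provider.generate.assert_called_once_with("Patient symptoms: fever")
        self.assertEqual(result, "Rest and hydrate.")

if __name__ == "__main__":
    unittest.main()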

Usage

  1. Open the application in your browser
  2. Enter your symptoms in the input form:
    • Describe your symptoms
    • Specify how long you've had them
    • Rate the severity (1-10)
    • Add any additional relevant information (optional)
  3. Click "Get Health Recommendations"
  4. View AI-generated health recommendations on the output page
  5. Click "Enter New Symptoms" to input new symptoms
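
Behind the form, the web flow reduces to one POST route; a minimal Flask sketch under assumed field names (the actual route code in src/app.py may differ):

from flask import Flask, render_template, request

app = Flask(__name__)

@app.route("/", methods=["GET", "POST"])
def index():
    if request.method == "POST":
        # Field names mirror the form described above (assumed).
        symptoms = request.form.get("symptoms", "")
        duration = request.form.get("duration", "")
        severity = request.form.get("severity", "")
        extra = request.form.get("additional_info", "")
        recommendation = "..."  # produced by the configured LLM provider
        return render_template(
            "output.html",
            symptoms=symptoms,
            duration=duration,
            severity=severity,
            additional_info=extra,
            recommendation=recommendation,
        )
    return render_template("index.html")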

Important Disclaimer

This application provides general health information and recommendations generated by AI. It is NOT a substitute for professional medical advice, diagnosis, or treatment. Always consult with a qualified healthcare provider for proper medical care.
