Skip to content

EESI/MicroTraitLLM

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

21 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

MicroTraitLLM

A Retrieval Augmented Generation (RAG) tool for querying prokaryote functional traits, grounded in the PubMedCentral Open Access Subset to reduce hallucination and improve answer reliability.

Background

Single cell sequencing and metagenomics have generated vast amounts of prokaryotic genomic data — far more than researchers can manually process. While LLMs offer powerful capabilities for information retrieval and synthesis in bioinformatics, they are prone to hallucination. MicroTraitLLM addresses both challenges by combining LLM inference with retrieval from curated biomedical literature.

Installation

git clone https://github.com/EESI/MicroTraitLLM.git
cd MicroTraitLLM
pip install -r requirements.txt

Python 3.8+ recommended. Consider using a virtual environment:

python -m venv venv
source venv/bin/activate  # Windows: venv\Scripts\activate

Usage

python main.py

or

waitress-serve --host=127.0.0.1 --port=8080 main:app

Note: You must use the 'remote' option if not downloading the PubMedCentral Open Access Subset.

License

MIT

About

A retrieval-augmented generation (RAG) LLM which utilizes zero-shot and single-shot prompting to give specific, citation-based answers for researchers.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors