Parallel Monte Carlo Tree Search with MPI and Groq

Enhancing SLM responses using distributed Monte Carlo Tree Search

Overview

This project implements a novel approach to improving LLM responses using a parallel Monte Carlo Tree Search (MCTS) algorithm powered by MPI (Message Passing Interface). By distributing the search process across multiple cores, we can efficiently explore and evaluate different response variations to find optimal answers.

Features

Parallel MCTS Implementation: Utilizes MPI for distributed processing
Groq Integration: Leverages llama-3.2-1b-preview for response generation and evaluation
Adaptive Response Improvement: Iteratively enhances answers through tree exploration
Detailed Logging: Comprehensive logging system using Loguru
Score-based Evaluation: Quantitative assessment of response quality

Installation

Clone the repository:

git clone https://github.com/yourusername/MCTS_distributed.git
cd parallel-mcts-llm

Create and activate a virtual environment:

python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

Install dependencies:

pip install mpi4py openai numpy loguru

Set up your OpenAI API key:

GROQ_API_KEY=YOUR_GROQ_API_KEY
GROQ_API_BASE_URL=https://api.groq.com/openai/v1

Usage

Run the program with MPI:

mpiexec -n 4 python app.py

The -n 4 parameter specifies the number of processes:

Process 0: Root process (manages the search tree)
Processes 1-3: Worker processes (evaluate responses)

How It Works

Initialization
- Starts with a basic "I don't know" response
- Creates initial tree structure
MCTS Process
- Selection: Choose promising nodes using UCT
- Expansion: Generate new response variations
- Simulation: Evaluate responses in parallel
- Backpropagation: Update scores through the tree

Parallel Processing

Root Process (0)               Worker Processes (1-3)
├── Manages tree              ├── Rate responses
├── Coordinates search        ├── Evaluate variations
└── Collects results          └── Send scores back

Performance

The parallel implementation offers significant advantages:

Evaluates multiple responses simultaneously
Reduces total processing time
Scales with available cores

Configuration

Key parameters that can be tuned:

num_iterations: Number of MCTS iterations
C: Exploration constant in UCT calculation
Number of parallel processes

Example

Question: "What is quantum computing?"

Initial Response: "I don't know"

Final Response: "Quantum computing is a type of computation that harnesses quantum mechanical phenomena like superposition and entanglement to process information. Unlike classical computers that use bits (0 or 1), quantum computers use quantum bits or 'qubits' that can exist in multiple states simultaneously. This allows quantum computers to solve certain problems exponentially faster than classical computers, particularly in areas like cryptography, drug discovery, and complex optimization problems."

Score: 92/100

Contributing

Fork the repository
Create a feature branch
Commit your changes
Push to the branch
Create a Pull Request

Future Improvements

Variability and error in JSON responses
Add API support for monte carlo
Integrate with other LLM providers
Add visualization of the search tree
Implement caching for API calls
Add unit tests and benchmarks

References

Monte Carlo Tree Search: Wikipedia
MPI for Python: mpi4py Documentation

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
.gitignore		.gitignore
README.md		README.md
app.py		app.py
helper.py		helper.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Parallel Monte Carlo Tree Search with MPI and Groq

Overview

Features

Installation

Usage

How It Works

Performance

Configuration

Example

Contributing

Future Improvements

References

About

Uh oh!

Releases

Packages

Uh oh!

Languages

ighoshsubho/MCTS_distributed

Folders and files

Latest commit

History

Repository files navigation

Parallel Monte Carlo Tree Search with MPI and Groq

Overview

Features

Installation

Usage

How It Works

Performance

Configuration

Example

Contributing

Future Improvements

References

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages