🚀 Groq - Evals

Compare and evaluate different Groq models with ease

🌟 Overview

Groq Model Evaluator is a powerful web application that allows you to compare and evaluate different Groq language models side by side. Built with Next.js and FastAPI, it provides an intuitive interface for testing model performance, analyzing responses, and making data-driven decisions about which model best suits your needs.

✨ Key Features

🔄 Side-by-side model comparison
🤖 Automated reasoning about model performance
🎨 Beautiful, responsive UI
🔑 Secure API key management Yet to come:
📈 Visual metric representations
📊 Comprehensive evaluation metrics
🎯 Semantic similarity analysis

🚀 Getting Started

Prerequisites

Python 3.8+
Node.js 18+
Groq API key (Get one here)

Installation

Clone the repository:

git clone https://github.com/yourusername/groq-evals.git
cd groq-evals

Install backend dependencies:

cd backend
python -m venv venv
source venv/bin/activate  # On Windows: .\venv\Scripts\activate
pip install -r requirements.txt

Install frontend dependencies:

cd frontend
npm install

Configuration

Create a .env file in the backend directory:

AVAILABLE_MODELS=["gemma2-9b-it", "llama-3.1-8b-instant", "mixtral-8x7b-32768"]
EVALUATION_MODELS=["deepseek-r1-distill-llama-70b"]

Running the Application

Start the backend server:

cd backend
uvicorn app.main:app --reload

Start the frontend development server:

cd frontend
npm run dev

Open http://localhost:3000 in your browser

📊 Features

Model Comparison

Compare responses from different Groq models side by side
Automatic evaluation of response quality
Detailed reasoning for model selection

🤝 Contributing

We welcome contributions! Here's how you can help:

Fork the repository
Create a feature branch:

git checkout -b feature/amazing-feature

Commit your changes:

git commit -m 'Add amazing feature'

Push to your branch:

git push origin feature/amazing-feature

Open a Pull Request

Development Guidelines

Follow the existing code style
Add comments for complex logic
Update documentation as needed
Add tests for new features
Ensure all tests pass before submitting PR

📝 License

This project is licensed under the MIT License

🙏 Acknowledgments

Groq for their amazing API
The open-source community for inspiration and tools
All contributors who help improve this project

📬 Contact

Have questions? Need help? Feel free to:

Open an issue
Start a discussion
Reach out to maintainers

Made with ❤️ by Adi

Name		Name	Last commit message	Last commit date
Latest commit History 37 Commits
backend		backend
frontend		frontend
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🚀 Groq - Evals

🌟 Overview

✨ Key Features

🚀 Getting Started

Prerequisites

Installation

Configuration

Running the Application

📊 Features

Model Comparison

🤝 Contributing

Development Guidelines

📝 License

🙏 Acknowledgments

📬 Contact

About

Uh oh!

Releases

Packages

Languages

adisinghstudent/Groq-Evals

Folders and files

Latest commit

History

Repository files navigation

🚀 Groq - Evals

🌟 Overview

✨ Key Features

🚀 Getting Started

Prerequisites

Installation

Configuration

Running the Application

📊 Features

Model Comparison

🤝 Contributing

Development Guidelines

📝 License

🙏 Acknowledgments

📬 Contact

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages