AI Product Evaluation Pipeline

This is a code for Streamlit-based application that implements the evaluation pipeline for Ideas Recall telegram bot and fully described in this article https://vvk93.substack.com/p/building-an-ai-powered-telegram-bot This evaluation pipeline was used to test prompts and AI capabilities and was later used for real telegram bot with some minor changes.

Overview

This project implements an automated evaluation pipeline that:

Downloads transcripts from YouTube videos
Generates summaries and flashcards using AI
Evaluates the quality of generated content through multiple stages
Provides detailed metrics and human feedback capabilities

Main Features

1. YouTube Transcript Processing

Automatic transcript download from YouTube videos
Support for various video formats and languages

2. AI-Powered Content Generation

Generates concise summaries from video transcripts
Creates educational flashcards for key concepts
Uses configurable AI models for generation

3. Multi-Stage Evaluation Pipeline

Stage 1: Automated Checks
- JSON format validation
- Length verification
- BERTScore semantic similarity check
Stage 2: AI Judge Assessment
- Accuracy evaluation
- Completeness scoring
- Relevance assessment
- Clarity measurement
Stage 3: Human Feedback
- User utility rating system
- Interactive feedback collection

4. Comprehensive Dashboard

Real-time pipeline execution status
Detailed evaluation metrics
Historical run logs
Configuration management
Raw output inspection

Technical Stack

Frontend: Streamlit
AI Models: OpenAI API
Video Processing: YouTube API
Evaluation Metrics: BERTScore, Custom AI Judge

Setup

Clone the repository
Install dependencies:
```
pip install -r requirements.txt
```
Set up environment variables:
- OPENAI_API_KEY: Your OpenAI API key
- GOOGLE_API_KEY: Your Google API key
Run the application:
```
streamlit run app.py
```

Configuration

The application allows customization of:

AI model selection
Evaluation thresholds
Token limits
System prompts
Target scores

Usage

Enter a YouTube URL in the sidebar
Click "Download & Run Pipeline"
View results across multiple tabs:
- Configuration & Prompts
- Generated Output
- Evaluation Results
- Run Log & History

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
.github		.github
__pycache__		__pycache__
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
app.py		app.py
config.py		config.py
evaluation_metrics.py		evaluation_metrics.py
llm_interface.py		llm_interface.py
pipeline.log		pipeline.log
requirements.txt		requirements.txt
youtube_handler.py		youtube_handler.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AI Product Evaluation Pipeline

Overview

Main Features

1. YouTube Transcript Processing

2. AI-Powered Content Generation

3. Multi-Stage Evaluation Pipeline

4. Comprehensive Dashboard

Technical Stack

Setup

Configuration

Usage

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

License

VVK93/IdeaRecallPipeline

Folders and files

Latest commit

History

Repository files navigation

AI Product Evaluation Pipeline

Overview

Main Features

1. YouTube Transcript Processing

2. AI-Powered Content Generation

3. Multi-Stage Evaluation Pipeline

4. Comprehensive Dashboard

Technical Stack

Setup

Configuration

Usage

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages