Internal R&D Prototype | Engineering Team
This prototype explores whether a Retrieval-Augmented Generation (RAG) system can help reduce Support team workload by providing accurate, context-aware answers to common customer inquiries.
The system retrieves relevant information from our internal knowledge base (FAQs and escalation notes) and uses it to generate responses via an LLM.
🔬 Experimental - This is an exploratory prototype for internal evaluation.
⚠️ Note: This project contains `TODO` stubs that must be implemented before it will run. See Implementation Tasks below.
- Python 3.11+
- OpenAI API key
1. Open this repository in GitHub Codespaces (recommended) or clone it locally.

2. Install dependencies:

   ```bash
   pip install -r requirements.txt
   ```

3. Set your OpenAI API key (choose one method):

   ```bash
   # Option A: Environment variable
   export OPENAI_API_KEY="your-api-key-here"

   # Option B: Create a .env or .env.local file
   echo 'OPENAI_API_KEY=your-api-key-here' > .env.local
   ```

4. Run the evaluation:

   ```bash
   python run_eval.py
   ```
```text
├── data/
│   ├── faq.json                 # Public FAQ entries
│   ├── escalation_notes.json    # Internal support escalation notes
│   └── eval_questions.json      # Evaluation question set with reference answers
├── src/
│   ├── config.py                # Configuration settings
│   ├── data_loader.py           # Document loading utilities
│   ├── llm_client.py            # OpenAI API wrapper
│   ├── embeddings.py            # Text embedding functions [EDITABLE]
│   ├── retrieval.py             # Document retrieval logic [EDITABLE]
│   └── rag_pipeline.py          # Main RAG pipeline [EDITABLE]
├── run_eval.py                  # Evaluation runner script
└── requirements.txt
```
The following modules contain `TODO` markers indicating where implementation is needed. An illustrative sketch follows each group below.
**src/embeddings.py**
- `embed_texts()`: Convert text strings to vector embeddings
- `embed_documents()`: Embed all documents in the knowledge base
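A minimal sketch of what these two functions could look like, assuming knowledge-base documents are dicts with a `text` field. It calls the `openai` SDK directly rather than the project's `llm_client.py` wrapper, whose interface may differ:

```python
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

def embed_texts(texts: list[str], model: str = "text-embedding-3-small") -> list[list[float]]:
    """Convert a batch of text strings into embedding vectors."""
    response = client.embeddings.create(model=model, input=texts)
    # The API returns one embedding per input string, in the same order.
    return [item.embedding for item in response.data]

def embed_documents(documents: list[dict]) -> list[list[float]]:
    """Embed every document in the knowledge base (assumes a 'text' field)."""
    return embed_texts([doc["text"] for doc in documents])
```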
**src/retrieval.py**
- `cosine_similarity()`: Compute similarity between query and documents
- `retrieve()`: Find and return the most relevant documents
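One possible shape for these stubs, using a pure-Python cosine similarity (no NumPy dependency assumed):

```python
import math

def cosine_similarity(a: list[float], b: list[float]) -> float:
    """dot(a, b) / (|a| * |b|); returns 0.0 if either vector is zero."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def retrieve(query_embedding: list[float],
             doc_embeddings: list[list[float]],
             documents: list[dict],
             top_k: int = 3) -> list[dict]:
    """Return the top_k documents most similar to the query embedding."""
    scored = sorted(
        zip(documents, doc_embeddings),
        key=lambda pair: cosine_similarity(query_embedding, pair[1]),
        reverse=True,
    )
    return [doc for doc, _ in scored[:top_k]]
```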
**src/rag_pipeline.py**
- `build_context()`: Format retrieved documents into context
- `build_prompt()`: Create the LLM prompt with context and question
- `answer_question()`: Orchestrate the full RAG pipeline
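A sketch of how these pieces could fit together, reusing `embed_texts()`, `retrieve()`, and the `client` from the sketches above. The prompt wording and chat-completion call are illustrative, not the project's actual implementation:

```python
def build_context(documents: list[dict], max_chars: int = 4000) -> str:
    """Concatenate retrieved documents, truncated to the context budget."""
    context = "\n\n".join(doc["text"] for doc in documents)
    return context[:max_chars]

def build_prompt(context: str, question: str) -> str:
    """Combine retrieved context and the user question into one prompt."""
    return (
        "Answer the customer question using only the context below.\n\n"
        f"Context:\n{context}\n\nQuestion: {question}\nAnswer:"
    )

def answer_question(question: str,
                    documents: list[dict],
                    doc_embeddings: list[list[float]]) -> str:
    """Embed the query, retrieve top documents, and generate an answer."""
    query_embedding = embed_texts([question])[0]
    top_docs = retrieve(query_embedding, doc_embeddings, documents)
    prompt = build_prompt(build_context(top_docs), question)
    response = client.chat.completions.create(
        model="gpt-4o-mini",
        messages=[{"role": "user", "content": prompt}],
    )
    return response.choices[0].message.content
```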
Edit `src/config.py` to adjust:

| Setting | Default | Description |
|---|---|---|
| `embedding_model` | `text-embedding-3-small` | OpenAI embedding model |
| `llm_model` | `gpt-4o-mini` | LLM for response generation |
| `top_k` | `3` | Number of documents to retrieve |
| `retrieval_strategy` | `cosine` | Similarity metric |
| `max_context_length` | `4000` | Maximum context characters |
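As an illustration, these settings could map onto a dataclass like the following; the real `src/config.py` may use plain module-level constants instead:

```python
from dataclasses import dataclass

@dataclass
class RAGConfig:
    embedding_model: str = "text-embedding-3-small"
    llm_model: str = "gpt-4o-mini"
    top_k: int = 3
    retrieval_strategy: str = "cosine"
    max_context_length: int = 4000
```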
The evaluation script (`run_eval.py`) tests the RAG pipeline against 10 predefined questions covering:
- Easy FAQ lookups
- Medium-complexity support scenarios
- Hard edge cases requiring internal knowledge
Reference answers are provided for qualitative comparison.
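For orientation, the evaluation loop might look roughly like the sketch below, reusing the functions from the earlier sketches. The JSON field names (`question`, `reference_answer`) are assumptions about the data files, not a confirmed schema:

```python
import json

# Load both knowledge-base files into one document list.
documents = []
for path in ("data/faq.json", "data/escalation_notes.json"):
    with open(path) as f:
        documents.extend(json.load(f))

doc_embeddings = embed_documents(documents)  # from the embeddings sketch

with open("data/eval_questions.json") as f:
    questions = json.load(f)

for item in questions:
    generated = answer_question(item["question"], documents, doc_embeddings)
    print(f"Q: {item['question']}")
    print(f"Generated: {generated}")
    print(f"Reference: {item['reference_answer']}\n")
```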
| Source | Count | Description |
|---|---|---|
| FAQ | 10 | Public customer-facing answers |
| Escalation Notes | 10 | Internal support procedures |
After implementing the core components:
- Run `python run_eval.py` to test the pipeline
- Review generated answers against reference answers
- Experiment with configuration changes
- Document findings for team review
AcmeCloud Engineering | Internal Use Only