An AI-powered, agent-based chatbot designed to elevate customer service in coffee shop apps. Built with LLMs, NLP, and RAG, this chatbot can take orders, answer menu questions, provide personalized recommendations, and ensure safe interactions.
A user interacts with a mobile application (Firebase-powered). Their queries are sent to an API Endpoint which is backed by a multi-agent architecture, using Serverless LLaMA 3.1, a vector database (Pinecone), and trained recommendation models.
- All user input hits this central endpoint.
- The Agent Coordinator orchestrates which downstream agent will handle the query.
- A mobile app UI (possibly built using Flutter or React Native).
- The app uses Firebase for authentication, analytics, or real-time syncing.
- The user can browse items (e.g., coffee products), ask for recommendations, or place orders.
- Acts as a content filter or safety check.
- It checks whether the user’s query is safe or appropriate (e.g., no harmful or off-topic queries).
- Sends a response back directly to the user.
- Passes the query to the Input Classifier Agent.
- Classifies the query into types:
Order,Recommendation, orDetails. - Routes the query to the corresponding agent.
- Order Agent: Handles user orders.
- Recommendation Agent:
- Uses a trained model or Serverless LLaMA 3.1 to understand preferences.
- Integrates with Pinecone vector database for semantic search and contextual recommendations.
- Details Agent: Provides more details about a specific product.
- Gathers the results from the chosen task agent.
- Sends the response back to the user app.
- Serverless LLaMA 3.1 powers natural language understanding and generation.
- Pinecone is used for retrieval-augmented generation (RAG):
- Stores vector embeddings of items (or past queries).
- Enables fast, relevant semantic searches to power the recommendation system.
- User query → API → Guard Agent → Input Classifier
- Input Classifier → [Order | Recommendation | Details Agent]
- Agent (optionally) queries LLaMA + Pinecone + trained models
- Response Agent sends back output to user
- AI-powered personalized recommendations.
- Natural language food ordering (e.g., "Get me something with caramel").
- Product info (e.g., "Tell me more about the Pumpkin Latte").
- Moderation of inappropriate queries.