This project aims to enable interactive voice conversations with Pete using integrated Automatic Speech Recognition (ASR), a Large Language Model (LLM) for dialogue, and Text-to-Speech (TTS). The goal is to orchestrate a seamless conversation loop where users can speak to Pete, have their speech transcribed, receive intelligent responses, and hear Pete reply in natural-sounding speech.
Component Status Updates:
- Text-to-Speech (TTS):
  - The TTS module is working for the most part; it is currently falling back to espeak because the forebrain is either down or not running the correct version of our custom coqui streamer.
  - That is good enough for now, but getting the coqui extension running is a priority.
  - Goal: achieve voice synthesis with just the motherbrain, with graceful degradation if extensions are unavailable (see the fallback sketch after this section).
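A minimal sketch of that fallback behavior in Python; `coqui_stream_tts` is a hypothetical stand-in for our custom coqui streamer on the forebrain (its API and endpoint are assumptions), and the fallback simply shells out to the espeak CLI:

```python
import subprocess

def coqui_stream_tts(text: str) -> None:
    # Hypothetical client for our custom coqui streamer on the forebrain;
    # raises when the service is down or running the wrong version.
    raise ConnectionError("coqui streamer unavailable")

def speak(text: str) -> None:
    """Synthesize speech, degrading gracefully to espeak if the streamer is unavailable."""
    try:
        coqui_stream_tts(text)  # preferred: streamed coqui voice
    except Exception:
        # Graceful degradation: local espeak on the motherbrain
        subprocess.run(["espeak", text], check=False)
```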
- Chat Module:
  - Stubbed but untested; based on a previous working version.
  - Handles conversation tracking and currently uses ollama.
  - Future direction: integrate our own LLM model runner, with fallback to ollama for graceful degradation (a sketch of the current ollama path follows this section).
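A minimal sketch of the conversation-tracking path, assuming the `ollama` Python client and a placeholder model name (the actual model, and the future model-runner fallback, are not pinned down yet):

```python
import ollama  # assumes the `ollama` Python client; the HTTP API would also work

class Chat:
    """Tracks conversation history and generates replies via ollama."""

    def __init__(self, model: str = "llama3"):  # model name is a placeholder assumption
        self.model = model
        self.history = []

    def reply(self, user_text: str) -> str:
        self.history.append({"role": "user", "content": user_text})
        response = ollama.chat(model=self.model, messages=self.history)
        answer = response["message"]["content"]
        self.history.append({"role": "assistant", "content": answer})
        return answer
```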
- Automatic Speech Recognition (ASR):
  - Mostly implemented, with a graceful degradation path to local faster-whisper.
  - Relies on our own ASR-over-websocket service, which is based on an older prototype and is currently untested and not yet integrated (a fallback sketch follows this section).
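A minimal sketch of that degradation path, assuming a hypothetical `transcribe_remote` client for the ASR-over-websocket service and the faster-whisper library locally (model size and device are assumptions):

```python
from faster_whisper import WhisperModel

def transcribe_remote(audio_path: str) -> str:
    # Hypothetical client for our ASR-over-websocket service; the endpoint and
    # protocol are assumptions until the older prototype is integrated.
    raise ConnectionError("ASR websocket service unavailable")

def transcribe(audio_path: str) -> str:
    """Transcribe speech, degrading to local faster-whisper if the service is down."""
    try:
        return transcribe_remote(audio_path)
    except Exception:
        model = WhisperModel("base", device="cpu")  # model size/device are assumptions
        segments, _info = model.transcribe(audio_path)
        return " ".join(segment.text.strip() for segment in segments)
```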
Key Objectives (unchanged):
- Integrate ASR to capture and transcribe user speech
- Connect transcribed text to LLM for generating responses
- Integrate TTS to convert LLM responses to speech
- Orchestrate the full conversation workflow (a combined sketch follows this list)
- Test and iterate on voice interaction quality
- Document the design and technology stack
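To make the workflow concrete, here is a rough sketch that composes the component sketches above (`transcribe`, `Chat`, `speak`); `record_until_silence` is a hypothetical audio-capture helper, not part of any existing module:

```python
def record_until_silence() -> str:
    # Hypothetical microphone-capture helper; returns the path of a recorded WAV file.
    raise NotImplementedError("audio capture is not sketched here")

def conversation_loop() -> None:
    """Ties the component sketches together: ASR -> LLM -> TTS."""
    chat = Chat()
    while True:
        audio_path = record_until_silence()
        text = transcribe(audio_path)  # ASR, or local faster-whisper fallback
        if not text.strip():
            continue                   # nothing intelligible; keep listening
        reply = chat.reply(text)       # LLM dialogue via ollama
        speak(reply)                   # TTS, or espeak fallback
```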
Sub-issues will track each technology component and implementation step.