-
Notifications
You must be signed in to change notification settings - Fork 9
[Enhancement] Fully continuous conversation #69
Description
The project is already excellent and very stable overall, but there's one feature I'd love to have:true continuous conversation — the ability to keep talking even when the agent's last response didn't end with a question.
Currently, after LLM's response, both states can occur, and there's no way to tell whether listening is in progress without looking at the screen. One can only guess, which is especially critical for languages without a clear structure for interrogative sentences.
I noticed that this option already exists in one of the forks, but I wasn't happy with its stability/reliability.
I also have my own ready-made logic for this feature that I've been successfully using on ESP32 since last summer. The implementation is actually quite straightforward — the only complication was having to add extra checks after introducing two wake words.
In short:
During the intent_progress phase, check if the agent's response ends with a question mark.
If both conditions are met (in intent_progress + ends with question), automatically start listening immediately after TTS completes.
Additionally, I'd suggest make the listening stage timeout configurable, and cancel it if VAD triggers.
The main ww should restart the session
The stop ww should interrupt any actions