Skip to content

[Enhancement] Fully continuous conversation #69

@mitrokun

Description

@mitrokun

The project is already excellent and very stable overall, but there's one feature I'd love to have:true continuous conversation — the ability to keep talking even when the agent's last response didn't end with a question.
Currently, after LLM's response, both states can occur, and there's no way to tell whether listening is in progress without looking at the screen. One can only guess, which is especially critical for languages ​​without a clear structure for interrogative sentences.

I noticed that this option already exists in one of the forks, but I wasn't happy with its stability/reliability.
I also have my own ready-made logic for this feature that I've been successfully using on ESP32 since last summer. The implementation is actually quite straightforward — the only complication was having to add extra checks after introducing two wake words.

In short:
During the intent_progress phase, check if the agent's response ends with a question mark.
If both conditions are met (in intent_progress + ends with question), automatically start listening immediately after TTS completes.
Additionally, I'd suggest make the listening stage timeout configurable, and cancel it if VAD triggers.
The main ww should restart the session
The stop ww should interrupt any actions

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions