Conversation

@richiejp (Owner)

Eventually, Assistant Mode will let you control your desktop with natural speech and also allow a VLM to describe what is on the desktop. We can use MCP servers (tool calls) or a VLM that can locate the coordinates of items on the desktop and click them.

Initially, though, this PR just allows you to speak with an LLM using two-way audio over the OpenAI Realtime API.

For LocalAI support this depends on mudler/LocalAI#6245, which implements the conversational parts of the API before we move on to the tool calls and multi-modal support needed for a full desktop assistant.
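
For reference, below is a minimal sketch of the kind of client-side event flow this targets: a two-way audio exchange over the Realtime WebSocket protocol. The endpoint URL, model name, and API key are placeholders (a LocalAI deployment would substitute its own once the linked PR lands), the event names assume the publicly documented OpenAI Realtime beta events, and it uses the `websocket-client` library for brevity; the actual Assistant Mode code may look quite different.

```python
# Minimal sketch of a two-way audio exchange over a Realtime-style WebSocket API.
# Endpoint, model, key, and the exact event names are assumptions based on the
# documented OpenAI Realtime beta protocol, not this PR's implementation.
import base64
import json

import websocket  # pip install websocket-client

URL = "wss://api.openai.com/v1/realtime?model=gpt-4o-realtime-preview"  # or a LocalAI endpoint
API_KEY = "sk-..."  # placeholder

ws = websocket.create_connection(
    URL,
    header=[f"Authorization: Bearer {API_KEY}", "OpenAI-Beta: realtime=v1"],
)

# Configure the session for audio in and audio out.
ws.send(json.dumps({
    "type": "session.update",
    "session": {"modalities": ["audio", "text"], "voice": "alloy"},
}))

# Stream captured microphone audio (16-bit PCM) into the input buffer,
# commit it, and ask the model for a spoken response.
pcm_chunk = b"\x00\x00" * 1600  # stand-in for a real capture buffer
ws.send(json.dumps({
    "type": "input_audio_buffer.append",
    "audio": base64.b64encode(pcm_chunk).decode(),
}))
ws.send(json.dumps({"type": "input_audio_buffer.commit"}))
ws.send(json.dumps({"type": "response.create"}))

# Read events until the response finishes, collecting audio deltas for playback.
audio_out = bytearray()
while True:
    event = json.loads(ws.recv())
    if event["type"] == "response.audio.delta":
        audio_out.extend(base64.b64decode(event["delta"]))
    elif event["type"] == "response.done":
        break

ws.close()
```

In practice the capture and playback would run continuously rather than as a single buffered turn, but the session/append/commit/response event loop above is the conversational core that the LocalAI backend needs to serve.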
