Improved Current-Word Detection

As the user speaks a dialogue phrase, we highlight the word they are currently on, both to make the UI feel responsive and to detect when they have finished speaking. This detection has to be very low-latency and light on computational resources. [Currently, we use the Web Speech API](https://github.com/KoelLabs/server/blob/06e4271c1865dcb9597d5d8ea235af8c3162aa8f/src/static/FeedbackGiver.js#L215). However, it is [only supported on Chrome and Safari](https://caniuse.com/?search=web%20speech%20api).
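
For illustration, a minimal sketch of how the client could check whether the Web Speech API is available before choosing a detection path. The function name `pickRecognitionBackend` and the `"web-speech"` / `"fallback"` labels are hypothetical and not taken from `FeedbackGiver.js`; only the `SpeechRecognition` and `webkitSpeechRecognition` globals come from the API itself.

```javascript
// Hypothetical helper: decide which recognition path to use.
// Pass `window` in the browser; a plain object works for testing.
function pickRecognitionBackend(globalObj) {
  // Some browsers expose the constructor only under the webkit prefix.
  const Ctor = globalObj.SpeechRecognition || globalObj.webkitSpeechRecognition;
  return Ctor
    ? { kind: "web-speech", Ctor }
    : { kind: "fallback", Ctor: null };
}
```

In a browser, `pickRecognitionBackend(window)` would return the constructor to instantiate when the API exists; setting `interimResults = true` on the resulting recognizer is what makes partial transcripts available while the user is still speaking.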

We need broader browser support through a fallback, either to a small local model or to one running remotely (on our server or on a cloud service such as Azure/GCP). A small local model might be ideal for low latency, especially since we already know which words the user is trying to say: we don't need a full transcription model, just one that can detect when a given word has been said.
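
To make the "we know the target words" idea concrete, here is a rough sketch of tracking the current word by matching whatever the recognizer has heard so far against the expected phrase. It is independent of which recognizer produces the words. The names `normalize` and `currentWordIndex` and the `lookahead` parameter are assumptions for illustration, not existing code.

```javascript
// Lowercase and strip punctuation so "Today?" matches "today".
function normalize(word) {
  return word.toLowerCase().replace(/[^\p{L}\p{N}']/gu, "");
}

// Returns the index of the next word the speaker is expected to say.
// A result equal to targetWords.length means the phrase is finished.
function currentWordIndex(targetWords, heardWords, lookahead = 2) {
  const target = targetWords.map(normalize);
  let pos = 0;
  for (const heard of heardWords.map(normalize)) {
    if (heard === "") continue;
    // Check the current position and a few words ahead, so a single
    // skipped or misrecognized word does not stall the highlight.
    for (let k = 0; k <= lookahead && pos + k < target.length; k++) {
      if (target[pos + k] === heard) {
        pos = pos + k + 1;
        break;
      }
    }
  }
  return pos;
}
```

For example, with the target `["How", "are", "you", "today?"]` and heard words `["how", "are"]`, the function returns `2`, so the UI would highlight "you". A greedy matcher like this is cheap enough to run on every interim result; whether it is robust enough compared with a dedicated keyword-spotting model is part of what this task needs to evaluate.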

This task will involve some research and evaluation of the best approach.
