Optimized Needleman-Wunsch Alignment Algorithm #4

Providing pronunciation feedback involves aligning a target phonemic sequence (the sounds in the actor's reference dialogue) with the user's phonemic sequence (the sounds the user makes when trying to mimic that dialogue). An appropriate algorithm for this is [Needleman-Wunsch](https://en.wikipedia.org/wiki/Needleman%E2%80%93Wunsch_algorithm). We currently have [a Python implementation of this](https://github.com/KoelLabs/server/blob/06e4271c1865dcb9597d5d8ea235af8c3162aa8f/src/phoneme_utils.py#L143) that runs after the user's phonemic sequence has been transcribed. We want it to be as fast as possible so that feedback arrives with low latency. Here are some potential solutions to try out:

- Implement it in C, or use an existing C implementation, and call that from Python
- Explore alternative algorithms with better than $O(nm)$ time and space complexity
- Explore a streaming version that updates the alignment as the user's phonemic sequence is being transcribed, rather than waiting until the end (this would also avoid recomputing the full alignment every time the transcription changes and the word colorings need updating)
- Explore calculating it client-side (in JavaScript/WASM) to avoid network delays (the code would go in `FeedbackGiver.js`)
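To make the streaming idea concrete, here is a minimal sketch of an incremental Needleman-Wunsch aligner. It is not the existing implementation: the class name `StreamingAligner`, its methods, and the scoring values (match `+1`, mismatch `-1`, gap `-1`) are all assumptions chosen for illustration, and the real scoring in `phoneme_utils.py` may differ. The idea is that the target sequence is fixed, so each newly transcribed user phoneme only adds one DP column, which costs $O(n)$ for a target of length $n$ instead of recomputing the whole table.

```python
from typing import List, Optional, Tuple


class StreamingAligner:
    """Hypothetical incremental Needleman-Wunsch: one DP column per user phoneme."""

    def __init__(self, target: List[str], match: int = 1,
                 mismatch: int = -1, gap: int = -1) -> None:
        # Scoring values are illustrative assumptions, not the project's values.
        self.target = list(target)
        self.match, self.mismatch, self.gap = match, mismatch, gap
        self.user: List[str] = []
        # Column 0: target prefixes aligned against an empty user sequence.
        self.columns = [[i * gap for i in range(len(self.target) + 1)]]

    def push(self, phoneme: str) -> int:
        """Add one user phoneme and return the current global alignment score."""
        prev = self.columns[-1]
        col = [len(self.columns) * self.gap]
        for i, t in enumerate(self.target, start=1):
            sub = self.match if t == phoneme else self.mismatch
            col.append(max(prev[i - 1] + sub,     # align t with phoneme
                           prev[i] + self.gap,    # phoneme has no target partner
                           col[i - 1] + self.gap))  # t has no user partner
        self.user.append(phoneme)
        self.columns.append(col)
        return col[-1]

    def alignment(self) -> List[Tuple[Optional[str], Optional[str]]]:
        """Trace back through the stored columns; None marks a gap."""
        i, j = len(self.target), len(self.user)
        pairs: List[Tuple[Optional[str], Optional[str]]] = []
        while i > 0 or j > 0:
            score = self.columns[j][i]
            if i > 0 and j > 0:
                same = self.target[i - 1] == self.user[j - 1]
                sub = self.match if same else self.mismatch
                if score == self.columns[j - 1][i - 1] + sub:
                    pairs.append((self.target[i - 1], self.user[j - 1]))
                    i, j = i - 1, j - 1
                    continue
            if i > 0 and score == self.columns[j][i - 1] + self.gap:
                pairs.append((self.target[i - 1], None))
                i -= 1
            else:
                pairs.append((None, self.user[j - 1]))
                j -= 1
        pairs.reverse()
        return pairs


if __name__ == "__main__":
    aligner = StreamingAligner(["HH", "EH", "L", "OW"])
    for p in ["HH", "L", "OW"]:
        print(p, aligner.push(p))
    print(aligner.alignment())
```

This sketch only handles phonemes appended at the end. If the transcriber revises earlier phonemes, the columns from the first changed position onward would have to be dropped and recomputed, which is still cheaper than starting over when changes are near the end.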

Since the sequences will be fairly short, experimental evaluation results that take into account network delays etc. will be more relevant than the asymptotic time complexity.
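As a starting point for that kind of measurement, the following sketch times a score-only Needleman-Wunsch on short random sequences using `timeit` from the standard library. The function names `nw_score` and `time_alignment`, the scoring values, and the toy phoneme alphabet are assumptions made for this example; it measures only compute time, so network delay would have to be measured separately and added when comparing server-side and client-side options.

```python
import random
import timeit
from typing import List, Sequence


def nw_score(a: Sequence[str], b: Sequence[str], match: int = 1,
             mismatch: int = -1, gap: int = -1) -> int:
    """Global alignment score, keeping only two DP rows at a time."""
    prev = [j * gap for j in range(len(b) + 1)]
    for i, x in enumerate(a, start=1):
        cur = [i * gap]
        for j, y in enumerate(b, start=1):
            sub = match if x == y else mismatch
            cur.append(max(prev[j - 1] + sub, prev[j] + gap, cur[j - 1] + gap))
        prev = cur
    return prev[-1]


def time_alignment(length: int, repeats: int = 200, seed: int = 0) -> float:
    """Average seconds per alignment of two random sequences of `length`."""
    rng = random.Random(seed)
    alphabet: List[str] = ["AA", "EH", "IY", "K", "L", "M", "S", "T"]  # toy alphabet
    a = [rng.choice(alphabet) for _ in range(length)]
    b = [rng.choice(alphabet) for _ in range(length)]
    total = timeit.timeit(lambda: nw_score(a, b), number=repeats)
    return total / repeats


if __name__ == "__main__":
    for n in (10, 30, 100):
        print(f"n={n}: {time_alignment(n) * 1e3:.3f} ms per alignment")
```

Swapping `nw_score` for the existing implementation, a C binding, or a streaming variant inside `time_alignment` would give directly comparable numbers for the sequence lengths that actually occur in practice.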
