electron-speech-to-speech - free unlimited local speech-to-speech and real-time captioning solution

A ready-to-use, Electron-based speech-to-speech and live captions app for your voice calls, powered entirely by locally run AI models

Main features

A complete speech-to-speech pipeline (transcription, translation, voice synthesis) built on OpenAI Whisper, VITS Kokoro and various other open-source AI models running on WASM and WebGPU
Live captions using my whisper.cpp Node.js addon, which supports GPU acceleration through the Vulkan API and Apple Metal, or OpenBLAS for CPU inference on Windows. It can transcribe up to 99 languages and optionally translate them to English. You can caption not only your system's audio but also any input stream (a virtual audio device is recommended for voice calls, more on that below)
Cross-platform: a Windows build is provided and the app is optimized for Windows, but you can compile it for other platforms (Mac, Linux) with a single npm command

Installation

Visit the releases page and download the installer for your platform from the Assets section of the latest release (for example, the .exe file for Windows).

Currently, only Windows builds are provided.

Recommended system requirements

At least 32 GB of RAM is recommended, because some models run on the CPU via WASM: WebGPU support for them is still experimental and buggy.

Specifically, during speech-to-speech, the OpenAI Whisper transcription models run on WebGPU, while translation and voice synthesis run on the CPU.
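The split described above can be sketched as a small lookup. This is only an illustration, not the app's actual code: the function name `pickBackend`, the stage names, and the fallback to WASM when WebGPU is unavailable are all assumptions.

```typescript
// Stage names follow the pipeline described in this README.
// The types and the function below are hypothetical.
type PipelineStage = "transcription" | "translation" | "synthesis";
type InferenceBackend = "webgpu" | "wasm";

// Pick a backend for one stage. Only transcription is placed on WebGPU,
// and only when the runtime reports WebGPU support (assumed fallback: WASM).
// Translation and synthesis always stay on WASM, i.e. on the CPU.
function pickBackend(stage: PipelineStage, hasWebGPU: boolean): InferenceBackend {
  if (stage === "transcription" && hasWebGPU) {
    return "webgpu";
  }
  return "wasm";
}

// In a renderer process, WebGPU support can be detected with `"gpu" in navigator`.
// Here the flag is hard-coded so the sketch runs anywhere.
const pipelineStages: PipelineStage[] = ["transcription", "translation", "synthesis"];
const backendPlan = pipelineStages.map(
  (stage) => `${stage}: ${pickBackend(stage, true)}`
);
console.log(backendPlan.join(", "));
```

Because two of the three stages stay on the CPU under this split, RAM rather than VRAM is the main constraint, which is consistent with the recommendation above.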

Misc. Recommendations

To use the app with voice chat apps like Discord, you need a virtual audio input device for this program to target. VB-Cable is free software that is confirmed to work on Windows 11 at the time of writing:

https://vb-audio.com/Cable/

Here's how to use it:

1. Install at least one pair of virtual input and output devices.
2. Open Control Panel, go to Sound settings, select the Playback tab, and verify that there is an entry with the virtual device name you chose during installation (CABLE-A Input, for example).
3. (Optional) If you want to hear the synthesized speech output yourself: close the window, open the Recording tab, double-click your installed virtual device (CABLE-A Output, for example), switch to the Listen tab, and check Listen to this device.
4. In the Electron app, choose the option in the second select field that matches your virtual audio device name.
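To illustrate what matching the select field to the virtual device means, here is a minimal sketch that looks up a playback device by its label. It assumes a device list shaped like the entries returned by `navigator.mediaDevices.enumerateDevices()`. The interface `AudioDeviceLike`, the function `findVirtualOutput`, and the sample labels and ids are hypothetical, and whether the app resolves devices this way is not confirmed.

```typescript
// Minimal shape of an entry from navigator.mediaDevices.enumerateDevices().
// Declared locally so the sketch does not depend on DOM typings.
interface AudioDeviceLike {
  kind: "audioinput" | "audiooutput" | "videoinput";
  label: string;
  deviceId: string;
}

// Return the first playback device whose label contains the cable name
// (case-insensitive), or undefined when nothing matches.
function findVirtualOutput(
  devices: AudioDeviceLike[],
  cableName: string
): AudioDeviceLike | undefined {
  const wanted = cableName.toLowerCase();
  for (const device of devices) {
    const isPlayback = device.kind === "audiooutput";
    if (isPlayback && device.label.toLowerCase().indexOf(wanted) !== -1) {
      return device;
    }
  }
  return undefined;
}

// Sample data for illustration only; real labels and ids come from the OS.
const sampleDevices: AudioDeviceLike[] = [
  { kind: "audiooutput", label: "Speakers (Realtek Audio)", deviceId: "spk-1" },
  { kind: "audiooutput", label: "CABLE-A Input (VB-Audio Cable A)", deviceId: "cab-1" },
  { kind: "audioinput", label: "CABLE-A Output (VB-Audio Cable A)", deviceId: "cab-2" },
];

const cableTarget = findVirtualOutput(sampleDevices, "CABLE-A Input");
console.log(cableTarget ? cableTarget.deviceId : "virtual device not found");
```

Note the naming: the app plays synthesized speech into CABLE-A Input (a playback device), while the voice chat app records from CABLE-A Output (a recording device).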

You can also make this device your default input device: open the same window as in step 2, right-click the device, and select both Set as Default Device and Set as Default Communication Device. That way you won't have to reconfigure your voice chat apps (unless you have already selected specific devices there).

The same applies to the live captions input device. Choose:

System Audio to caption the audio output from your computer
A microphone device to transcribe your own voice
A virtual audio device to caption only incoming streams from specific configured apps
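The three choices above could be presented as one option list: a fixed System Audio entry followed by every recording device, which covers both microphones and virtual cables. The sketch below is a guess at how such a list might be assembled, not the app's implementation. The names `buildCaptionSourceOptions` and `SYSTEM_AUDIO_VALUE` and the sample devices are hypothetical.

```typescript
// Local stand-in for the entries of navigator.mediaDevices.enumerateDevices().
interface InputDeviceLike {
  kind: string;
  label: string;
  deviceId: string;
}

// One entry of the captions input select field.
interface CaptionSourceOption {
  value: string;
  text: string;
}

// Hypothetical sentinel value marking the System Audio entry.
const SYSTEM_AUDIO_VALUE = "system";

// Build the option list: System Audio first, then all recording devices.
// Playback devices are skipped because they cannot be captured as inputs.
function buildCaptionSourceOptions(devices: InputDeviceLike[]): CaptionSourceOption[] {
  const result: CaptionSourceOption[] = [
    { value: SYSTEM_AUDIO_VALUE, text: "System Audio" },
  ];
  for (const device of devices) {
    if (device.kind === "audioinput") {
      result.push({ value: device.deviceId, text: device.label });
    }
  }
  return result;
}

// Sample data for illustration only.
const captionDevices: InputDeviceLike[] = [
  { kind: "audioinput", label: "Microphone (USB Audio)", deviceId: "mic-1" },
  { kind: "audiooutput", label: "Speakers (Realtek Audio)", deviceId: "spk-1" },
  { kind: "audioinput", label: "CABLE-A Output (VB-Audio Cable A)", deviceId: "cab-2" },
];

for (const option of buildCaptionSourceOptions(captionDevices)) {
  console.log(`${option.value}\t${option.text}`);
}
```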

Recommended IDE Setup

VSCode + ESLint + Prettier

Project Setup

Install

$ npm install

Development

$ npm run dev

Build

# For Windows
$ npm run build:win

# For macOS
$ npm run build:mac

# For Linux
$ npm run build:linux
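If you script the build, the right command can be chosen from the Node.js platform identifier. This is a sketch under assumptions: the helper `buildScriptFor` is hypothetical and not part of the repository, and only the three script names come from the commands above.

```typescript
// Map a Node.js platform identifier (the values of process.platform)
// to the matching npm build script listed in this README.
function buildScriptFor(platform: string): string {
  switch (platform) {
    case "win32":
      return "build:win";
    case "darwin":
      return "build:mac";
    case "linux":
      return "build:linux";
    default:
      // No build script is documented for other platforms.
      throw new Error(`No build script for platform: ${platform}`);
  }
}

// In a real script you would pass process.platform instead of a literal.
console.log(`npm run ${buildScriptFor("linux")}`);
```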

Scaffolded with npm create @quick-start/electron@latest using the react-ts template.

AXGZ21/electron-speech-to-speech
