Project status

This is a GUI for OpenAI's Whisper TTS model... or anything with a compatible API.

Project status

As you might have guessed from the existence of various buttons for which there isn't a lot of explanation, this is a side project of mine that I have, for some reason, decided to put on the internet. Do not expect it to work in any meaningful way. (It might work though.)

The code might also provide excellent examples for

how to do some stuff in win32 (tray icons! global hotkeys! COM automation!)
how to not do stuff in win32 (the code is ugly, could use some cleanup, and this was my first attempt at tray icons, global hotkeys and COM automation.)

Usage

You hit record. You talk. You then press the button again (it is now labeled "Stop").

This will cause your recorded speech to be converted to MP3 and sent to Whisper. Once it responds, we insert the result into the application in the foreground.

You might already have noticed that this has questionable levels of usability, given how "the application in the front" is this GUI. To solve this, we are registering a global hotkey on F8. This corresponds to the record / stop button.

Setup

You should start by either setting an OpenAI API key for the official OpenAI Whisper API, or, if you're running Whisper locally, pointing it at an endpoint that has the same API.

Building

You need libcurl & liblame to be available in c:\devel to compile this. At some point I should put the zip file containing them somewhere.

FAQ

How do we insert the text?

If the application in the foreground happens to be Emacs, we try to connect to its server and insert the text that way. For everyone else, we copy the text to the clipboard and send a literal Ctrl-V to the app in the foreground.

Name		Name	Last commit message	Last commit date
Latest commit History 26 Commits
whisper32cmd		whisper32cmd
.gitignore		.gitignore
README.md		README.md
build.bat		build.bat
build.ps1		build.ps1
emacs.cpp		emacs.cpp
emacs.hpp		emacs.hpp
publish.bat		publish.bat
recorder.cpp		recorder.cpp
recorder.idl		recorder.idl
recorder.rc		recorder.rc
resource.h		resource.h
settings.cpp		settings.cpp
settings.hpp		settings.hpp
text_injection.cpp		text_injection.cpp
text_injection.hpp		text_injection.hpp
update-local.ps1		update-local.ps1
update.ps1		update.ps1
utils.cpp		utils.cpp
utils.hpp		utils.hpp
voice_recorder_icon.ico		voice_recorder_icon.ico
whisper_win32.sln		whisper_win32.sln
whisper_win32.vcxproj		whisper_win32.vcxproj
whisper_win32.vcxproj.filters		whisper_win32.vcxproj.filters

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Project status

Usage

Setup

Building

FAQ

How do we insert the text?

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Project status

Usage

Setup

Building

FAQ

How do we insert the text?

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages