SayKey

SayKey is a tool that turns your speech into text. It works fast, accurately, and without the internet. It uses SenseVoice to do this.

Key Features

⚡ Super Fast: Convert speech to text in real-time.
🎯 Accurate: Enjoy precise transcriptions.
🔒 100% Offline: Your data stays on your device.
⌨️ Hotkey Activated: Start dictation with a simple keyboard shortcut.
✨ Smart Punctuation: Automatically adds punctuation to your text.
🛠️ Customizable: Easy-to-use settings for a personalized experience.

Quick Start Guide

Download SayKey
Or visit the Releases page and download the latest SayKey.zip.
Extract the SayKey.zip file.
Run SayKey.exe.
If Windows Defender shows "Windows protected your PC":
- Click More info → Run anyway.
Look for the white capsule-shaped icon above your taskbar.
Place the text cursor where you want to type.
Hold Ctrl+Q, speak, and release Ctrl+Q to convert speech to text!

For a detailed setup guide, check out the Installation Wiki.

System Requirements

OS: Windows 10 or later
Memory: At least 4 GB RAM (8 GB recommended)
Disk: 1.5 GB free disk space
CPU: x86-64 CPU with AVX support recommended

See the Installation Guide for the most up-to-date information.

Usage

Start voice typing

Make sure SayKey.exe is running and you see the capsule icon above the taskbar.
Focus any text input (Notepad, Word, browser, chat app, etc.).
Press and hold Ctrl+Q to start recording.
Speak clearly into your microphone.
Release Ctrl+Q. SayKey will recognize your speech and type the text at the cursor position.

Select microphone

Right-click the capsule icon.
Click Microphone.
Choose the device you want to use.

Change the hotkey

Use the settings in the desktop app, or
Call the HTTP API POST /set_hotkey if you are integrating SayKey programmatically.

For Developers

SayKey is open-source and we welcome contributions.

Clone the repository

git clone https://github.com/WenJing95/SayKey.git
cd SayKey

Dependencies

Python 3.10
Node.js (latest LTS)
Git

Backend setup

cd backend
cd CT-Transformer-punctuation
pip install -e .
pip install -r requirements.txt

Start the backend server:

python main.py --sense-voice=./sherpa-onnx/model.int8.onnx --tokens=./sherpa-onnx/tokens.txt

You should see output similar to:

SayKey is running. Hold ctrl+q to start recording, release to recognize.
Important: Ensure the cursor is in the desired input location before using voice typing.
INFO:     Uvicorn running on http://localhost:58652 (Press CTRL+C to quit)

Command-line arguments (backend)

Required:

--tokens: Path to the tokens.txt file for the speech model.
--sense-voice: Path to the SenseVoice model.onnx.

Optional:

--num-threads: Number of threads (default: 4).
--microphone-index: Index of the microphone to use. If not specified, the system default microphone is used.
--hotkey: Hotkey combination to start recording (default: ctrl+q).
--api-port: Port number for the API server (default: 58652).
--punc-model-dir: Directory path for punctuation model files (default: ./punc-onnx).
--host: Host address for the API server (default: localhost).

HTTP API

The backend exposes a small HTTP API on --api-port (default 58652).

Base URL: http://localhost:58652

`GET /ping`

Health check to verify the backend is alive.

curl http://localhost:58652/ping

Response:

{"status": "alive"}

`GET /list_audio_devices`

List available audio input devices.

curl http://localhost:58652/list_audio_devices

Example response:

{
  "devices": [
    {
      "index": 0,
      "name": "Microphone (Realtek High Definition Audio)",
      "is_current": true
    },
    {
      "index": 1,
      "name": "Stereo Mix (Realtek High Definition Audio)",
      "is_current": false
    }
  ]
}

`POST /set_audio_device`

Set the current microphone by index.

curl -X POST http://localhost:58652/set_audio_device \
  -H "Content-Type: application/json" \
  -d '{"index": 1}'

`POST /set_hotkey`

Configure the hotkey used to start and stop recording.

curl -X POST http://localhost:58652/set_hotkey \
  -H "Content-Type: application/json" \
  -d '{"hotkey": "ctrl+q"}'

`GET /get_hotkey`

Retrieve the current hotkey configuration.

curl http://localhost:58652/get_hotkey

Example response:

{"hotkey": "ctrl+q"}

Building & Packaging

Package backend (Windows)

cd backend
./build_onefile.bat

Frontend dev & build

cd frontend
npm install

# Run in development
npm run build
npm start

# Package for distribution
npm run build
npm run electron:build

Create a full release folder

Copy everything from backend\dist.
Copy everything from frontend\release\win-unpacked.
Place them in the same directory.
Run SayKey.exe.

FAQ

Nothing happens when I hold Ctrl+Q.

Check that SayKey is running (capsule icon is visible).
Make sure your cursor is in a text field.
Try switching to another microphone in the tray menu.

Windows says "protected your PC" and blocks the app.

Click More info → Run anyway if you trust the binary from this repository.

Contributing

We welcome issues and pull requests.

Fork the repository.
Create a feature branch:
```
git checkout -b feature/your-feature
```
Commit your changes and push the branch.
Open a Pull Request.

License

SayKey is MIT licensed. See LICENSE.txt for details.

Acknowledgements

SenseVoice for the speech-to-text engine.
CT-Transformer-punctuation by lovemefan for punctuation capabilities.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
backend		backend
frontend		frontend
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE.txt		LICENSE.txt
readme.md		readme.md
readme_zh.md		readme_zh.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SayKey

Key Features

Quick Start Guide

System Requirements

Usage

Start voice typing

Select microphone

Change the hotkey

For Developers

Clone the repository

Dependencies

Backend setup

Command-line arguments (backend)

HTTP API

`GET /ping`

`GET /list_audio_devices`

`POST /set_audio_device`

`POST /set_hotkey`

`GET /get_hotkey`

Building & Packaging

Package backend (Windows)

Frontend dev & build

Create a full release folder

FAQ

Contributing

License

Acknowledgements

About

Uh oh!

Releases 2

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

SayKey

Key Features

Quick Start Guide

System Requirements

Usage

Start voice typing

Select microphone

Change the hotkey

For Developers

Clone the repository

Dependencies

Backend setup

Command-line arguments (backend)

HTTP API

GET /ping

GET /list_audio_devices

POST /set_audio_device

POST /set_hotkey

GET /get_hotkey

Building & Packaging

Package backend (Windows)

Frontend dev & build

Create a full release folder

FAQ

Contributing

License

Acknowledgements

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 2

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

`GET /ping`

`GET /list_audio_devices`

`POST /set_audio_device`

`POST /set_hotkey`

`GET /get_hotkey`

Packages