Flototext

Windows voice recognition application with real-time transcription.

Features

Push-to-talk with F2: Hold F2 to record, release to transcribe
AI Transcription: Uses Qwen3-ASR-1.7B (supports 52 languages, including French)
Auto-paste: Transcribed text is automatically pasted at cursor position
Custom Dictionary: Define word corrections for technical terms, names, etc.
Auto-mute: System audio is muted during recording to prevent interference
History: All transcriptions are saved (7-day retention)
Visual feedback: Color-coded system tray icon + Windows notifications

Requirements

Windows 10/11
Python 3.10+
NVIDIA GPU with CUDA (RTX series recommended)
~4 GB VRAM available

Installation

Clone the project:

git clone https://github.com/florentcollect/Flototext.git
cd Flototext

Create a virtual environment (recommended):

python -m venv venv
venv\Scripts\activate

Install dependencies:

pip install -r requirements.txt

For GPU support with PyTorch CUDA:

pip install torch torchaudio --index-url https://download.pytorch.org/whl/cu121

Run install.bat as administrator (right-click → Run as administrator). The installer:
- Configures auto-start with Windows
- Registers the application in Windows Settings

To uninstall: Windows Settings → Apps → Flototext → Uninstall

Usage

Launch manually (if needed):
- Double-click start-flototext.bat
- Or: python -m flototext.main

The icon appears in the system tray (near the clock).

Recording:
- Press and hold F2 to start recording
- Speak (supports 52 languages including French, English, Chinese...)
- Release F2 to stop and transcribe

The transcribed text is automatically pasted at your cursor position.
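The hold-to-record behavior described above can be sketched as a small state machine. This is only an illustration, not the code in hotkey_manager.py: the PushToTalk class and its key_down/key_up methods are hypothetical names, and wiring them to a real keyboard hook is left out. The one detail worth showing is that holding a key makes the OS repeat key-down events, so only the first one should start a recording.

```python
from typing import Callable


class PushToTalk:
    """Tracks one hold-to-record key and ignores keyboard auto-repeat."""

    def __init__(self, trigger_key: str,
                 on_start: Callable[[], None],
                 on_stop: Callable[[], None]) -> None:
        self.trigger_key = trigger_key.lower()
        self.on_start = on_start
        self.on_stop = on_stop
        self.recording = False

    def key_down(self, key: str) -> None:
        # A held key produces repeated key-down events; only the first
        # one (while not already recording) starts a recording.
        if key.lower() == self.trigger_key and not self.recording:
            self.recording = True
            self.on_start()

    def key_up(self, key: str) -> None:
        # Releasing the trigger key stops the recording exactly once.
        if key.lower() == self.trigger_key and self.recording:
            self.recording = False
            self.on_stop()


if __name__ == "__main__":
    events = []
    ptt = PushToTalk("f2",
                     on_start=lambda: events.append("start"),
                     on_stop=lambda: events.append("stop"))
    ptt.key_down("f2")
    ptt.key_down("f2")  # auto-repeat while the key is held
    ptt.key_up("f2")
    print(events)  # ['start', 'stop']
```

In a real application, on_start would begin microphone capture and on_stop would hand the audio to the transcriber.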

Tray Icon States

Color	State
Orange	Loading model
Green	Ready
Red	Recording
Yellow	Processing transcription
Gray	Error
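For readers scripting around the tray icon, the table above maps naturally onto an enum. The sketch below is an assumption about how such a mapping could look; TrayState and state_for_color are hypothetical names and are not taken from tray_app.py.

```python
from enum import Enum


class TrayState(Enum):
    """Each value is (icon color, meaning), copied from the table above."""

    LOADING = ("orange", "Loading model")
    READY = ("green", "Ready")
    RECORDING = ("red", "Recording")
    PROCESSING = ("yellow", "Processing transcription")
    ERROR = ("gray", "Error")

    @property
    def color(self) -> str:
        return self.value[0]

    @property
    def label(self) -> str:
        return self.value[1]


def state_for_color(color: str) -> TrayState:
    """Look up the state shown by a given icon color (case-insensitive)."""
    for state in TrayState:
        if state.color == color.lower():
            return state
    raise ValueError(f"unknown tray icon color: {color!r}")


if __name__ == "__main__":
    for state in TrayState:
        print(f"{state.color:<8}{state.label}")
```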

System Tray Menu

Right-click on the icon to access options:

Copy last transcription: Copy the last transcription to clipboard
Edit dictionary: Open the custom words dictionary file
Sounds: Enable/disable audio feedback
Notifications: Enable/disable Windows notifications
Mute during recording: Enable/disable system audio muting while recording
Quit: Close the application

Custom Dictionary

Create custom word corrections for terms that are frequently misrecognized (technical jargon, names, etc.).

The dictionary file is located at data/custom_words.json:

{
  "corrections": {
    "clode": "Claude",
    "anthropique": "Anthropic",
    "pie torche": "PyTorch"
  }
}

Keys: What the ASR model outputs (lowercase)
Values: The correct spelling you want

Access via tray menu: Edit dictionary (opens the file in your default JSON editor)
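To make the key/value semantics concrete, here is a minimal sketch of how such a corrections file could be applied to a transcription. It is not the implementation in text_corrector.py: load_corrections and apply_corrections are hypothetical names, and the whole-word, case-insensitive matching is an assumption about the intended behavior.

```python
import json
import re


def load_corrections(path: str) -> dict:
    """Read the "corrections" mapping from a JSON file, lowercasing the keys."""
    with open(path, encoding="utf-8") as f:
        data = json.load(f)
    return {key.lower(): value for key, value in data.get("corrections", {}).items()}


def apply_corrections(text: str, corrections: dict) -> str:
    """Replace each key with its value, matching whole words and ignoring case."""
    if not corrections:
        return text
    # Try longer keys first so multi-word keys win over shorter overlapping ones.
    keys = sorted(corrections, key=len, reverse=True)
    alternatives = "|".join(re.escape(key) for key in keys)
    # The lookarounds keep a key from matching inside a longer word.
    pattern = re.compile(r"(?<!\w)(" + alternatives + r")(?!\w)", re.IGNORECASE)
    return pattern.sub(lambda m: corrections[m.group(1).lower()], text)


if __name__ == "__main__":
    corrections = {
        "clode": "Claude",
        "anthropique": "Anthropic",
        "pie torche": "PyTorch",
    }
    print(apply_corrections("I asked clode about pie torche.", corrections))
    # I asked Claude about PyTorch.
```

Matching whole words avoids rewriting a key that happens to appear inside a longer word, at the cost of missing inflected forms.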

Database

Transcriptions are stored in data/transcriptions.db and kept for 7 days.

View history:

sqlite3 data/transcriptions.db "SELECT * FROM transcriptions ORDER BY created_at DESC LIMIT 10"
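The same query can be run from Python with the standard sqlite3 module, and the 7-day retention can be sketched as a delete by cutoff date. Only the table name transcriptions and the column created_at come from the command above; the function names are hypothetical, and the purge assumes created_at is stored as "YYYY-MM-DD HH:MM:SS" text, which may not match the real schema. Try it on a copy of the database first.

```python
import sqlite3
from contextlib import closing
from datetime import datetime, timedelta


def recent_transcriptions(db_path: str, limit: int = 10) -> list:
    """Return the newest rows as dictionaries, most recent first."""
    with closing(sqlite3.connect(db_path)) as conn:
        conn.row_factory = sqlite3.Row
        rows = conn.execute(
            "SELECT * FROM transcriptions ORDER BY created_at DESC LIMIT ?",
            (limit,),
        ).fetchall()
    return [dict(row) for row in rows]


def purge_older_than(db_path: str, days: int = 7, now: datetime = None) -> int:
    """Delete rows older than `days` days and return how many were removed.

    Assumes created_at holds "YYYY-MM-DD HH:MM:SS" text, so that string
    comparison orders rows chronologically.
    """
    now = now or datetime.now()
    cutoff = (now - timedelta(days=days)).isoformat(sep=" ", timespec="seconds")
    with closing(sqlite3.connect(db_path)) as conn:
        with conn:  # commits on success, rolls back on error
            cursor = conn.execute(
                "DELETE FROM transcriptions WHERE created_at < ?", (cutoff,)
            )
            return cursor.rowcount


if __name__ == "__main__":
    for row in recent_transcriptions("data/transcriptions.db"):
        print(row)
```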

Project Structure

Flototext/
├── flototext/
│   ├── __init__.py
│   ├── main.py                 # Entry point
│   ├── config.py               # Configuration
│   ├── core/
│   │   ├── hotkey_manager.py   # F2 key detection
│   │   ├── audio_recorder.py   # Microphone capture
│   │   ├── transcriber.py      # Qwen3-ASR model
│   │   ├── text_inserter.py    # Clipboard paste
│   │   ├── text_corrector.py   # Custom word corrections
│   │   └── audio_muter.py      # System audio muting
│   ├── storage/
│   │   ├── database.py         # SQLite operations
│   │   └── models.py           # Data models
│   └── ui/
│       ├── tray_app.py         # System tray icon
│       ├── notifications.py    # Toast notifications
│       └── sounds.py           # Audio feedback
├── data/
│   ├── transcriptions.db       # Database
│   └── custom_words.json       # Custom word dictionary
├── assets/
│   └── icon.ico
├── install.bat                 # Windows installer
├── uninstall.bat               # Uninstaller
├── start-flototext.bat         # Manual launch
├── requirements.txt
└── README.md

Configuration

Edit flototext/config.py to customize:

hotkey.trigger_key: Trigger key (default: "f2")
audio.sample_rate: Sample rate (default: 16000)
model.model_name: ASR model to use
ui.play_sounds: Sounds enabled by default
ui.show_notifications: Notifications enabled by default
ui.mute_during_recording: Mute system audio while recording (default: True)
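The dotted names above suggest a configuration object with hotkey, audio, model, and ui groups. The sketch below shows one way that layout could be expressed with dataclasses. The class names are hypothetical, the model_name string is a placeholder taken from the Features list, and the actual contents of flototext/config.py may differ.

```python
from dataclasses import dataclass, field


@dataclass
class HotkeyConfig:
    trigger_key: str = "f2"  # default listed above


@dataclass
class AudioConfig:
    sample_rate: int = 16000  # default listed above


@dataclass
class ModelConfig:
    # Placeholder: the exact identifier used in config.py is not stated here.
    model_name: str = "Qwen3-ASR-1.7B"


@dataclass
class UIConfig:
    play_sounds: bool = True
    show_notifications: bool = True
    mute_during_recording: bool = True


@dataclass
class Config:
    # default_factory gives every Config its own nested objects.
    hotkey: HotkeyConfig = field(default_factory=HotkeyConfig)
    audio: AudioConfig = field(default_factory=AudioConfig)
    model: ModelConfig = field(default_factory=ModelConfig)
    ui: UIConfig = field(default_factory=UIConfig)


if __name__ == "__main__":
    config = Config()
    config.hotkey.trigger_key = "f9"  # example: move push-to-talk to F9
    config.ui.mute_during_recording = False
    print(config)
```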

Troubleshooting

Model won't load

Check that PyTorch can see your CUDA GPU (this should print True): python -c "import torch; print(torch.cuda.is_available())"
Make sure you have enough VRAM (~4 GB)
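Both checks can be combined in a short diagnostic script. This is a sketch rather than part of Flototext: has_enough_vram and main are hypothetical helpers, and the 4 GB threshold is simply the figure from the Requirements section. It uses torch.cuda.mem_get_info(), which returns the free and total memory of the current GPU in bytes.

```python
REQUIRED_GB = 4.0  # approximate VRAM figure from the Requirements section


def has_enough_vram(free_bytes: int, required_gb: float = REQUIRED_GB) -> bool:
    """Return True if the free GPU memory covers the requirement."""
    return free_bytes / 1024**3 >= required_gb


def main() -> None:
    try:
        import torch
    except ImportError:
        print("PyTorch is not installed in this environment.")
        return

    if not torch.cuda.is_available():
        # Common causes: a CPU-only PyTorch build or a missing NVIDIA driver.
        print("PyTorch cannot see a CUDA GPU.")
        return

    free_bytes, total_bytes = torch.cuda.mem_get_info()
    print(f"GPU: {torch.cuda.get_device_name(0)}")
    print(f"CUDA version used by PyTorch: {torch.version.cuda}")
    print(f"Free VRAM: {free_bytes / 1024**3:.1f} GB of {total_bytes / 1024**3:.1f} GB")
    if not has_enough_vram(free_bytes):
        print(f"Less than {REQUIRED_GB:.0f} GB free; close other GPU applications and retry.")


if __name__ == "__main__":
    main()
```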

No audio during recording

Check that the default microphone is properly configured in Windows
Test with another recording application

Text doesn't paste

Make sure a text field is active (blinking cursor)
The text is also copied to the clipboard, so you can paste it manually with Ctrl+V

Support

If Flototext saves you time, consider supporting its development.

License

MIT License
