System-Level Audio Capture | Your Final Backup for Untranscribed Content
Directly capture system audio output. No matter how platforms protect their content, how DRMs encrypt their videos, or how live streams broadcast in real-time — as long as your computer can output sound, DeLive can transcribe it to text.
Your last resort when all other paths are blocked.
When subtitle export plugins fail, when platforms prevent downloads, when live streams have no captions, and when content is protected by DRM — system-level audio capture is your ultimate backup solution.
Need to export subtitles or transcribed content for building knowledge bases, analysis, research, or any other purpose, but the platform restricts access? DeLive captures system audio and delivers clean, exportable text you own.
- 🎧 System-Level Audio Capture - Directly capture system audio output, bypassing platform restrictions
- 🛡️ Bypass Protection Barriers - Works on platforms with download restrictions, DRM protection, or no subtitle export
- 📺 Universal Scene Coverage - Live streams, recorded videos, meetings, private courses, paid content... any audio scenario
- ⚡ Real-Time Transcription - Convert speech to text instantly with minimal latency
- 📢 Live Caption Overlay - Floating subtitle window, customizable font, color, size, and position
- 📤 Export to TXT/SRT - Simple text files or timestamped subtitle files for any player
- 🌐 60+ Language Support - Chinese, English, Japanese, and many more
- 🔄 Multiple ASR Providers - Switch between providers for different accuracy and pricing needs
- Dark/Light Theme - Comfortable viewing in any environment
- Modern Interface - Clean, frameless design with custom title bar
- Auto-Start on Login - Ready to use when your computer boots
- System Tray Integration - Runs quietly in the background
- Bilingual Interface - Chinese and English UI language options
- Auto Updates - Automatic detection and download of latest versions
graph TB
subgraph "User Interface Layer"
UI[React Frontend]
EC[Electron Container]
CW[Caption Window<br/>Floating Overlay]
end
subgraph "Audio Processing Layer"
AC[Audio Capture<br/>getDisplayMedia]
AP[Audio Processor<br/>AudioProcessor]
MR[MediaRecorder]
end
subgraph "ASR Abstraction Layer"
PR[Provider Registry]
BP[BaseASRProvider]
subgraph "Service Providers"
SP[Soniox Provider]
VP[Volc Provider]
MP[More Providers...]
end
end
subgraph "Backend Service Layer"
PS[Proxy Server<br/>Express + WS]
VC[Volcengine Proxy<br/>volcProxy]
end
subgraph "External ASR Services"
SONIOX[Soniox API<br/>WebSocket]
VOLC[Volcengine API<br/>WebSocket]
end
UI --> EC
EC --> AC
EC --> CW
AC --> AP
AC --> MR
AP -->|PCM 16kHz| VP
MR -->|WebM/Opus| SP
PR --> BP
BP --> SP
BP --> VP
BP --> MP
SP -->|Direct| SONIOX
VP --> PS
PS --> VC
VC -->|With Headers| VOLC
BP -->|Transcription| CW
style UI fill:#61dafb,color:#000
style EC fill:#47848f,color:#fff
style CW fill:#f472b6,color:#000
style PR fill:#f59e0b,color:#000
style PS fill:#10b981,color:#fff
style SONIOX fill:#6366f1,color:#fff
style VOLC fill:#ef4444,color:#fff
| Layer | Component | Description |
|---|---|---|
| User Interface | React + Electron | Modern desktop application interface |
| Caption Window | Transparent BrowserWindow | Floating subtitle overlay with customizable style |
| Audio Processing | AudioProcessor / MediaRecorder | Process audio format based on ASR service requirements |
| ASR Abstraction | Provider Registry | Unified ASR service interface, supports dynamic provider switching |
| Backend Service | Express + WebSocket | Proxy for services requiring custom Headers |
| External Services | Soniox / Volcengine | Actual speech recognition cloud services |
| Provider | Status | Features |
|---|---|---|
| Soniox | ✅ Supported | High accuracy, multi-language, direct WebSocket |
| Volcengine | ✅ Supported | Chinese optimized, proxy connection |
| More providers | 🔜 Planned | Extensible architecture, easy to add new providers |
- Node.js 18+
- ASR Service API Key (choose one):
# Clone the project
git clone https://github.com/XimilalaXiang/DeLive.git
cd DeLive
# Install all dependencies
npm run install:all# Start backend server (required for Volcengine)
cd server && npm run dev
# In another terminal, start frontend + Electron
npm run dev# Build Windows application
npm run dist:winBuilt files are located in the release/ directory:
DeLive-x.x.x-x64.exe- InstallerDeLive-x.x.x-portable.exe- Portable version
- Select Provider - Click settings and choose your ASR service provider
- Configure API Key - Enter the corresponding API key for your provider
- Test Configuration - Click "Test Config" to verify settings
- Start Recording - Click the "Start Recording" button
- Select Audio Source - Choose the screen/window to share (check "Share audio")
- Real-time Transcription - The system will automatically capture audio and display results
- Stop Recording - Click "Stop Recording", transcription will be saved to history
- Enable Captions - Click "Show Caption" button in settings
- Customize Style - Click the settings icon to adjust font, color, background, etc.
- Move Caption - Hover over the caption window, click the lock icon to unlock, then drag to reposition
- Lock Position - Click the lock icon again to lock the caption in place
- Reset Position - Click "Reset Position" button to restore default location
- Export to TXT - Click export button and select TXT format
- Export to SRT - Click export button and select SRT format for subtitle files
DeLive/
├── electron/ # Electron main process
│ ├── main.ts # Main process entry
│ └── preload.ts # Preload script
├── frontend/ # React frontend
│ ├── src/
│ │ ├── components/ # UI components
│ │ │ ├── CaptionOverlay.tsx # Caption window component
│ │ │ ├── CaptionControls.tsx # Caption settings controls
│ │ │ └── ...
│ │ ├── hooks/ # Custom Hooks
│ │ ├── providers/ # ASR provider implementations
│ │ │ ├── base.ts # Base class
│ │ │ ├── registry.ts # Provider registry
│ │ │ └── implementations/ # Provider implementations
│ │ ├── stores/ # Zustand state management
│ │ ├── types/ # TypeScript types
│ │ │ └── asr/ # ASR related type definitions
│ │ ├── utils/ # Utility functions
│ │ │ └── audioProcessor.ts # Audio processor
│ │ └── i18n/ # Internationalization
│ └── ...
├── server/ # Backend proxy service
│ └── src/
│ ├── index.ts # Express server
│ └── volcProxy.ts # Volcengine WebSocket proxy
├── build/ # App icon resources
├── scripts/ # Build scripts
└── package.json
| Layer | Technology |
|---|---|
| Desktop Framework | Electron 40 |
| Frontend | React 18 + TypeScript + Vite |
| Styling | Tailwind CSS |
| State Management | Zustand |
| Backend | Express + ws |
| ASR Engine | Soniox V4 / Volcengine |
| Bundler | electron-builder |
| Shortcut | Function |
|---|---|
Ctrl+Shift+D |
Show/Hide main window |
DeLive uses an extensible provider architecture. To add a new provider:
- Create a new Provider class in
frontend/src/providers/implementations/ - Extend
BaseASRProviderand implement required methods - Register the new provider in
registry.ts - If the service requires custom Headers, add a proxy in
server/src/
Refer to existing implementations (SonioxProvider.ts and VolcProvider.ts) for detailed guidance.
- System Requirements - Windows 10/11 64-bit
- API Quota - Be aware of each provider's API usage limits
- Volcengine - Requires starting the backend server (
cd server && npm run dev) - Tray Behavior - Clicking close minimizes to tray, right-click tray icon and select "Exit" to fully close
- Caption Window - The caption window is always on top and mouse-transparent when locked
When you first run DeLive, Windows may display a SmartScreen warning saying "Windows protected your PC". This is normal behavior for new applications that haven't yet established reputation with Microsoft.
Why does this happen?
- DeLive is an open-source project without a paid code signing certificate
- New applications without widespread usage will trigger this warning
- This does NOT mean the software is harmful
How to proceed:
- Click "More info" on the warning dialog
- Click "Run anyway" to start DeLive
Verify Safety:
- VirusTotal Scan Results - You can verify the application is safe
- The source code is fully open and auditable on GitHub
Apache License 2.0
Apache 2.0 License - Free to use, modify, and distribute with attribution
- Soniox - Powerful speech recognition API
- Volcengine - Chinese-optimized speech recognition service
- BiBi-Keyboard - Multi-provider architecture reference
- Electron - Cross-platform desktop application framework
- React - User interface library
- Tailwind CSS - CSS framework
Made with ❤️ by XimilalaXiang

