DeLive

System-Level Audio Capture | Your Final Backup for Untranscribed Content

Directly capture system audio output. No matter how platforms protect their content, how DRMs encrypt their videos, or how live streams broadcast in real-time — as long as your computer can output sound, DeLive can transcribe it to text.

💡 When to Use DeLive

Your last resort when all other paths are blocked.

When subtitle export plugins fail, when platforms prevent downloads, when live streams have no captions, and when content is protected by DRM — system-level audio capture is your ultimate backup solution.

Need to export subtitles or transcribed content for building knowledge bases, analysis, research, or any other purpose, but the platform restricts access? DeLive captures system audio and delivers clean, exportable text you own.

🎯 Core Features

🎧 System-Level Audio Capture - Directly capture system audio output, bypassing platform restrictions
🛡️ Bypass Protection Barriers - Works on platforms with download restrictions, DRM protection, or no subtitle export
📺 Universal Scene Coverage - Live streams, recorded videos, meetings, private courses, paid content... any audio scenario
⚡ Real-Time Transcription - Convert speech to text instantly with minimal latency
📢 Live Caption Overlay - Floating subtitle window, customizable font, color, size, and position
📤 Export to TXT/SRT - Simple text files or timestamped subtitle files for any player
🌐 60+ Language Support - Chinese, English, Japanese, and many more
🔄 Multiple ASR Providers - Switch between providers for different accuracy and pricing needs

🎨 User Experience

Dark/Light Theme - Comfortable viewing in any environment
Modern Interface - Clean, frameless design with custom title bar
Auto-Start on Login - Ready to use when your computer boots
System Tray Integration - Runs quietly in the background
Bilingual Interface - Chinese and English UI language options
Auto Updates - Automatic detection and download of latest versions

🏗️ System Architecture

graph TB
    subgraph "User Interface Layer"
        UI[React Frontend]
        EC[Electron Container]
        CW[Caption Window<br/>Floating Overlay]
    end
    
    subgraph "Audio Processing Layer"
        AC[Audio Capture<br/>getDisplayMedia]
        AP[Audio Processor<br/>AudioProcessor]
        MR[MediaRecorder]
    end
    
    subgraph "ASR Abstraction Layer"
        PR[Provider Registry]
        BP[BaseASRProvider]
        
        subgraph "Service Providers"
            SP[Soniox Provider]
            VP[Volc Provider]
            MP[More Providers...]
        end
    end
    
    subgraph "Backend Service Layer"
        PS[Proxy Server<br/>Express + WS]
        VC[Volcengine Proxy<br/>volcProxy]
    end
    
    subgraph "External ASR Services"
        SONIOX[Soniox API<br/>WebSocket]
        VOLC[Volcengine API<br/>WebSocket]
    end
    
    UI --> EC
    EC --> AC
    EC --> CW
    AC --> AP
    AC --> MR
    
    AP -->|PCM 16kHz| VP
    MR -->|WebM/Opus| SP
    
    PR --> BP
    BP --> SP
    BP --> VP
    BP --> MP
    
    SP -->|Direct| SONIOX
    VP --> PS
    PS --> VC
    VC -->|With Headers| VOLC
    
    BP -->|Transcription| CW
    
    style UI fill:#61dafb,color:#000
    style EC fill:#47848f,color:#fff
    style CW fill:#f472b6,color:#000
    style PR fill:#f59e0b,color:#000
    style PS fill:#10b981,color:#fff
    style SONIOX fill:#6366f1,color:#fff
    style VOLC fill:#ef4444,color:#fff

Architecture Overview

Layer	Component	Description
User Interface	React + Electron	Modern desktop application interface
Caption Window	Transparent BrowserWindow	Floating subtitle overlay with customizable style
Audio Processing	AudioProcessor / MediaRecorder	Process audio format based on ASR service requirements
ASR Abstraction	Provider Registry	Unified ASR service interface, supports dynamic provider switching
Backend Service	Express + WebSocket	Proxy for services requiring custom Headers
External Services	Soniox / Volcengine	Actual speech recognition cloud services

🔌 Supported ASR Services

Provider	Status	Features
Soniox	✅ Supported	High accuracy, multi-language, direct WebSocket
Volcengine	✅ Supported	Chinese optimized, proxy connection
More providers	🔜 Planned	Extensible architecture, easy to add new providers

🚀 Quick Start

Prerequisites

Node.js 18+
ASR Service API Key (choose one):
- Soniox API Key
- Volcengine APP ID and Access Token

Installation

# Clone the project
git clone https://github.com/XimilalaXiang/DeLive.git
cd DeLive

# Install all dependencies
npm run install:all

Development Mode

# Start backend server (required for Volcengine)
cd server && npm run dev

# In another terminal, start frontend + Electron
npm run dev

Build

# Build Windows application
npm run dist:win

Built files are located in the release/ directory:

DeLive-x.x.x-x64.exe - Installer
DeLive-x.x.x-portable.exe - Portable version

📖 Usage

Basic Transcription

Select Provider - Click settings and choose your ASR service provider
Configure API Key - Enter the corresponding API key for your provider
Test Configuration - Click "Test Config" to verify settings
Start Recording - Click the "Start Recording" button
Select Audio Source - Choose the screen/window to share (check "Share audio")
Real-time Transcription - The system will automatically capture audio and display results
Stop Recording - Click "Stop Recording", transcription will be saved to history

Real-time Screen Captions (New)

Enable Captions - Click "Show Caption" button in settings
Customize Style - Click the settings icon to adjust font, color, background, etc.
Move Caption - Hover over the caption window, click the lock icon to unlock, then drag to reposition
Lock Position - Click the lock icon again to lock the caption in place
Reset Position - Click "Reset Position" button to restore default location

Export Options

Export to TXT - Click export button and select TXT format
Export to SRT - Click export button and select SRT format for subtitle files

📁 Project Structure

DeLive/
├── electron/              # Electron main process
│   ├── main.ts               # Main process entry
│   └── preload.ts            # Preload script
├── frontend/              # React frontend
│   ├── src/
│   │   ├── components/       # UI components
│   │   │   ├── CaptionOverlay.tsx  # Caption window component
│   │   │   ├── CaptionControls.tsx # Caption settings controls
│   │   │   └── ...
│   │   ├── hooks/            # Custom Hooks
│   │   ├── providers/        # ASR provider implementations
│   │   │   ├── base.ts           # Base class
│   │   │   ├── registry.ts       # Provider registry
│   │   │   └── implementations/  # Provider implementations
│   │   ├── stores/           # Zustand state management
│   │   ├── types/            # TypeScript types
│   │   │   └── asr/              # ASR related type definitions
│   │   ├── utils/            # Utility functions
│   │   │   └── audioProcessor.ts # Audio processor
│   │   └── i18n/             # Internationalization
│   └── ...
├── server/                # Backend proxy service
│   └── src/
│       ├── index.ts          # Express server
│       └── volcProxy.ts      # Volcengine WebSocket proxy
├── build/                 # App icon resources
├── scripts/               # Build scripts
└── package.json

🔧 Tech Stack

Layer	Technology
Desktop Framework	Electron 40
Frontend	React 18 + TypeScript + Vite
Styling	Tailwind CSS
State Management	Zustand
Backend	Express + ws
ASR Engine	Soniox V4 / Volcengine
Bundler	electron-builder

⌨️ Keyboard Shortcuts

Shortcut	Function
`Ctrl+Shift+D`	Show/Hide main window

🔧 Adding New ASR Providers

DeLive uses an extensible provider architecture. To add a new provider:

Create a new Provider class in frontend/src/providers/implementations/
Extend BaseASRProvider and implement required methods
Register the new provider in registry.ts
If the service requires custom Headers, add a proxy in server/src/

Refer to existing implementations (SonioxProvider.ts and VolcProvider.ts) for detailed guidance.

⚠️ Notes

System Requirements - Windows 10/11 64-bit
API Quota - Be aware of each provider's API usage limits
Volcengine - Requires starting the backend server (cd server && npm run dev)
Tray Behavior - Clicking close minimizes to tray, right-click tray icon and select "Exit" to fully close
Caption Window - The caption window is always on top and mouse-transparent when locked

🛡️ Windows SmartScreen Warning

When you first run DeLive, Windows may display a SmartScreen warning saying "Windows protected your PC". This is normal behavior for new applications that haven't yet established reputation with Microsoft.

Why does this happen?

DeLive is an open-source project without a paid code signing certificate
New applications without widespread usage will trigger this warning
This does NOT mean the software is harmful

How to proceed:

Click "More info" on the warning dialog
Click "Run anyway" to start DeLive

Verify Safety:

VirusTotal Scan Results - You can verify the application is safe
The source code is fully open and auditable on GitHub

📄 License

Apache License 2.0

Apache 2.0 License - Free to use, modify, and distribute with attribution

🙏 Acknowledgments

Soniox - Powerful speech recognition API
Volcengine - Chinese-optimized speech recognition service
BiBi-Keyboard - Multi-provider architecture reference
Electron - Cross-platform desktop application framework
React - User interface library
Tailwind CSS - CSS framework

Made with ❤️ by XimilalaXiang

Name		Name	Last commit message	Last commit date
Latest commit History 62 Commits
.github/workflows		.github/workflows
assets		assets
build		build
electron		electron
frontend		frontend
scripts		scripts
server		server
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
LICENSE		LICENSE
README.md		README.md
README_EN.md		README_EN.md
README_JA.md		README_JA.md
README_TW.md		README_TW.md
README_ZH.md		README_ZH.md
package-lock.json		package-lock.json
package.json		package.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DeLive

💡 When to Use DeLive

🎯 Core Features

🎨 User Experience

🏗️ System Architecture

Architecture Overview

🔌 Supported ASR Services

🚀 Quick Start

Prerequisites

Installation

Development Mode

Build

📖 Usage

Basic Transcription

Real-time Screen Captions (New)

Export Options

📁 Project Structure

🔧 Tech Stack

⌨️ Keyboard Shortcuts

🔧 Adding New ASR Providers

⚠️ Notes

🛡️ Windows SmartScreen Warning

📄 License

🙏 Acknowledgments

About

Uh oh!

Releases 18

Packages

Contributors 3

Uh oh!

Languages

License

XimilalaXiang/DeLive

Folders and files

Latest commit

History

Repository files navigation

DeLive

💡 When to Use DeLive

🎯 Core Features

🎨 User Experience

🏗️ System Architecture

Architecture Overview

🔌 Supported ASR Services

🚀 Quick Start

Prerequisites

Installation

Development Mode

Build

📖 Usage

Basic Transcription

Real-time Screen Captions (New)

Export Options

📁 Project Structure

🔧 Tech Stack

⌨️ Keyboard Shortcuts

🔧 Adding New ASR Providers

⚠️ Notes

🛡️ Windows SmartScreen Warning

📄 License

🙏 Acknowledgments

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 18

Packages 0

Contributors 3

Uh oh!

Languages

Packages