LiveKit Plugin for Voxist ASR

Production-ready LiveKit Speech-to-Text (STT) plugin for the Voxist automatic speech recognition (ASR) API.

Features

Ultra-low latency: < 300ms end-to-end with connection pooling
Multi-language support: French, English, German, Italian, Spanish, Dutch, Portuguese, Swedish
Production-ready: Auto-reconnection, error recovery, comprehensive testing
Simple API: 3-line integration with LiveKit agents

Installation

pip install livekit-plugins-voxist

Quick Start

from livekit import agents
from livekit.agents import cli  # needed for cli.run_app() below
from livekit.plugins import voxist, openai, elevenlabs

async def entrypoint(ctx: agents.JobContext):
    # Initialize Voxist STT
    stt = voxist.VoxistSTT(language="fr")

    # Create voice agent
    agent = agents.VoicePipelineAgent(
        stt=stt,
        llm=openai.LLM(model="gpt-4"),
        tts=elevenlabs.TTS(voice="Rachel"),
    )

    await agent.start(ctx.room)

if __name__ == "__main__":
    cli.run_app(agents.WorkerOptions(entrypoint_fnc=entrypoint))

Configuration

Environment Variables

# Required
export VOXIST_API_KEY="voxist_..."

# Optional
export VOXIST_BASE_URL="wss://api-asr.voxist.com/ws"
export VOXIST_LANGUAGE="fr"
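
As a rough illustration of how these variables might be consumed, the sketch below reads them from the environment and fails early when the required key is missing. The `VoxistSettings` class and `load_voxist_settings` function are hypothetical names used only for this example and are not part of the plugin. Treating the optional values shown above as fallback defaults is also an assumption.

```python
import os
from dataclasses import dataclass

# Assumption: the optional values shown above double as fallback defaults.
DEFAULT_BASE_URL = "wss://api-asr.voxist.com/ws"
DEFAULT_LANGUAGE = "fr"


@dataclass(frozen=True)
class VoxistSettings:
    """Hypothetical container for the three variables listed above."""

    api_key: str
    base_url: str
    language: str


def load_voxist_settings(env=None) -> VoxistSettings:
    """Read settings from a mapping (os.environ when none is given)."""
    env = os.environ if env is None else env
    api_key = env.get("VOXIST_API_KEY", "").strip()
    if not api_key:
        # Fail before any connection attempt is made.
        raise RuntimeError("VOXIST_API_KEY is required but not set")
    return VoxistSettings(
        api_key=api_key,
        base_url=env.get("VOXIST_BASE_URL", DEFAULT_BASE_URL),
        language=env.get("VOXIST_LANGUAGE", DEFAULT_LANGUAGE),
    )
```

Passing a plain dictionary instead of `os.environ` keeps the helper easy to exercise in unit tests.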

Advanced Configuration

stt = voxist.VoxistSTT(
    api_key="voxist_...",
    language="fr",
    sample_rate=16000,
    interim_results=True,
    connection_pool_size=3,      # 2-3 recommended for ultra-low latency
    chunk_duration_ms=100,       # Audio chunk size
    stride_overlap_ms=20,        # Chunk overlap for accuracy
)
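
To make the chunking parameters concrete, the sketch below works out what `chunk_duration_ms=100` and `stride_overlap_ms=20` mean in samples at 16 kHz, and splits a buffer into overlapping chunks. It assumes 16-bit mono PCM and assumes the overlap is taken from the tail of the previous chunk. The plugin's internal behaviour may differ, and `chunk_layout` and `split_with_overlap` are hypothetical helpers.

```python
def chunk_layout(sample_rate, chunk_duration_ms, stride_overlap_ms,
                 bytes_per_sample=2):
    """Convert millisecond settings into sample and byte counts.

    bytes_per_sample=2 assumes 16-bit mono PCM (an assumption).
    """
    chunk_samples = sample_rate * chunk_duration_ms // 1000
    overlap_samples = sample_rate * stride_overlap_ms // 1000
    return {
        "chunk_samples": chunk_samples,
        "overlap_samples": overlap_samples,
        # New samples consumed per chunk once the overlap is subtracted.
        "hop_samples": chunk_samples - overlap_samples,
        "chunk_bytes": chunk_samples * bytes_per_sample,
    }


def split_with_overlap(samples, chunk_samples, overlap_samples):
    """Yield chunks where each one repeats the tail of the previous one."""
    hop = chunk_samples - overlap_samples
    if hop <= 0:
        raise ValueError("overlap must be smaller than the chunk size")
    start = 0
    while start < len(samples):
        yield samples[start:start + chunk_samples]
        if start + chunk_samples >= len(samples):
            break  # the buffer is fully covered
        start += hop


layout = chunk_layout(16000, 100, 20)
chunks = list(split_with_overlap(list(range(4000)), 1600, 320))
```

With the settings above, each chunk holds 1600 samples (3200 bytes) and advances by 1280 new samples, so consecutive chunks share 320 samples.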

Supported Languages

Code	Language
`fr`, `fr-FR`	French
`en`, `en-US`	English
`de`, `de-DE`	German
`it`	Italian
`es`	Spanish
`nl`, `nl-NL`	Dutch
`pt`	Portuguese
`sv`	Swedish
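
A small lookup like the one below can reject unsupported codes before a session is opened. It simply mirrors the table above. `SUPPORTED_LANGUAGES` and `language_name` are hypothetical names for this example, and the exact, case-sensitive matching is an assumption about how strict the API is.

```python
# Mirrors the table above; not imported from the plugin.
SUPPORTED_LANGUAGES = {
    "fr": "French", "fr-FR": "French",
    "en": "English", "en-US": "English",
    "de": "German", "de-DE": "German",
    "it": "Italian",
    "es": "Spanish",
    "nl": "Dutch", "nl-NL": "Dutch",
    "pt": "Portuguese",
    "sv": "Swedish",
}


def language_name(code: str) -> str:
    """Return the language name for a code, or raise ValueError."""
    try:
        return SUPPORTED_LANGUAGES[code]
    except KeyError:
        known = ", ".join(sorted(SUPPORTED_LANGUAGES))
        raise ValueError(
            f"unsupported language {code!r}; expected one of: {known}"
        ) from None
```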

Performance

Latency: < 300ms end-to-end (95th percentile)
Connection pool: Zero cold-start with pre-warmed connections
Recovery: < 2s automatic reconnection
Memory: < 50MB per agent instance
CPU: < 10% for audio processing
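
The idea behind a pre-warmed connection pool can be sketched as follows: connections are opened up front, so acquiring one later does not pay the handshake cost. This is an illustration of the general technique, not the plugin's actual implementation. `FakeConnection` and `WarmPool` are hypothetical, and the simulated handshake is just a short sleep.

```python
import asyncio


class FakeConnection:
    """Stand-in for a WebSocket; opening it only sleeps briefly."""

    def __init__(self, ident: int) -> None:
        self.ident = ident
        self.is_open = False

    async def open(self) -> None:
        await asyncio.sleep(0.01)  # simulated handshake cost
        self.is_open = True


class WarmPool:
    """Opens all connections up front and hands out idle ones."""

    def __init__(self, size: int) -> None:
        self._size = size
        self._idle: asyncio.Queue = asyncio.Queue()

    async def warm_up(self) -> None:
        conns = [FakeConnection(i) for i in range(self._size)]
        # Open every connection concurrently, before any audio arrives.
        await asyncio.gather(*(c.open() for c in conns))
        for conn in conns:
            self._idle.put_nowait(conn)

    @property
    def idle_count(self) -> int:
        return self._idle.qsize()

    async def acquire(self) -> FakeConnection:
        # Returns immediately while an idle connection is available.
        return await self._idle.get()

    def release(self, conn: FakeConnection) -> None:
        self._idle.put_nowait(conn)


async def demo() -> list:
    pool = WarmPool(size=3)
    await pool.warm_up()
    conn = await pool.acquire()
    state = [conn.ident, conn.is_open, pool.idle_count]
    pool.release(conn)
    return state
```

Running `asyncio.run(demo())` shows that the acquired connection is already open, which is the property the "zero cold-start" claim relies on.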

Documentation

Technical Specification: See /claudedocs/livekit-plugin-technical-specification.md
API Reference: Coming soon
Examples: See examples/ directory

Local Development with Docker

1. Set Up Python Environment

# Create virtual environment
python -m venv .venv

# Activate (macOS/Linux)
source .venv/bin/activate

# Activate (Windows)
.venv\Scripts\activate

# Install package with dev dependencies
pip install -e ".[dev]"

2. Start LiveKit Server

docker run --rm -p 7880:7880 -p 7881:7881 -p 7882:7882/udp \
    livekit/livekit-server \
    --dev \
    --bind 0.0.0.0

The server runs with development keys:

API Key: devkey
API Secret: secret

3. Install LiveKit CLI

# macOS
brew install livekit-cli

# Linux
curl -sSL https://get.livekit.io/cli | bash

# Or download from: https://github.com/livekit/livekit-cli/releases

4. Generate Access Token

# Generate token for a test room
livekit-cli create-token \
    --api-key devkey \
    --api-secret secret \
    --join --room test-room \
    --identity user1 \
    --valid-for 24h

5. Expose with ngrok (for external access)

To test from external devices or meet.livekit.io (which requires HTTPS):

# Install ngrok: https://ngrok.com/download
# Then expose LiveKit server
ngrok http 7880

Note the ngrok URL (e.g., https://xxxx.ngrok-free.app)

6. Test with meet.livekit.io

Go to meet.livekit.io
Click "Custom Server"
Enter:
- LiveKit URL: wss://xxxx.ngrok-free.app (use your ngrok URL with wss://)
- Token: (paste the token from step 4)
Click "Connect"

Note: For local testing without ngrok, use ws://localhost:7880
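
The scheme swap described above (https to wss for ngrok, plain ws for localhost) can be sketched with the standard library. `to_websocket_url` is a hypothetical helper written for this example.

```python
from urllib.parse import urlsplit, urlunsplit

# http(s) URLs map to their WebSocket equivalents; ws(s) pass through.
_SCHEME_MAP = {"https": "wss", "http": "ws", "wss": "wss", "ws": "ws"}


def to_websocket_url(url: str) -> str:
    """Rewrite an http(s) URL to the matching ws(s) URL."""
    parts = urlsplit(url)
    scheme = _SCHEME_MAP.get(parts.scheme.lower())
    if scheme is None:
        raise ValueError(f"expected an http(s) or ws(s) URL, got {url!r}")
    return urlunsplit(
        (scheme, parts.netloc, parts.path, parts.query, parts.fragment)
    )


ngrok_ws = to_websocket_url("https://xxxx.ngrok-free.app")
local_ws = to_websocket_url("http://localhost:7880")
```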

7. Run the Agent

# Set environment variables
export LIVEKIT_URL=ws://localhost:7880
export LIVEKIT_API_KEY=devkey
export LIVEKIT_API_SECRET=secret
export VOXIST_API_KEY=your_voxist_api_key

# Run the agent
python examples/simple_transcription.py dev

The agent will connect to the room and transcribe audio from participants.

Staging Backend

For testing against Voxist staging:

stt = voxist.VoxistSTT(
    api_key="your_staging_key",
    base_url="wss://asr-staging-dev.voxist.com/ws",
    language="fr",
)
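
One way to avoid hard-coding the endpoint is to select it by name, as in the sketch below. The two URLs are the ones that appear in this README. The `VOXIST_BACKENDS` mapping, the `backend_url` helper, and the "production" and "staging" labels are hypothetical and exist only for this example.

```python
# URLs copied from this README; the labels are illustrative only.
VOXIST_BACKENDS = {
    "production": "wss://api-asr.voxist.com/ws",
    "staging": "wss://asr-staging-dev.voxist.com/ws",
}


def backend_url(name: str = "production") -> str:
    """Return the WebSocket endpoint for a named backend."""
    try:
        return VOXIST_BACKENDS[name]
    except KeyError:
        known = ", ".join(sorted(VOXIST_BACKENDS))
        raise ValueError(
            f"unknown backend {name!r}; expected one of: {known}"
        ) from None
```

The result can then be passed as `base_url` when constructing `voxist.VoxistSTT`, as in the staging example above.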

Development Status

🚧 Under Active Development

Track progress with the beads issue tracker:

/beads:stats              # View project statistics
/beads:ready              # See ready-to-work issues
/beads:show livekit-1     # View epic details

License

MIT - See LICENSE file

Support

Issues: https://github.com/voxist/livekit-plugins-voxist/issues
Voxist API: https://api-asr.voxist.com/docs
LiveKit: https://docs.livekit.io
