Skip to content

feat/switch to pyannote audio#628

Merged
fedirz merged 15 commits intomasterfrom
feat/switch-to-pyannote-audio
Mar 26, 2026
Merged

feat/switch to pyannote audio#628
fedirz merged 15 commits intomasterfrom
feat/switch-to-pyannote-audio

Conversation

@fedirz
Copy link
Copy Markdown
Collaborator

@fedirz fedirz commented Mar 25, 2026

  • chore: remove unused piper-phonemize override

  • deps: update openai package

  • deps: remove hf-transfer
    Removing due to its instability:
    Consider disabling HF_HUB_ENABLE_HF_TRANSFER for better error handling huggingface/hf_transfer#63

  • deps: update required uv version (pt2)

  • feat: switch from onnx-diarization to pyannote

  • chore: reduce CUDA image size by switching to nvidia/cuda base image
    Switch base image from cudnn-runtime to base variant so torch's bundled
    nvidia pip packages are the sole source of CUDA libraries, eliminating
    the ~2-4GB duplication that occurred when both the base image and torch
    provided the same CUDA toolkit libs.

  • chore: suppress torchcodec warnings

  • chore: add speaches-hot-reload task

  • feat: propagate hf gated model repo errors

  • deps: add debugpy dev package

Fedir Zadniprovskyi added 10 commits March 22, 2026 13:13
@fedirz fedirz force-pushed the feat/switch-to-pyannote-audio branch from c2317f0 to ba0111c Compare March 25, 2026 03:01
Fedir Zadniprovskyi added 2 commits March 25, 2026 08:57
_scan_cached_repo stores only the basename in file_name, so all README.md
files across subdirectories (e.g. embedding/, plda/) match the same filter.
Sort by path depth to always select the root-level README.md which contains
the model card metadata.
Fixes a segmentation fault (exit code 139) on ubuntu-24.04-x86_64 CI runners.
onnxruntime-gpu was crashing inside _create_inference_session when initializing
the Silero VAD v5 ONNX model on a CPU-only environment. Updating to the latest
onnxruntime version resolves the crash.
@fedirz fedirz force-pushed the feat/switch-to-pyannote-audio branch 2 times, most recently from 778cc28 to 6aa0a96 Compare March 25, 2026 16:32
@fedirz fedirz force-pushed the feat/switch-to-pyannote-audio branch from 6aa0a96 to e8b1145 Compare March 26, 2026 12:44
The fixed 0.25s sleep was shorter than the VAD pipeline overhead
(audio decode + VAD load + inference ~0.57s), so the Whisper model
hadn't been added to loaded_models yet when the DELETE fired.
@fedirz fedirz merged commit 870e1e1 into master Mar 26, 2026
5 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant