SpeechLine is a speech labeling pipeline that handles end-to-end, offline, batch audio categorization, transcription, segmentation, and logging. It supports multiple state-of-the-art speech recognition models including Wav2Vec2, Whisper, Parakeet, Parakeet TDT, and Canary.
Figure inspired by BERTopic's Modularity Diagram
