multimodal-tools/segment_transcript_by_topic at main · mixpeek/multimodal-tools

Name	Name	Last commit message	Last commit date
parent directory ..
examples	examples
README.md	README.md
requirements.txt	requirements.txt
segment_transcript.py	segment_transcript.py
topic_segmenter.py	topic_segmenter.py
utils.py	utils.py

Name

Last commit message

Last commit date

examples

README.md

requirements.txt

segment_transcript.py

topic_segmenter.py

utils.py

🧠 Segment Transcript by Topic

This Python tool accepts a video or audio file, transcribes it using OpenAI Whisper, embeds transcript segments using SentenceTransformers, and clusters them by topic using HDBSCAN.

🔧 Features

Automatic speech transcription (Whisper)
Sentence segmentation
Embedding via SentenceTransformers
Clustering via HDBSCAN
Topic summary generation (optional)

🏁 Quickstart

# Install deps
pip install -r requirements.txt

# Run segmentation
python segment_transcript.py --input examples/sample_video.mp4 --output segments.json

📂 Output Format

[
  {
    "topic_id": 0,
    "start": 12.3,
    "end": 54.6,
    "text": "Discussion about industry challenges..."
  },
  ...
]

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

🧠 Segment Transcript by Topic

🔧 Features

🏁 Quickstart

📂 Output Format

FilesExpand file tree

segment_transcript_by_topic

Directory actions

More options

Directory actions

More options

Latest commit

History

segment_transcript_by_topic

Folders and files

parent directory

README.md

🧠 Segment Transcript by Topic

🔧 Features

🏁 Quickstart

📂 Output Format