Add video-to-spectrogram batch conversion utilities using existing STFT functions #68
Implements standalone utilities for converting audio/video files to spectrogram images using the existing `fourier_transformation` and `make_logscale` functions from the Video Node, enabling batch dataset preparation for audio classification tasks.

Changes

Core Scripts
- `simple_video_to_spectrogram.py` - Minimal batch processor following the ESC-50 CSV pattern, with `fourier_transformation`, `make_logscale`, and `plot_spectrogram` functions
- `video_to_spectrogram.py` - Full CLI tool with single/batch modes, video support via ffmpeg audio extraction, configurable FFT bins and colormaps

Documentation & Examples
- `VIDEO_TO_SPECTROGRAM_README.md` - Technical reference with parameters, output structure, performance tips
- `QUICKSTART_VIDEO_TO_SPECTROGRAM.md` - Quick start guide with common use cases
- `examples/video_to_spectrogram_example.py` - Working examples for single file, batch, and ESC-50 processing

Testing
- `tests/test_video_to_spectrogram.py` - Integration tests for STFT, log scaling, and end-to-end spectrogram generation
- `requirements.txt` with scipy and pandas

Usage
Batch processing with CSV metadata:
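As a rough illustration of the ESC-50 CSV pattern, the sketch below reads metadata rows and maps each audio file to an output image path. It is not the PR's script: the helper name `plan_batch`, the directory names, and the per-category output layout are assumptions made for illustration. Only the `filename` and `category` columns come from the ESC-50 metadata format.

```python
import csv
import io
from pathlib import Path


def plan_batch(csv_file, audio_dir, output_dir):
    """Yield (source_wav, target_png) pairs, one per metadata row.

    Hypothetical helper: the output layout <output_dir>/<category>/<stem>.png
    is an assumption, not something confirmed by the PR.
    """
    reader = csv.DictReader(csv_file)
    for row in reader:
        source = Path(audio_dir) / row["filename"]
        stem = Path(row["filename"]).stem
        target = Path(output_dir) / row["category"] / f"{stem}.png"
        yield source, target


# Illustrative rows in ESC-50 style (filename, fold, target, category).
sample = io.StringIO(
    "filename,fold,target,category\n"
    "1-100032-A-0.wav,1,0,dog\n"
    "1-100038-A-14.wav,1,14,chirping_birds\n"
)
pairs = list(plan_batch(sample, "audio", "spectrograms"))
for source, target in pairs:
    print(source, "->", target)
```

Separating the path planning from the actual rendering keeps the loop easy to test without any audio files on disk; the spectrogram call would slot in where the pairs are consumed.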
CLI for single files:
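For video input, the description says audio is extracted through ffmpeg before the spectrogram is computed. A minimal sketch of that step follows, assuming mono 16-bit PCM WAV output at 44.1 kHz. The function name `ffmpeg_extract_command`, the file names, and the chosen sample rate are hypothetical; the ffmpeg options themselves (`-y`, `-i`, `-vn`, `-acodec`, `-ar`, `-ac`) are standard. The PR's actual CLI flags are not shown here and are not reproduced.

```python
def ffmpeg_extract_command(video_path, wav_path, sample_rate=44100):
    """Build an ffmpeg argument list that extracts the audio track as WAV.

    Hypothetical helper; the PR's own extraction code may differ.
    """
    return [
        "ffmpeg",
        "-y",                    # overwrite the output file if it exists
        "-i", str(video_path),   # input video
        "-vn",                   # drop the video stream
        "-acodec", "pcm_s16le",  # 16-bit little-endian PCM
        "-ar", str(sample_rate), # output sample rate in Hz
        "-ac", "1",              # downmix to mono
        str(wav_path),
    ]


cmd = ffmpeg_extract_command("clip.mp4", "clip.wav")
print(" ".join(cmd))
# To actually run it (requires ffmpeg on PATH):
#   import subprocess
#   subprocess.run(cmd, check=True)
```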
Spectrograms use the same STFT parameters as the Video Node (1024 bins, 50% overlap, Hanning window), followed by amplitude-to-decibel conversion; the colormap is configurable.
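The parameters above can be sketched with plain NumPy. This is an illustration under stated assumptions, not the Video Node's `fourier_transformation` or `make_logscale`: "1024 bins" is read here as a 1024-sample frame, the function names and the dB floor are hypothetical, and no logarithmic frequency rescaling is applied.

```python
import numpy as np


def stft_magnitude(samples, bin_size=1024, overlap=0.5):
    """Magnitude STFT with a Hanning window (illustrative, not the PR's code)."""
    hop = int(bin_size * (1 - overlap))      # 512 samples at 50% overlap
    window = np.hanning(bin_size)
    n_frames = 1 + (len(samples) - bin_size) // hop
    frames = np.stack(
        [samples[i * hop : i * hop + bin_size] * window for i in range(n_frames)]
    )
    # rfft keeps the non-negative frequencies: bin_size // 2 + 1 columns.
    return np.abs(np.fft.rfft(frames, axis=1))


def amplitude_to_db(magnitude, floor=1e-5):
    """Convert amplitude to decibels; the floor (an assumption) avoids log(0)."""
    return 20.0 * np.log10(np.maximum(magnitude, floor))


# One second of a 1 kHz test tone at 44.1 kHz.
sample_rate = 44100
t = np.arange(sample_rate) / sample_rate
tone = np.sin(2 * np.pi * 1000 * t)

spec_db = amplitude_to_db(stft_magnitude(tone))
peak_bin = int(np.argmax(spec_db.mean(axis=0)))
peak_hz = peak_bin * sample_rate / 1024
print(spec_db.shape, peak_hz)
```

With these settings each frequency bin spans roughly 43 Hz (44100 / 1024), so the strongest bin for the test tone should land within one bin width of 1 kHz.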