ComfyUI Audio Segment Splitter

🎵 A ComfyUI custom node for intelligent audio segmentation with overlap support

Split audio at integer-second start points with decimal-duration segments, designed for context-aware audio processing tasks.

📑 Table of Contents

Features
Use Cases
Installation
Usage
How It Works
Example Output
Workflow Examples
Technical Details
Troubleshooting
Contributing
License
Changelog

✨ Features

🎯 Integer-Second Start Points: Split at 0s, 10s, 20s, 30s...
📏 Decimal Segment Duration: Support for 10.44s, 20.28s, etc.
🔄 Intentional Overlap: Preserve context between segments
⏱️ Timestamp Information: Detailed start/end times for each segment
📊 ASCII Visualization: Timeline preview of segmentation
🚀 Zero Dependencies: Uses ComfyUI built-in libraries
🎛️ Flexible Output: Independent audio segments as list

🎯 Use Cases

Perfect for audio processing tasks requiring contextual information:

🎤 Speech Recognition: Prevent sentence truncation at split points
🎵 Music Analysis: Maintain note and beat integrity
🔊 Audio Transcription: Ensure sufficient context per segment
🎬 Video Dubbing: Align audio segments for post-production
🧪 Audio Research: Consistent windowing for ML/AI applications

📦 Installation

Method 1: ComfyUI Manager (Recommended)

Open ComfyUI Manager
Search for "Audio Segment Splitter" or "comfy_AudioSeg"
Click Install
Restart ComfyUI

Method 2: Manual Installation

cd ComfyUI/custom_nodes
git clone https://github.com/huangkun1985/comfy_AudioSeg.git
# Restart ComfyUI

Method 3: Direct Download

Download the latest release
Extract to ComfyUI/custom_nodes/comfy_AudioSeg/
Restart ComfyUI

🚀 Usage

Quick Start

Find the Node: Right-click in ComfyUI → Search "Audio Segment Splitter" (under audio category)
Connect Input: Link an audio source (e.g., LoadAudio)
Set Duration: Configure segment_duration parameter (default: 10.0s)
Run: Execute workflow to get segmented audio

Node Parameters

Parameter	Type	Description	Default	Range
`audio`	AUDIO	Input audio data	-	-
`segment_duration`	FLOAT	Segment length in seconds	10.0	0.1 - 3600.0

Node Outputs

Output	Type	Description
`segments`	AUDIO (List)	Independent audio segment list
`segment_info`	STRING	Detailed timing info & visualization

📊 How It Works

Segmentation Logic

Example: 60-second audio with segment_duration = 10.44s

Start Point (int)  │  Segment Range      │  Duration  │  Overlap
─────────────────────────────────────────────────────────────────
0s                 →  [0.00 - 10.44s]    │  10.44s   │  -
10s                →  [10.00 - 20.44s]   │  10.44s   │  0.44s
20s                →  [20.00 - 30.44s]   │  10.44s   │  0.44s
30s                →  [30.00 - 40.44s]   │  10.44s   │  0.44s
40s                →  [40.00 - 50.44s]   │  10.44s   │  0.44s
50s                →  [50.00 - 60.00s]   │  10.00s   │  0.44s (final)

Algorithm

# Integer-second start points
split_interval = int(segment_duration)  # 10.44 → 10
start_points = [0, 10, 20, 30, ...]

# Extract segments with decimal duration
for start in start_points:
    segment = audio[start : start + segment_duration]

📸 Example Output

Visual Preview (segment_info output)

================================================================================
                                 Audio Segmentation Preview
================================================================================
Total Duration: 60.00 seconds
Segment Duration: 10.44 seconds
Start Interval: 10 seconds (integer)
Segment Overlap: 0.44 seconds
Number of Segments: 6
--------------------------------------------------------------------------------

Segment Details:

Index  Start Time   End Time     Duration   Notes
--------------------------------------------------------------------------------
0      0.00         10.44        10.44
1      10.00        20.44        10.44      (0.44s overlap with previous)
2      20.00        30.44        10.44      (0.44s overlap with previous)
3      30.00        40.44        10.44      (0.44s overlap with previous)
4      40.00        50.44        10.44      (0.44s overlap with previous)
5      50.00        60.00        10.00      (Final segment, shorter)
================================================================================

Timeline Visualization:
Time:    0.0   6.0  12.0  18.0  24.0  30.0  36.0  42.0  48.0  54.0  60.0
      |------|------|------|------|------|------|------|------|------|------|
  # 0: [============]
  # 1:       [============]
  # 2:                [============]
  # 3:                         [============]
  # 4:                                  [============]
  # 5:                                           [==========]
================================================================================

🎨 Workflow Examples

Example workflows are included in the workflow/ directory:

Basic Workflow

LoadAudio → AudioSegmentSplitter → PreviewAudio
              ↓
        segment_info → ShowText

Advanced Workflow

LoadAudio → AudioSegmentSplitter → [Process Each Segment] → AudioConcat
              ↓
        segment_info → SaveText

Common Configurations

Scenario	`segment_duration`	Effect
With Overlap	10.44	0.44s overlap between segments
No Overlap	10.0	Exact split, no overlap
Short Segments	5.5	0.5s overlap, 5s intervals
Long Segments	30.2	0.2s overlap, 30s intervals

⚙️ Technical Details

Audio Format

{
    "waveform": torch.Tensor,  # Shape: (batch, channels, samples)
    "sample_rate": int         # Sample rate in Hz
}

Dependencies

torch: PyTorch tensor operations
logging: Console output

All dependencies are included with ComfyUI - no additional installation required!

Performance

⚡ Optimized with PyTorch native operations
💾 Memory-efficient tensor slicing
🔧 Works with any sample rate
📦 Supports mono and stereo audio

Special Cases Handling

Segment < 1s: Start interval automatically adjusted to 1s
Final Segment: Automatically truncated to audio end
No Overlap: Use integer values (e.g., 10.0, 20.0)

🐛 Troubleshooting

Node Not Appearing in Menu

Solution:

Verify installation path: ComfyUI/custom_nodes/comfy_AudioSeg/
Check ComfyUI console for errors
Restart ComfyUI completely

Audio Artifacts at Boundaries

Cause: Sharp cutoff in waveform
Solution: Add fade in/out in post-processing (future feature)

Python/Import Errors

Solution: Ensure ComfyUI is up-to-date with PyTorch installed

🧪 Testing

Run the included test suite:

cd ComfyUI/custom_nodes/comfy_AudioSeg
python test_splitter.py

Tests include:

✅ Basic segmentation (60s → 10.44s segments)
✅ No-overlap mode (30s → 10.0s segments)
✅ Short segments (10s → 0.5s segments)
✅ Final segment handling

🤝 Contributing

We welcome contributions! Here's how you can help:

🐛 Report Bugs: Open an issue
💡 Suggest Features: Start a discussion
🔧 Submit PRs: Fork, code, test, and submit!
📖 Improve Docs: Help us make the documentation better

Development Setup

git clone https://github.com/huangkun1985/comfy_AudioSeg.git
cd comfy_AudioSeg
# Make changes and test
python test_splitter.py

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

📝 Changelog

See CHANGELOG.md for version history and release notes.

🙏 Acknowledgments

ComfyUI Team: For the amazing framework
Community: For feedback and suggestions
Contributors: See Contributors

📞 Support & Contact

📫 Issues: GitHub Issues
💬 Discussions: GitHub Discussions
⭐ Star us: If you find this useful!

Made with ❤️ for the ComfyUI Community

⬆ Back to Top

Version: 1.0.0 | Last Updated: 2025-11-29

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
workflow		workflow
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
LICENSE		LICENSE
PUBLISHING_GUIDE.md		PUBLISHING_GUIDE.md
QUICKSTART.md		QUICKSTART.md
README.md		README.md
RELEASE_NOTES.md		RELEASE_NOTES.md
__init__.py		__init__.py
audio_segment_splitter.py		audio_segment_splitter.py
pyproject.toml		pyproject.toml
test_splitter.py		test_splitter.py

License

huangkun1985/comfy_AudioSeg

Folders and files

Latest commit

History

Repository files navigation