belatedMetronome

A highly accurate video beat correction system.

This program provides both an automatic mode (using midi for alignment) and a manual mode (via GUI) for adjusting the timing of notes in performance videos. Users can fine-tune the alignment by stretching or contracting the timing of both audio and video to match a reference.

Automatic Mode: A double-loop sequence matching and Dynamic Time Warping (DTW) are used to align performance notes to score notes, while recording the relative time offset coefficients and performance video cut points. The relative time coefficients are then applied directly to stretch or compress the corresponding video and audio segments, followed by smooth stitching.

Manual Mode: Offers a graphical interface for manually editing the timing, pitch, and duration of notes. Recommended for complex sheet music or where precise customization is required.

基于crepe notes的视频节拍纠正系统。自动模式下使用双循环序列匹配和DTW来匹配演奏音符至乐谱音符，并记录相对时值偏移系数和演奏视频切割点。应用相对时值系数直接伸缩对应的视频和音频段，然后平滑拼接。

如果乐谱较为复杂或需要自定义时值，则推荐使用手动模式。

未修正的演奏音频：

转换后的演奏midi：

参考midi：

演奏音符列表和参考音符列表中，每个音符被表示为：(i, pitch, relative_duration, relative_position)

计算dtw距离得分，匹配演奏音符和参考音符：

实际计算中考虑了全部音符属性

context_score = dtw_context_similarity(perf_idx, ref_idx)

匹配结果列表

额外记录了演奏音符的起始时间以切割原演奏视频。

            matches.append({
                "order": perf_order,
                "performance_note": {
                    "start": performance_notes[perf_idx][0],
                    "pitch": performance_notes[perf_idx][1],
                    "duration": performance_notes[perf_idx][2]
                },
                "reference_note": {
                    "start": reference_notes[ref_idx][0],
                    "pitch": reference_notes[ref_idx][1],
                    "duration": reference_notes[ref_idx][2]
                },
                "original_relative_position": perf_position,
                "corrected_relative_position": ref_position,
                "time_correction": time_correction,
                "relative_offset": relative_offset,
                "match_round": 1,
                "dtw_score": dtw_score
            })

计算 time correction：

ref_duration和perf_duration是归一化到节拍的相对值。

time_correction = ref_duration / perf_duration if perf_duration > 0 else 1.0

计算 adjusted time correction（补偿交叉渐变造成的时长损失，crossfade_duration默认为10ms）：

adjusted_time_correction = time_correction * (1 + crossfade_duration / duration)

应用 adjusted time correction

corrected_audio = np.stack([
    librosa.effects.time_stretch(segment_audio[0], rate=1 / adjusted_time_correction),
    librosa.effects.time_stretch(segment_audio[1], rate=1 / adjusted_time_correction)
]).astype('float32')

修正后的演奏音频。稳态高能段（音符）的相对时值和参考midi完全一致：

Name		Name	Last commit message	Last commit date
Latest commit History 86 Commits
.github/workflows		.github/workflows
LICENSE		LICENSE
README.md		README.md
belatedMetronome.py		belatedMetronome.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

belatedMetronome

未修正的演奏音频：

转换后的演奏midi：

参考midi：

计算dtw距离得分，匹配演奏音符和参考音符：

匹配结果列表

计算 time correction：

修正后的演奏音频。稳态高能段（音符）的相对时值和参考midi完全一致：

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

belatedMetronome

未修正的演奏音频：

转换后的演奏midi：

参考midi：

计算dtw距离得分，匹配演奏音符和参考音符：

匹配结果列表

计算 time correction：

修正后的演奏音频。稳态高能段（音符）的相对时值和参考midi完全一致：

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages