support input waveform by YichongLeng · Pull Request #125 · MoonshotAI/Kimi-Audio

YichongLeng · 2025-06-21T15:33:44Z

Support using waveform as input besides audio path.

Discussion in #124 .

MoyanZitto · 2025-06-23T02:41:48Z

kimia_infer/api/prompt_manager.py

+        if isinstance(wav_path_or_waveform, str):
+            wav_tokens = self.audio_tokenizer.tokenize(audio_path=wav_path_or_waveform)
+        else:
+            wav_tokens = self.audio_tokenizer.tokenize(speech=wav_path_or_waveform)


这种情况下好像没办法保证wav_path_or_waveform一定sr=16000，如果用错了会有难发现的bug（能正常推但是结果不对）

感觉可以来个org_sr，当wav_path_or_waveform是一个ndarray / tensor的时候，要求同时提供一下这个wavform对应的sr，这样我们可以里面resample一下？

support input waveform

43b4020

YichongLeng requested a review from MoyanZitto June 21, 2025 15:33

YichongLeng self-assigned this Jun 21, 2025

MoyanZitto reviewed Jun 24, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

support input waveform#125

support input waveform#125
YichongLeng wants to merge 1 commit intomasterfrom
support_input_waveform

YichongLeng commented Jun 21, 2025

Uh oh!

MoyanZitto Jun 23, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

YichongLeng commented Jun 21, 2025

Uh oh!

MoyanZitto Jun 23, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants