Commit 304c0b9
feat(plugins): add Parakeet TDT speech-to-text plugin (#281)
* feat(plugins): add Parakeet TDT speech-to-text plugin
Add a new native plugin for fast English speech recognition using NVIDIA's
Parakeet TDT (Token-and-Duration Transducer) 0.6B model via sherpa-onnx.
Parakeet TDT is approximately 10x faster than Whisper on consumer hardware
with competitive accuracy (#1 on HuggingFace ASR leaderboard).
Plugin implementation:
- Offline transducer recognizer (encoder/decoder/joiner) via sherpa-onnx C API
- Silero VAD v6 for streaming speech segmentation
- Recognizer caching keyed on (model_dir, num_threads, execution_provider)
- Configurable VAD threshold, silence duration, and max segment length
- 16kHz mono f32 audio input, transcription output
Justfile additions:
- build-plugin-native-parakeet: build the plugin
- download-parakeet-models: download INT8 quantized model (~660MB)
- setup-parakeet: full setup (sherpa-onnx + models + VAD)
- Added parakeet to copy-plugins-native loop
Includes sample oneshot pipeline (parakeet-stt.yml) and plugin.yml manifest.
Signed-off-by: StreamKit Devin <devin@streamkit.dev>
Co-Authored-By: Claudio Costa <cstcld91@gmail.com>
* fix(plugins): address review feedback for parakeet plugin
- Add build-plugin-native-parakeet to build-plugins-native target
- Fix plugin.yml repo_id to match actual HuggingFace source repos
(csukuangfj/sherpa-onnx-nemo-parakeet-tdt-0.6b-v2-int8 for model,
streamkit/sensevoice-models for silero-vad)
- Regenerate marketplace/official-plugins.json with parakeet entry
- Add download-parakeet-models as optional in download-models output
(skipped by default due to ~660MB size, similar to pocket-tts)
Signed-off-by: StreamKit Devin <devin@streamkit.dev>
Co-Authored-By: Claudio Costa <cstcld91@gmail.com>
* docs(plugins): add parakeet docs page, fix model checksums and download
- Add plugin docs page (plugin-native-parakeet.md) with parameters,
example pipeline, and JSON schema
- Update plugin index to include parakeet (10 → 11 official plugins)
- Fix model download: individual files from HuggingFace instead of
non-existent tar.bz2 archive
- Add per-file sha256 checksums via file_checksums field (matching
ModelSpec struct) for integrity verification
- Fix expected_size_bytes to actual total (661190513)
- Regenerate marketplace/official-plugins.json
Signed-off-by: StreamKit Devin <devin@streamkit.dev>
Co-Authored-By: Claudio Costa <cstcld91@gmail.com>
* fix(engine): skip synthetic nodes in oneshot content-type backward walk
The content-type backward walk in run_oneshot_pipeline walks backwards
through the pipeline graph to find a node that declares a content_type.
When no node in the chain returns a content_type (e.g. STT pipelines
ending in json_serialize), the walk reaches streamkit::http_input which
is a synthetic node not in the registry, causing a 500 error.
Skip synthetic oneshot nodes (http_input/http_output) in the backward
walk since they are handled separately by the engine and are not
registered in the node registry.
This fixes all STT-style oneshot pipelines (parakeet-stt, sensevoice-stt,
speech_to_text, etc.) that use json_serialize → http_output.
Signed-off-by: StreamKit Devin <devin@streamkit.dev>
Co-Authored-By: Claudio Costa <cstcld91@gmail.com>
* style(engine): format oneshot backward walk
Signed-off-by: StreamKit Devin <devin@streamkit.dev>
Co-Authored-By: Claudio Costa <cstcld91@gmail.com>
* docs(plugins): add README for parakeet plugin
Signed-off-by: StreamKit Devin <devin@streamkit.dev>
Co-Authored-By: Claudio Costa <cstcld91@gmail.com>
* fix(plugins): update parakeet model repo_id to streamkit/parakeet-models
Update the parakeet plugin.yml to point to the controlled
streamkit/parakeet-models HuggingFace repo instead of the external
csukuangfj repo. Regenerate marketplace metadata accordingly.
Signed-off-by: StreamKit Devin <devin@streamkit.dev>
Co-Authored-By: Claudio Costa <cstcld91@gmail.com>
* fix(plugins): update parakeet model download URL to streamkit HF space
Point the justfile download target and README references to
streamkit/parakeet-models instead of the external csukuangfj repo.
Original export attribution preserved in README.
Signed-off-by: StreamKit Devin <devin@streamkit.dev>
Co-Authored-By: Claudio Costa <cstcld91@gmail.com>
* fix(plugins): address parakeet review feedback
- Point silero-vad repo_id to streamkit/parakeet-models instead of
streamkit/sensevoice-models to avoid cross-plugin dependency
- Remove unused cc build-dependency
- Remove unused once_cell dependency (code uses std::sync::LazyLock)
- Fix misleading update_params comment that claimed VAD params could
be updated at runtime
- Remove const from set_threshold (f32::clamp is not const-stable)
Signed-off-by: StreamKit Devin <devin@streamkit.dev>
Co-Authored-By: Claudio Costa <cstcld91@gmail.com>
---------
Signed-off-by: StreamKit Devin <devin@streamkit.dev>
Co-authored-by: StreamKit Devin <devin@streamkit.dev>
Co-authored-by: Claudio Costa <cstcld91@gmail.com>1 parent 9f2742e commit 304c0b9
File tree
16 files changed
+3012
-3
lines changed- crates/engine/src
- docs/src/content/docs/reference/plugins
- marketplace
- plugins/native/parakeet
- src
- samples/pipelines/oneshot
16 files changed
+3012
-3
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
384 | 384 | | |
385 | 385 | | |
386 | 386 | | |
| 387 | + | |
| 388 | + | |
| 389 | + | |
| 390 | + | |
| 391 | + | |
387 | 392 | | |
388 | 393 | | |
389 | 394 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
14 | 14 | | |
15 | 15 | | |
16 | 16 | | |
17 | | - | |
| 17 | + | |
18 | 18 | | |
19 | 19 | | |
20 | 20 | | |
21 | 21 | | |
22 | 22 | | |
| 23 | + | |
23 | 24 | | |
24 | 25 | | |
25 | 26 | | |
| |||
Lines changed: 144 additions & 0 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
| 1 | + | |
| 2 | + | |
| 3 | + | |
| 4 | + | |
| 5 | + | |
| 6 | + | |
| 7 | + | |
| 8 | + | |
| 9 | + | |
| 10 | + | |
| 11 | + | |
| 12 | + | |
| 13 | + | |
| 14 | + | |
| 15 | + | |
| 16 | + | |
| 17 | + | |
| 18 | + | |
| 19 | + | |
| 20 | + | |
| 21 | + | |
| 22 | + | |
| 23 | + | |
| 24 | + | |
| 25 | + | |
| 26 | + | |
| 27 | + | |
| 28 | + | |
| 29 | + | |
| 30 | + | |
| 31 | + | |
| 32 | + | |
| 33 | + | |
| 34 | + | |
| 35 | + | |
| 36 | + | |
| 37 | + | |
| 38 | + | |
| 39 | + | |
| 40 | + | |
| 41 | + | |
| 42 | + | |
| 43 | + | |
| 44 | + | |
| 45 | + | |
| 46 | + | |
| 47 | + | |
| 48 | + | |
| 49 | + | |
| 50 | + | |
| 51 | + | |
| 52 | + | |
| 53 | + | |
| 54 | + | |
| 55 | + | |
| 56 | + | |
| 57 | + | |
| 58 | + | |
| 59 | + | |
| 60 | + | |
| 61 | + | |
| 62 | + | |
| 63 | + | |
| 64 | + | |
| 65 | + | |
| 66 | + | |
| 67 | + | |
| 68 | + | |
| 69 | + | |
| 70 | + | |
| 71 | + | |
| 72 | + | |
| 73 | + | |
| 74 | + | |
| 75 | + | |
| 76 | + | |
| 77 | + | |
| 78 | + | |
| 79 | + | |
| 80 | + | |
| 81 | + | |
| 82 | + | |
| 83 | + | |
| 84 | + | |
| 85 | + | |
| 86 | + | |
| 87 | + | |
| 88 | + | |
| 89 | + | |
| 90 | + | |
| 91 | + | |
| 92 | + | |
| 93 | + | |
| 94 | + | |
| 95 | + | |
| 96 | + | |
| 97 | + | |
| 98 | + | |
| 99 | + | |
| 100 | + | |
| 101 | + | |
| 102 | + | |
| 103 | + | |
| 104 | + | |
| 105 | + | |
| 106 | + | |
| 107 | + | |
| 108 | + | |
| 109 | + | |
| 110 | + | |
| 111 | + | |
| 112 | + | |
| 113 | + | |
| 114 | + | |
| 115 | + | |
| 116 | + | |
| 117 | + | |
| 118 | + | |
| 119 | + | |
| 120 | + | |
| 121 | + | |
| 122 | + | |
| 123 | + | |
| 124 | + | |
| 125 | + | |
| 126 | + | |
| 127 | + | |
| 128 | + | |
| 129 | + | |
| 130 | + | |
| 131 | + | |
| 132 | + | |
| 133 | + | |
| 134 | + | |
| 135 | + | |
| 136 | + | |
| 137 | + | |
| 138 | + | |
| 139 | + | |
| 140 | + | |
| 141 | + | |
| 142 | + | |
| 143 | + | |
| 144 | + | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
723 | 723 | | |
724 | 724 | | |
725 | 725 | | |
| 726 | + | |
| 727 | + | |
| 728 | + | |
| 729 | + | |
| 730 | + | |
| 731 | + | |
| 732 | + | |
| 733 | + | |
| 734 | + | |
| 735 | + | |
| 736 | + | |
| 737 | + | |
| 738 | + | |
| 739 | + | |
| 740 | + | |
| 741 | + | |
| 742 | + | |
| 743 | + | |
| 744 | + | |
| 745 | + | |
| 746 | + | |
| 747 | + | |
| 748 | + | |
| 749 | + | |
| 750 | + | |
| 751 | + | |
| 752 | + | |
| 753 | + | |
| 754 | + | |
| 755 | + | |
| 756 | + | |
| 757 | + | |
| 758 | + | |
726 | 759 | | |
727 | 760 | | |
728 | 761 | | |
| |||
792 | 825 | | |
793 | 826 | | |
794 | 827 | | |
| 828 | + | |
| 829 | + | |
| 830 | + | |
795 | 831 | | |
796 | 832 | | |
797 | 833 | | |
| |||
979 | 1015 | | |
980 | 1016 | | |
981 | 1017 | | |
982 | | - | |
| 1018 | + | |
983 | 1019 | | |
984 | 1020 | | |
985 | 1021 | | |
| |||
1042 | 1078 | | |
1043 | 1079 | | |
1044 | 1080 | | |
1045 | | - | |
| 1081 | + | |
1046 | 1082 | | |
1047 | 1083 | | |
1048 | 1084 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
143 | 143 | | |
144 | 144 | | |
145 | 145 | | |
| 146 | + | |
| 147 | + | |
| 148 | + | |
| 149 | + | |
| 150 | + | |
| 151 | + | |
| 152 | + | |
| 153 | + | |
| 154 | + | |
| 155 | + | |
| 156 | + | |
| 157 | + | |
| 158 | + | |
| 159 | + | |
| 160 | + | |
| 161 | + | |
| 162 | + | |
| 163 | + | |
| 164 | + | |
| 165 | + | |
| 166 | + | |
| 167 | + | |
| 168 | + | |
| 169 | + | |
| 170 | + | |
| 171 | + | |
| 172 | + | |
| 173 | + | |
| 174 | + | |
| 175 | + | |
| 176 | + | |
| 177 | + | |
| 178 | + | |
| 179 | + | |
| 180 | + | |
| 181 | + | |
| 182 | + | |
| 183 | + | |
| 184 | + | |
| 185 | + | |
| 186 | + | |
| 187 | + | |
| 188 | + | |
| 189 | + | |
| 190 | + | |
| 191 | + | |
| 192 | + | |
| 193 | + | |
| 194 | + | |
| 195 | + | |
| 196 | + | |
| 197 | + | |
| 198 | + | |
146 | 199 | | |
147 | 200 | | |
148 | 201 | | |
| |||
0 commit comments