feat: Added libfvad library and integrated Voice Activity Detection #457
base: main
added unit test for VAD
    p.wait_for_all();
}

BOOST_AUTO_TEST_CASE(vad_aggressiveness_test, *boost::unit_test::disabled())
This test is incomplete for now. Please add a source to verify all the different modes.
base/CMakeLists.txt
Outdated
include/ColorConversionXForm.h
include/Overlay.h
include/AudioToTextXForm.h
include/VADTransform.h
Please rename it to VoiceAudioDetector.
base/CMakeLists.txt
Outdated
bigint::bigint
sfml-audio
whisper::whisper
fvad
Can we bring it in from vcpkg?
enum FrameLength {
    FRAME_10MS = 10,
    FRAME_20MS = 20,
Please add comments describing these modes, i.e. the effect of each frame length setting.
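One way the requested comments could look is sketched below. This is only an illustration, not the code from the PR: `FRAME_30MS` and the helper `samplesPerFrame` are assumptions added here (the excerpt shows only the first two enumerators), based on libfvad accepting 10, 20 or 30 ms frames at 8000, 16000, 32000 or 48000 Hz.

```cpp
#include <cassert>
#include <cstddef>

// Duration of the audio chunk handed to the detector for each decision.
// libfvad accepts 10, 20 or 30 ms frames.
// FRAME_30MS is an assumption; the diff excerpt only shows the first two values.
enum FrameLength {
    FRAME_10MS = 10, // shortest frame: lowest latency, least audio per decision
    FRAME_20MS = 20, // middle ground between latency and audio per decision
    FRAME_30MS = 30  // longest frame: most audio per decision, highest latency
};

// Hypothetical helper: number of mono samples contained in one frame.
inline std::size_t samplesPerFrame(int sampleRateHz, FrameLength length)
{
    // Sample rates supported by libfvad.
    assert(sampleRateHz == 8000 || sampleRateHz == 16000 ||
           sampleRateHz == 32000 || sampleRateHz == 48000);
    return static_cast<std::size_t>(sampleRateHz) *
           static_cast<std::size_t>(length) / 1000;
}
```

With 16 kHz input, for example, a 20 ms frame holds 320 samples, so the module would need to buffer input until that many samples are available.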
bool init()
{
    // Create libfvad instance
    mVad = fvad_new();
Please use the complete name (avoid abbreviations).
// Test all 4 aggressiveness modes
VADTransformProps::AggressivenessMode modes[] = {
    VADTransformProps::QUALITY,
It seems we support getProps and setProps. Please add unit tests for them.
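A rough sketch of the round trip such a test might check is below. Everything in it is hypothetical: `DetectorProps`, `FakeDetector` and `runPropsRoundTripChecks` stand in for the real module and its props class, which are not shown in this excerpt, and a real test would use the project's Boost.Test macros rather than plain `assert`. The 0 to 3 mode range matches what libfvad's `fvad_set_mode` accepts.

```cpp
#include <cassert>

// Hypothetical stand-in for the module's props class.
struct DetectorProps {
    int aggressiveness; // 0..3, the range fvad_set_mode accepts
    int frameLengthMs;  // 10, 20 or 30
};

// Hypothetical stand-in for the module; it only stores and validates props.
class FakeDetector {
public:
    explicit FakeDetector(DetectorProps props) : mProps(props) {}

    DetectorProps getProps() const { return mProps; }

    // Rejects out-of-range values and leaves the current props untouched.
    bool setProps(const DetectorProps& props)
    {
        const bool modeOk = props.aggressiveness >= 0 && props.aggressiveness <= 3;
        const bool lengthOk = props.frameLengthMs == 10 ||
                              props.frameLengthMs == 20 ||
                              props.frameLengthMs == 30;
        if (!modeOk || !lengthOk) {
            return false;
        }
        mProps = props;
        return true;
    }

private:
    DetectorProps mProps;
};

// The checks a getProps/setProps test could perform.
inline void runPropsRoundTripChecks()
{
    FakeDetector detector(DetectorProps{0, 10});

    // A valid update must be visible through getProps.
    assert(detector.setProps(DetectorProps{3, 30}));
    assert(detector.getProps().aggressiveness == 3);
    assert(detector.getProps().frameLengthMs == 30);

    // An invalid update must be rejected and must not change the props.
    assert(!detector.setProps(DetectorProps{7, 30}));
    assert(detector.getProps().aggressiveness == 3);
}
```

Covering both the accepted and the rejected update keeps the test useful even if validation rules change later.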
auto outFrame = makeFrame(sizeof(int));
int vadResult = (result == 1) ? 1 : 0;
memcpy(outFrame->data(), &vadResult, sizeof(int));
We should also send the input frame along with the voice detection info.
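A minimal sketch of what that could mean is below, assuming the output simply bundles a copy of the input samples with the decision. `DetectionOutput` and `makeDetectionOutput` are hypothetical names, not part of the PR or of libfvad. The sketch also keeps the error case separate: `fvad_process` returns 1 for voice, 0 for no voice and -1 for an invalid frame length, and the snippet above folds -1 into "no voice".

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Hypothetical output: the untouched input audio plus the detector's decision.
struct DetectionOutput {
    std::vector<int16_t> audio; // copy of the input frame
    bool speechDetected;        // true only when the detector reported voice
    bool valid;                 // false when the detector reported an error
};

// rawResult follows the fvad_process convention:
// 1 = voice, 0 = no voice, -1 = invalid frame length.
inline DetectionOutput makeDetectionOutput(const std::vector<int16_t>& input,
                                           int rawResult)
{
    assert(rawResult >= -1 && rawResult <= 1);
    return DetectionOutput{input, rawResult == 1, rawResult != -1};
}
```

Downstream modules can then gate on `speechDetected` without needing a second pin to get the audio back.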
1. Renamed VAD to voice activity detector.
2. Brought fvad in from vcpkg.
3. Updated the unit test.
4. Now sending the input frame along with the voice detection info.
false: audio has both speech and silence; true: audio has only speech
No description provided.