SileroVAD VAD Node
The voice activity detector (VAD) node uses SileroVAD to detect the presence of voice activity in the audio signal.
Number of inputs | 1 |
Number of outputs | 0 |
Configuration​
frameSize
(Optional)​
Type: Int
The size of the audio buffer (in number of frames) used for processing. Default value is 512.
threshold
(Optional)​
Type: Float
The sensitivity threshold for detecting voice activity. Higher values make detection stricter. Default value is 0.5.
minSilenceDurationMs
(Optional)​
Type: Int
The minimum duration of silence (in milliseconds) required to consider speech as ended. Default value is 100.
speechPadMs
(Optional)​
Type: Int
The amount of padding (in milliseconds) added before and after detected speech segments. The default value is 0.
Values​
Inherits values from its parent, SingleBusAudioSinkNode.
Actions​
Inherits actions from its parent, SingleBusAudioSinkNode.
Events​
Inherits events from its parent, SingleBusAudioSinkNode.
start
​
Indicates when voice activity starts.
Data​
None
end
​
Indicates when voice activity ends.
Data​
Name | Data | Description |
---|---|---|
start | Double | Indicates when voice activity started. |
end | Double | Indicates when voice activity ended. |