
SileroVAD VAD Node

The voice activity detector (VAD) node uses SileroVAD to detect the presence of voice activity in the audio signal.

Number of inputs: 1
Number of outputs: 0

Configuration

frameSize (Optional)

Type: Int

The size of the audio buffer (in number of frames) used for processing. Default value is 512.
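To get a feel for what a frame count means in time, the buffer length can be converted to milliseconds. The sketch below assumes a 16 kHz sample rate, which this page does not state; `frame_duration_ms` is a hypothetical helper, not part of the node's API.

```python
def frame_duration_ms(frame_size: int = 512, sample_rate_hz: int = 16_000) -> float:
    """Return the duration of one processing buffer in milliseconds.

    sample_rate_hz is an assumption; substitute the rate of your audio graph.
    """
    if frame_size <= 0 or sample_rate_hz <= 0:
        raise ValueError("frame_size and sample_rate_hz must be positive")
    return frame_size * 1000.0 / sample_rate_hz


print(frame_duration_ms())  # 32.0
```

Under that assumption, the default of 512 frames corresponds to 32 ms of audio per buffer.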

threshold (Optional)

Type: Float

The sensitivity threshold for detecting voice activity. Higher values make detection stricter, so less audio is classified as speech. Default value is 0.5.

minSilenceDurationMs (Optional)

Type: Int

The minimum duration of silence (in milliseconds) required to consider speech as ended. Default value is 100.
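The interplay between `threshold` and `minSilenceDurationMs` can be pictured with a small state machine. This is an illustrative sketch only, not the node's implementation: it assumes the detector produces one speech probability per buffer, that each buffer lasts 32 ms, and that a probability at or above the threshold counts as speech. `detect_events` and its arguments are hypothetical names.

```python
from typing import Iterable, List, Tuple


def detect_events(
    probabilities: Iterable[float],
    threshold: float = 0.5,
    min_silence_duration_ms: int = 100,
    frame_ms: float = 32.0,  # assumed buffer duration, not taken from the docs
) -> List[Tuple[str, float]]:
    """Return ("start" | "end", time_ms) pairs for per-buffer speech probabilities."""
    events: List[Tuple[str, float]] = []
    speaking = False
    silence_ms = 0.0
    for index, probability in enumerate(probabilities):
        now_ms = index * frame_ms
        if probability >= threshold:
            # Any speech buffer resets the running silence counter.
            silence_ms = 0.0
            if not speaking:
                speaking = True
                events.append(("start", now_ms))
        elif speaking:
            # Accumulate silence; end the segment only once it lasts long enough.
            silence_ms += frame_ms
            if silence_ms >= min_silence_duration_ms:
                speaking = False
                events.append(("end", now_ms + frame_ms))
    return events


probs = [0.1, 0.9, 0.8, 0.2, 0.9, 0.1, 0.1, 0.1, 0.1]
print(detect_events(probs))  # [('start', 32.0), ('end', 288.0)]
```

In this sketch the single quiet buffer at index 3 does not end the segment, because 32 ms of silence is shorter than the 100 ms minimum. The `end` time is the moment the silence requirement is met, not the moment speech last occurred.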

speechPadMs (Optional)

Type: Int

The amount of padding (in milliseconds) added before and after detected speech segments. Default value is 0.
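As a rough picture of what padding does to a detected segment, the sketch below widens a segment by the padding amount on both sides and clamps it to the bounds of the audio. `pad_segment` is a hypothetical helper; how the node applies padding internally is not described on this page.

```python
from typing import Optional, Tuple


def pad_segment(
    start_ms: float,
    end_ms: float,
    speech_pad_ms: int = 0,
    audio_length_ms: Optional[float] = None,
) -> Tuple[float, float]:
    """Widen a speech segment by speech_pad_ms on each side, clamped to the audio."""
    if speech_pad_ms < 0:
        raise ValueError("speech_pad_ms must not be negative")
    padded_start = max(0.0, float(start_ms) - speech_pad_ms)
    padded_end = float(end_ms) + speech_pad_ms
    if audio_length_ms is not None:
        # Do not extend the segment past the end of the available audio.
        padded_end = min(padded_end, float(audio_length_ms))
    return padded_start, padded_end


print(pad_segment(20, 500, speech_pad_ms=30))  # (0.0, 530.0)
```

With the default padding of 0, the segment is returned unchanged.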


Values

Inherits values from its parent, SingleBusAudioSinkNode.


Actions

Inherits actions from its parent, SingleBusAudioSinkNode.


Events

Inherits events from its parent, SingleBusAudioSinkNode, and emits the following events of its own.

start

Emitted when voice activity starts.

Data

None

end

Emitted when voice activity ends.

Data

| Name | Type | Description |
|---|---|---|
| start | Double | Time at which voice activity started. |
| end | Double | Time at which voice activity ended. |
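A handler for the `end` event might compute the length of the detected segment from its `start` and `end` fields. The sketch assumes the payload arrives as a mapping with `start` and `end` keys expressed in the same time unit; this page states neither the unit nor the delivery mechanism, so `segment_duration` is a hypothetical helper.

```python
from typing import Mapping


def segment_duration(event_data: Mapping[str, float]) -> float:
    """Return end - start for an `end` event payload (unit as delivered by the node)."""
    start = float(event_data["start"])
    end = float(event_data["end"])
    if end < start:
        raise ValueError("end must not be earlier than start")
    return end - start


print(segment_duration({"start": 1.25, "end": 3.75}))  # 2.5
```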