AudioTranscribeWhisperVadConfiguration
Configure VAD parameters to use a VAD model integrated in whisper-cpp. These parameters are directly from the whisper VAD configuration
Field |
Type |
Repeated |
Description |
model |
The GGML VAD model file path (required) |
||
threshold |
|||
min_speech_duration_ms |
Min duration for a valid speech segment. |
||
min_silence_duration_ms |
Min silence duration to consider speech as ended. |
||
max_speech_duration_s |
Max duration of a speech segment before forcing a new segment. |
||
speech_pad_ms |
Padding added before and after speech segments. |
||
samples_overlap |
Overlap in seconds when copying audio samples from speech segment. |