Skip to main content

Speech-to-Text (STT)

A speech-to-text (STT) node transcribes spoken audio into written text, enabling voice control, transcription services, and accessibility features. It typically supports real-time and batch processing, multiple languages, speaker diarization, and punctuation handling. Some implementations also offer confidence scoring and formatting options.

The following extensions provide STT nodes: