Overview
The Segment and Token types represent transcribed audio with timing information. A Segment represents a complete phrase or sentence, while a Token represents an individual word within that segment.
Segment
Represents a segment of transcribed audio with timing information. May include detailed token-level (word-by-word) information when available.
Type definition
Properties
The transcribed text for this segment.
Start time of the segment in seconds.
End time of the segment in seconds.
Confidence score for this transcription (between 0 and 1). Higher values indicate greater confidence in the transcription accuracy.
Word-by-word breakdown of the transcription with individual timings.
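The type definition itself is not reproduced above, so the following is only a sketch of how the listed properties might be modelled and consumed. The field names (`text`, `start`, `end`, `confidence`, `tokens`), the optionality of `confidence` and `tokens`, and the helpers `formatTimestamp` and `describeSegment` are all assumptions made for illustration, not the library's actual definitions.

```typescript
// Hypothetical shapes: field names are assumed from the property descriptions.
interface TokenSketch {
  text: string;        // transcribed word or phrase
  start: number;       // start time in seconds
  end: number;         // end time in seconds
  confidence?: number; // between 0 and 1
}

interface SegmentSketch {
  text: string;           // transcribed text for this segment
  start: number;          // start time of the segment in seconds
  end: number;            // end time of the segment in seconds
  confidence?: number;    // between 0 and 1; higher means more confident
  tokens?: TokenSketch[]; // word-by-word breakdown, when available
}

// Left-pad a non-negative integer with zeros to the given width.
function pad(value: number, width: number): string {
  let out = String(value);
  while (out.length < width) {
    out = "0" + out;
  }
  return out;
}

// Format a time in seconds as mm:ss.mmm.
function formatTimestamp(seconds: number): string {
  const totalMs = Math.round(seconds * 1000);
  const minutes = Math.floor(totalMs / 60000);
  const secs = Math.floor((totalMs % 60000) / 1000);
  const ms = totalMs % 1000;
  return `${pad(minutes, 2)}:${pad(secs, 2)}.${pad(ms, 3)}`;
}

// Render one segment as a single caption-style line.
function describeSegment(segment: SegmentSketch): string {
  return `[${formatTimestamp(segment.start)} - ${formatTimestamp(segment.end)}] ${segment.text}`;
}
```

A helper like `describeSegment` only needs the segment-level timing, so it works whether or not token-level detail is present.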
Token
Represents a token (word or phrase) in the transcription with timing information.
Type definition
Properties
The transcribed text (word or phrase).
Start time in seconds. Inherited from the TimeRange type (from ffmpeg-simplified).
End time in seconds. Inherited from the TimeRange type (from ffmpeg-simplified).
Confidence score for this transcription (between 0 and 1).
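Since the start and end times are described as inherited from TimeRange, a Token can be pictured as a time range extended with text and a confidence score. The sketch below uses a local stand-in for TimeRange rather than importing it from ffmpeg-simplified; the field names and the helpers `lowConfidenceTokens` and `tokenAt` are hypothetical and serve only to show how token timings and confidence might be used.

```typescript
// Local stand-in for the TimeRange type; its real shape is assumed, not confirmed.
interface TimeRangeSketch {
  start: number; // start time in seconds
  end: number;   // end time in seconds
}

// Hypothetical Token shape: a time range plus text and an optional confidence.
interface TokenSketch extends TimeRangeSketch {
  text: string;
  confidence?: number; // between 0 and 1
}

// Return the tokens whose confidence is known and falls below the threshold.
function lowConfidenceTokens(tokens: TokenSketch[], threshold: number): TokenSketch[] {
  return tokens.filter(
    (token) => token.confidence !== undefined && token.confidence < threshold
  );
}

// Find the token being spoken at a given time (start inclusive, end exclusive).
function tokenAt(tokens: TokenSketch[], time: number): TokenSketch | undefined {
  for (const token of tokens) {
    if (time >= token.start && time < token.end) {
      return token;
    }
  }
  return undefined;
}
```

Tokens without a confidence value are left out of `lowConfidenceTokens` on purpose: a missing score is treated as unknown rather than as low.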
Usage examples
Response structure example
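As an illustration of the structure described on this page, a transcription result could look like the array below. The values are made up for the example, and the field names (`text`, `start`, `end`, `confidence`, `tokens`) are assumed from the property descriptions rather than taken from the actual type definition; `joinTranscript` is a hypothetical helper.

```typescript
// Hypothetical shapes, assumed from the property descriptions.
interface TokenSketch {
  text: string;
  start: number;
  end: number;
  confidence?: number;
}

interface SegmentSketch {
  text: string;
  start: number;
  end: number;
  confidence?: number;
  tokens?: TokenSketch[];
}

// Illustrative data only: two segments, the first with token-level detail.
const exampleResponse: SegmentSketch[] = [
  {
    text: "Hello world.",
    start: 0,
    end: 1.2,
    confidence: 0.95,
    tokens: [
      { text: "Hello", start: 0, end: 0.5, confidence: 0.97 },
      { text: "world.", start: 0.6, end: 1.2, confidence: 0.93 },
    ],
  },
  {
    text: "This is a test.",
    start: 1.5,
    end: 3.0,
    confidence: 0.9,
  },
];

// Join the segment texts into a single transcript string.
function joinTranscript(segments: SegmentSketch[]): string {
  return segments.map((segment) => segment.text).join(" ");
}

console.log(joinTranscript(exampleResponse));
```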
Related
transcribe: Returns an array of Segment objects.
Callbacks: onTranscriptionFinished receives Segment[].
Error handling: TranscriptionError includes partial segments.
Resume failed transcriptions: Continue from partial transcriptions.