SpeechRecognitionResult
public struct SpeechRecognitionResult : Codable, Equatable
Component results for a speech recognition request.
-
If the split_transcript_at_phrase_end parameter is true, describes the reason for the split:
end_of_data - The end of the input audio stream.
full_stop - A full semantic stop, such as for the conclusion of a grammatical sentence. The insertion of splits is influenced by the base language model and biased by custom language models and grammars.
reset - The amount of audio that is currently being processed exceeds the two-minute maximum. The service splits the transcript to avoid excessive memory use.
silence - A pause or silence that is at least as long as the pause interval.
Declaration
Swift
public enum EndOfUtterance : String
-
An indication of whether the transcription results are final. If true, the results for this utterance are not updated further; no additional results are sent for a result_index once its results are indicated as final.
Declaration
Swift
public var final: Bool
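As a rough sketch of how the final flag might be used, the snippet below discards interim results and keeps only those the service will not update further. ResultStub is a hypothetical stand-in that mirrors only the final property declared above, and its text field is purely illustrative; neither is the SDK type.

```swift
// Hypothetical stand-in mirroring only the `final` property declared above.
struct ResultStub {
    let final: Bool
    let text: String   // illustrative payload, not an SDK field
}

// Keep only results that are marked as final.
func finalizedResults(_ results: [ResultStub]) -> [ResultStub] {
    return results.filter { $0.final }
}

let incoming = [
    ResultStub(final: false, text: "hello wor"),
    ResultStub(final: true, text: "hello world"),
]
let finalized = finalizedResults(incoming)
print(finalized.map { $0.text })
```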
-
An array of alternative transcripts. The alternatives array can include additional requested output such as word confidence or timestamps.
Declaration
Swift
public var alternatives: [SpeechRecognitionAlternative]
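One possible way to pick a single transcript from the alternatives array is sketched below. AlternativeStub is a hypothetical stand-in, and its transcript and confidence fields are assumptions about what an alternative carries; check the SpeechRecognitionAlternative reference for the actual properties.

```swift
// Hypothetical stand-in; `transcript` and `confidence` are assumed field names.
struct AlternativeStub {
    let transcript: String
    let confidence: Double?
}

// Returns the transcript with the highest reported confidence.
// Alternatives without a confidence value are ranked last.
func bestTranscript(in alternatives: [AlternativeStub]) -> String? {
    let best = alternatives.max(by: { ($0.confidence ?? -1) < ($1.confidence ?? -1) })
    return best?.transcript
}

let sample = [
    AlternativeStub(transcript: "recognize speech", confidence: 0.9),
    AlternativeStub(transcript: "wreck a nice beach", confidence: 0.4),
    AlternativeStub(transcript: "recognise peach", confidence: nil),
]
print(bestTranscript(in: sample) ?? "no alternatives")
```

Ranking by confidence is a design choice for this sketch, not a documented requirement.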
-
A dictionary (or associative array) whose keys are the strings specified for keywords if both that parameter and keywords_threshold are specified. The value for each key is an array of matches spotted in the audio for that keyword. Each match is described by a KeywordResult object. A keyword for which no matches are found is omitted from the dictionary. The dictionary is omitted entirely if no matches are found for any keywords.
Declaration
Swift
public var keywordsResult: [String : [KeywordResult]]?
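Because the dictionary is optional and unmatched keywords are absent, callers need to handle both a nil dictionary and a missing key. The sketch below counts matches per keyword under those rules. KeywordMatchStub is a hypothetical stand-in for KeywordResult, and its fields are assumptions made for illustration.

```swift
// Hypothetical stand-in for KeywordResult; field names are assumptions.
struct KeywordMatchStub {
    let startTime: Double
    let endTime: Double
    let confidence: Double
}

// Counts matches per keyword. A nil dictionary means no keyword matched at all.
func matchCounts(_ keywordsResult: [String: [KeywordMatchStub]]?) -> [String: Int] {
    guard let keywordsResult = keywordsResult else { return [:] }
    return keywordsResult.mapValues { $0.count }
}

let spotted: [String: [KeywordMatchStub]]? = [
    "colorado": [
        KeywordMatchStub(startTime: 1.2, endTime: 1.9, confidence: 0.93),
        KeywordMatchStub(startTime: 7.4, endTime: 8.0, confidence: 0.81),
    ]
]
let counts = matchCounts(spotted)
// A keyword with no matches has no key, so default to zero when reading.
print(counts["colorado", default: 0], counts["tornado", default: 0])
```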
-
An array of alternative hypotheses found for words of the input audio if a word_alternatives_threshold is specified.
Declaration
Swift
public var wordAlternatives: [WordAlternativeResults]?
-
If the split_transcript_at_phrase_end parameter is true, describes the reason for the split:
end_of_data - The end of the input audio stream.
full_stop - A full semantic stop, such as for the conclusion of a grammatical sentence. The insertion of splits is influenced by the base language model and biased by custom language models and grammars.
reset - The amount of audio that is currently being processed exceeds the two-minute maximum. The service splits the transcript to avoid excessive memory use.
silence - A pause or silence that is at least as long as the pause interval.
Declaration
Swift
public var endOfUtterance: String?
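Since endOfUtterance is declared as an optional String while EndOfUtterance is a String-backed enum, the raw value can be converted into a typed case before branching on it. The sketch below assumes the case names and defines its own EndOfUtteranceStub enum so that it is self-contained; only the four raw values come from the description above.

```swift
// Hypothetical mirror of the EndOfUtterance enum; case names are assumptions,
// raw values are the four strings documented above.
enum EndOfUtteranceStub: String {
    case endOfData = "end_of_data"
    case fullStop = "full_stop"
    case reset = "reset"
    case silence = "silence"
}

// Converts the optional raw string into a typed case.
// Returns nil when the field is absent or holds an unrecognized value.
func splitReason(from raw: String?) -> EndOfUtteranceStub? {
    guard let raw = raw else { return nil }
    return EndOfUtteranceStub(rawValue: raw)
}

if let reason = splitReason(from: "full_stop") {
    switch reason {
    case .endOfData: print("input audio ended")
    case .fullStop: print("sentence boundary")
    case .reset: print("two-minute processing limit reached")
    case .silence: print("pause at least as long as the pause interval")
    }
}
```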