SpeechRecognitionResult
public struct SpeechRecognitionResult : Codable, Equatable
Component results for a speech recognition request.
-
If the
split_transcript_at_phrase_endparameter istrue, describes the reason for the split:end_of_data- The end of the input audio stream.full_stop- A full semantic stop, such as for the conclusion of a grammatical sentence. The insertion of splits is influenced by the base language model and biased by custom language models and grammars.reset- The amount of audio that is currently being processed exceeds the two-minute maximum. The service splits the transcript to avoid excessive memory use.silence- A pause or silence that is at least as long as the pause interval.
Declaration
Swift
public enum EndOfUtterance : String -
An indication of whether the transcription results are final. If
true, the results for this utterance are not updated further; no additional results are sent for aresult_indexonce its results are indicated as final.Declaration
Swift
public var final: Bool -
An array of alternative transcripts. The
alternativesarray can include additional requested output such as word confidence or timestamps.Declaration
Swift
public var alternatives: [SpeechRecognitionAlternative] -
A dictionary (or associative array) whose keys are the strings specified for
keywordsif both that parameter andkeywords_thresholdare specified. The value for each key is an array of matches spotted in the audio for that keyword. Each match is described by aKeywordResultobject. A keyword for which no matches are found is omitted from the dictionary. The dictionary is omitted entirely if no matches are found for any keywords.Declaration
Swift
public var keywordsResult: [String : [KeywordResult]]? -
An array of alternative hypotheses found for words of the input audio if a
word_alternatives_thresholdis specified.Declaration
Swift
public var wordAlternatives: [WordAlternativeResults]? -
If the
split_transcript_at_phrase_endparameter istrue, describes the reason for the split:end_of_data- The end of the input audio stream.full_stop- A full semantic stop, such as for the conclusion of a grammatical sentence. The insertion of splits is influenced by the base language model and biased by custom language models and grammars.reset- The amount of audio that is currently being processed exceeds the two-minute maximum. The service splits the transcript to avoid excessive memory use.silence- A pause or silence that is at least as long as the pause interval.
Declaration
Swift
public var endOfUtterance: String?
View on GitHub
SpeechRecognitionResult Structure Reference