SpeechRecognitionResult

public struct SpeechRecognitionResult : Codable, Equatable

Component results for a speech recognition request.

  • If the split_transcript_at_phrase_end parameter is true, describes the reason for the split:

    • end_of_data - The end of the input audio stream.
    • full_stop - A full semantic stop, such as for the conclusion of a grammatical sentence. The insertion of splits is influenced by the base language model and biased by custom language models and grammars.
    • reset - The amount of audio that is currently being processed exceeds the two-minute maximum. The service splits the transcript to avoid excessive memory use.
    • silence - A pause or silence that is at least as long as the pause interval.
    See more

    Declaration

    Swift

    public enum EndOfUtterance : String
  • An indication of whether the transcription results are final. If true, the results for this utterance are not updated further; no additional results are sent for a result_index once its results are indicated as final.

    Declaration

    Swift

    public var final: Bool
  • An array of alternative transcripts. The alternatives array can include additional requested output such as word confidence or timestamps.

    Declaration

    Swift

    public var alternatives: [SpeechRecognitionAlternative]
  • A dictionary (or associative array) whose keys are the strings specified for keywords if both that parameter and keywords_threshold are specified. The value for each key is an array of matches spotted in the audio for that keyword. Each match is described by a KeywordResult object. A keyword for which no matches are found is omitted from the dictionary. The dictionary is omitted entirely if no matches are found for any keywords.

    Declaration

    Swift

    public var keywordsResult: [String : [KeywordResult]]?
  • An array of alternative hypotheses found for words of the input audio if a word_alternatives_threshold is specified.

    Declaration

    Swift

    public var wordAlternatives: [WordAlternativeResults]?
  • If the split_transcript_at_phrase_end parameter is true, describes the reason for the split:

    • end_of_data - The end of the input audio stream.
    • full_stop - A full semantic stop, such as for the conclusion of a grammatical sentence. The insertion of splits is influenced by the base language model and biased by custom language models and grammars.
    • reset - The amount of audio that is currently being processed exceeds the two-minute maximum. The service splits the transcript to avoid excessive memory use.
    • silence - A pause or silence that is at least as long as the pause interval.

    Declaration

    Swift

    public var endOfUtterance: String?