public class SpeechRecognitionResult
extends com.ibm.cloud.sdk.core.service.model.GenericModel
Modifier and Type | Class and Description |
---|---|
static interface |
SpeechRecognitionResult.EndOfUtterance
If the `split_transcript_at_phrase_end` parameter is `true`, describes the reason for the
split: * `end_of_data` - The end of the input audio stream.
|
Constructor and Description |
---|
SpeechRecognitionResult() |
Modifier and Type | Method and Description |
---|---|
List<SpeechRecognitionAlternative> |
getAlternatives()
Gets the alternatives.
|
String |
getEndOfUtterance()
Gets the endOfUtterance.
|
Map<String,List<KeywordResult>> |
getKeywordsResult()
Gets the keywordsResult.
|
List<WordAlternativeResults> |
getWordAlternatives()
Gets the wordAlternatives.
|
Boolean |
isXFinal()
Gets the xFinal.
|
public Boolean isXFinal()
An indication of whether the transcription results are final: * If `true`, the results for this utterance are final. They are guaranteed not to be updated further. * If `false`, the results are interim. They can be updated with further interim results until final results are eventually sent.
**Note:** Because `final` is a reserved word in Java and Swift, the field is renamed `xFinal` in Java and is escaped with back quotes in Swift.
public List<SpeechRecognitionAlternative> getAlternatives()
An array of alternative transcripts. The `alternatives` array can include additional requested output such as word confidence or timestamps.
public Map<String,List<KeywordResult>> getKeywordsResult()
A dictionary (or associative array) whose keys are the strings specified for `keywords` if both that parameter and `keywords_threshold` are specified. The value for each key is an array of matches spotted in the audio for that keyword. Each match is described by a `KeywordResult` object. A keyword for which no matches are found is omitted from the dictionary. The dictionary is omitted entirely if no matches are found for any keywords.
public List<WordAlternativeResults> getWordAlternatives()
An array of alternative hypotheses found for words of the input audio if a `word_alternatives_threshold` is specified.
public String getEndOfUtterance()
If the `split_transcript_at_phrase_end` parameter is `true`, describes the reason for the split: * `end_of_data` - The end of the input audio stream. * `full_stop` - A full semantic stop, such as for the conclusion of a grammatical sentence. The insertion of splits is influenced by the base language model and biased by custom language models and grammars. * `reset` - The amount of audio that is currently being processed exceeds the two-minute maximum. The service splits the transcript to avoid excessive memory use. * `silence` - A pause or silence that is at least as long as the pause interval.
Copyright © 2022 IBM Cloud. All rights reserved.