Package | Description |
---|---|
com.ibm.watson.developer_cloud.speech_to_text.v1 |
Copyright 2015 IBM Corp.
|
com.ibm.watson.developer_cloud.speech_to_text.v1.websocket |
Modifier and Type | Method and Description |
---|---|
RecognizeOptions |
RecognizeOptions.contentType(String contentType)
The format of the audio data specified as one of the following values:
audio/flac for Free Lossless Audio Codec (FLAC)
audio/l16 for Linear 16-bit Pulse-Code Modulation (PCM)
audio/wav for Waveform Audio File Format (WAV)
audio/ogg;codecs=opus for Ogg format files that use the Opus codec
|
RecognizeOptions |
RecognizeOptions.continuous(Boolean continuous)
If true, multiple final results that represent multiple consecutive phrases separated by pauses
are returned.
|
RecognizeOptions |
RecognizeOptions.inactivityTimeout(Integer inactivityTimeout)
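Sets the inactivity timeout: the time in seconds after which the connection is closed if only silence (no speech) is detected in the submitted audio.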
|
RecognizeOptions |
RecognizeOptions.interimResults(Boolean interimResults)
If true, the service sends interim results for the transcription.
|
RecognizeOptions |
RecognizeOptions.keywords(String[] keywords)
Specifies an array of keyword strings to be matched in the input audio.
|
RecognizeOptions |
RecognizeOptions.keywordsThreshold(Double keywordsThreshold)
Specifies a minimum level of confidence that the service must have to report a matching keyword
in the input audio.
|
RecognizeOptions |
RecognizeOptions.maxAlternatives(Integer maxAlternatives)
Maximum number of alternative transcripts returned.
|
RecognizeOptions |
RecognizeOptions.model(String model)
Sets the model name used for the recognition.
|
RecognizeOptions |
RecognizeOptions.session(SpeechSession session)
Sets the session id from the given SpeechSession.
|
RecognizeOptions |
RecognizeOptions.sessionId(String sessionId)
Sets the session id.
|
RecognizeOptions |
RecognizeOptions.timestamps(Boolean timestamps)
If true, time alignment for each word is returned.
|
RecognizeOptions |
RecognizeOptions.wordAlternativesThreshold(Double wordAlternativesThreshold)
Specifies a minimum level of confidence that the service must have to report a hypothesis for a
word from the input audio.
|
RecognizeOptions |
RecognizeOptions.wordConfidence(Boolean wordConfidence)
If true, a confidence measure for each word is returned, if available.
|
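Every method in the table above returns RecognizeOptions, which suggests the setters are meant to be chained. The following is a minimal, self-contained sketch of that fluent pattern. `FluentOptionsSketch` and its `asMap()` accessor are hypothetical stand-ins written for illustration; they are not the SDK class, and only three of the setters listed above are mirrored.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical stand-in for RecognizeOptions; NOT the SDK class.
// It only illustrates setters that return the options object for chaining.
final class FluentOptionsSketch {
    // Values are kept in insertion order so they print in the order they were set.
    private final Map<String, Object> values = new LinkedHashMap<>();

    // Mirrors contentType(String): records the value and returns this.
    FluentOptionsSketch contentType(String contentType) {
        values.put("contentType", contentType);
        return this;
    }

    // Mirrors continuous(Boolean).
    FluentOptionsSketch continuous(Boolean continuous) {
        values.put("continuous", continuous);
        return this;
    }

    // Mirrors maxAlternatives(Integer).
    FluentOptionsSketch maxAlternatives(Integer maxAlternatives) {
        values.put("maxAlternatives", maxAlternatives);
        return this;
    }

    // Hypothetical accessor, added so the example can show what was set.
    Map<String, Object> asMap() {
        return values;
    }

    public static void main(String[] args) {
        FluentOptionsSketch options = new FluentOptionsSketch()
                .contentType("audio/wav")
                .continuous(true)
                .maxAlternatives(3);
        System.out.println(options.asMap());
    }
}
```

With the real class, the equivalent chain would presumably read `new RecognizeOptions().contentType("audio/wav").continuous(true).maxAlternatives(3)`, using only the signatures listed in the table.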
Modifier and Type | Method and Description |
---|---|
SpeechResults |
SpeechToText.recognize(File audio,
RecognizeOptions options)
Recognizes an audio file and returns SpeechResults. |
SpeechResults |
SpeechToText.recognize(File audio,
String contentType,
RecognizeOptions options)
Deprecated. As of 2.6.0, use SpeechToText.recognize(File, RecognizeOptions) instead. |
void |
SpeechToText.recognizeUsingWebSockets(InputStream audio,
RecognizeOptions options,
RecognizeDelegate delegate)
Recognizes an audio InputStream using WebSockets. |
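Because the overload that took a separate `String contentType` is deprecated, the content type is presumably supplied through `RecognizeOptions.contentType(String)` instead. The sketch below shows one way a caller might pick one of the four documented values from a file name. The helper `contentTypeFor` and the extension-to-type mapping (in particular `.pcm` for `audio/l16`, and treating every `.ogg` file as Opus) are assumptions made for illustration and are not part of the SDK.

```java
import java.util.Locale;

// Hypothetical helper, not part of the SDK: maps a file extension to one of
// the content type values documented for RecognizeOptions.contentType(String).
final class ContentTypeSketch {

    static String contentTypeFor(String fileName) {
        int dot = fileName.lastIndexOf('.');
        String ext = dot < 0 ? "" : fileName.substring(dot + 1).toLowerCase(Locale.ROOT);
        switch (ext) {
            case "flac":
                return "audio/flac";
            case "wav":
                return "audio/wav";
            case "ogg":
                // Assumption: the Ogg file uses the Opus codec.
                return "audio/ogg;codecs=opus";
            case "pcm":
                // Assumption: raw 16-bit PCM audio is stored with a .pcm extension.
                return "audio/l16";
            default:
                throw new IllegalArgumentException("Unsupported audio file: " + fileName);
        }
    }

    public static void main(String[] args) {
        System.out.println(contentTypeFor("sample1.wav"));
        // With the SDK on the classpath, the result would be passed along roughly as:
        //   RecognizeOptions options = new RecognizeOptions().contentType(contentTypeFor(audio.getName()));
        //   SpeechResults results = service.recognize(audio, options);
        // where service is a SpeechToText instance and audio is a java.io.File.
    }
}
```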
Modifier and Type | Method and Description |
---|---|
void |
WebSocketSpeechToTextClient.recognize(InputStream stream,
RecognizeOptions options,
RecognizeDelegate delegate)
Creates a WebSocket connection to the Speech To Text service and sends the audio bytes from the
input stream for recognition.
|
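The description above says the client sends the audio bytes from the input stream over the WebSocket connection. How the client actually reads and frames those bytes is not documented here; the sketch below only illustrates the general technique of draining an InputStream in fixed-size chunks and handing each chunk to a sender. `sendInChunks`, the `Consumer<byte[]>` sender, and the chunk size are all hypothetical.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;
import java.util.Arrays;
import java.util.function.Consumer;

// Hypothetical sketch, not the SDK implementation: reads a stream in chunks
// and passes each chunk to a sender, which stands in for a WebSocket send call.
final class StreamChunkSketch {

    // Returns the total number of bytes handed to the sender.
    static int sendInChunks(InputStream stream, int chunkSize, Consumer<byte[]> sender) {
        if (chunkSize <= 0) {
            throw new IllegalArgumentException("chunkSize must be positive");
        }
        byte[] buffer = new byte[chunkSize];
        int total = 0;
        try {
            int read;
            // read() returns -1 once the end of the stream is reached.
            while ((read = stream.read(buffer)) != -1) {
                if (read > 0) {
                    // Copy only the bytes actually read; the last chunk may be shorter.
                    sender.accept(Arrays.copyOf(buffer, read));
                    total += read;
                }
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return total;
    }

    public static void main(String[] args) {
        InputStream audio = new ByteArrayInputStream(new byte[10]);
        int sent = sendInChunks(audio, 4,
                chunk -> System.out.println("chunk of " + chunk.length + " bytes"));
        System.out.println("total bytes: " + sent);
    }
}
```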
Copyright © 2015–2016. All rights reserved.