The seconds of audio that the service has received as of this response. The value of the field is greater
than the values of the transcription and speaker_labels fields during speech recognition processing, since
the service first has to receive the audio before it can begin to process it. The final value can also be
greater than the value of the transcription and speaker_labels fields by a fractional number of seconds.
The seconds of audio that the service has passed to its speech-processing engine as of this response. The
value of the field is greater than the values of the transcription and speaker_labels fields during speech
recognition processing. The received and seen_by_engine fields have identical values when the service has
finished processing all audio. This final value can be greater than the value of the transcription and
speaker_labels fields by a fractional number of seconds.
If speaker labels are requested, the seconds of audio that the service has processed to determine speaker
labels as of this response. This value often trails the value of the transcription field during speech
recognition processing. The transcription and speaker_labels fields have identical values when the service
has finished processing all audio.
The seconds of audio that the service has processed for speech recognition as of this response.
Generated using TypeDoc
Detailed timing information about the service's processing of the input audio.