audio property
The natural language speech audio to be processed.
A single request can contain up to 2 minutes of speech audio data. The
transcribed text cannot contain more than 256 bytes. For non-streaming
audio detect intent, both config
and audio
must be provided. For
streaming audio detect intent, config
must be provided in the first
request and audio
must be provided in all following requests.
Implementation
core.String? audio;