responseModalities property
Modalities the server should produce. Common values: ['AUDIO'] for
pure voice, ['TEXT'] for speech-to-text mode, ['AUDIO', 'TEXT']
for both (the API limits combinations — check current docs).
Implementation
final List<String> responseModalities;