Add server-backed realtime transcription for prompt voice input and expose speech settings to choose realtime mode and models.
Add server-backed realtime transcription for prompt voice input and expose speech settings to choose realtime mode and models.