Changing the speech provider

At any stage of the call, the bot can dynamically change the speech provider (speech-to-text or text-to-speech) of the call dynamically. The provider change can be done for the entire call duration, or for the current activity (the current text/audio that is played by the bot).

For example, some speech-to-text providers may be better at recognizing dates while others are better at recognizing streets. The bot can change the active speech-to-text or text-to-speech during the call.

This feature is supported only by VoiceAI Connect Enterprise (from Version 3.6 and later).

How do I use it?

At any stage of the call, the bot can dynamically change the speech provider (speech-to-text or text-to-speech) of the call dynamically. The provider change can be done for part or an entire call duration (activity/session params). The following is an example of dynamically changing both the text-to-speech and speech-to-text providers using the ttsProvider and sttProvider activityParams.

This feature is configured on VoiceAI Connect by the Administrator, using the additionalSttProviders and additionalTtsProviders bot parameters (see Configuring multiple speech providers).

Example:

The following shows an example of a bot configuration for changing text-to-speech and speech-to-text providers:

{ 
  "type": "message", 
  "text": "Please tell me your date of birth", 
  "activityParams": { 
    "ttsProvider": "my_aws_provider", 
    "sttProvider": "my_google_provider" 
  } 
}

This config works the same as all config events. You can use activityParams for changing the next Activity or sessionParams for changing sttProvider/ttsProvider till the end of the session.