Speech

Voice input and output settings for hands-free interaction with your agent.

Voice Input (STT)

Configure speech-to-text settings for voice input.

Features:

Voice Output (TTS)

Configure text-to-speech settings for voice output.

Features:

Whisper Settings

Configure OpenAI Whisper for transcription.

Model Selection:

Larger models are more accurate but slower. Choose based on your accuracy and performance needs.
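As a rough illustration of that trade-off, the sketch below maps a coarse preference to a model size. The names tiny, base, small, medium and large are the sizes published with the open-source Whisper package. The helper pick_model and its "speed" / "balanced" / "accuracy" labels are hypothetical and are not part of this application's settings.

```python
# Model sizes shipped with the open-source Whisper package, smallest to largest.
WHISPER_MODELS = ["tiny", "base", "small", "medium", "large"]


def pick_model(priority: str) -> str:
    """Hypothetical helper: map a coarse preference to a Whisper model size."""
    # Assumed mapping for illustration only; tune it to your own hardware.
    choices = {"speed": "tiny", "balanced": "small", "accuracy": "large"}
    if priority not in choices:
        raise ValueError(f"unknown priority: {priority!r}")
    return choices[priority]


if __name__ == "__main__":
    for priority in ("speed", "balanced", "accuracy"):
        print(priority, "->", pick_model(priority))
```

In practice it is often easiest to start with a small model and move up only if the transcription quality is not good enough.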

GPU Acceleration:

Enable CUDA/GPU for faster transcription when hardware acceleration is available.
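The check behind this setting can be sketched as follows, assuming the open-source Whisper package running on PyTorch. The helper pick_device is hypothetical. It falls back to the CPU when PyTorch is not installed or no CUDA device is visible.

```python
def pick_device() -> str:
    """Hypothetical helper: return "cuda" if a CUDA GPU is usable, else "cpu"."""
    try:
        import torch  # optional dependency; assumed to be the Whisper backend
    except ImportError:
        return "cpu"
    return "cuda" if torch.cuda.is_available() else "cpu"


if __name__ == "__main__":
    device = pick_device()
    print("Transcription device:", device)
    # With the open-source package installed, the model could then be loaded as:
    #   import whisper
    #   model = whisper.load_model("base", device=device)
```

Falling back to the CPU keeps transcription working on machines without a supported GPU, only more slowly.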

Fig 10.11
