Tag
voice AI
Voice AI covers systems that listen, speak, and respond in real time, from speech-to-text and TTS to live multimodal assistants. For developers, latency, noise handling, language support, and custom voices determine whether it works in call centers, cars, and live apps.
3 articles

Model Releases/Apr 3
Mistral’s Voxtral TTS targets voice AI builders
Mistral’s open source Voxtral TTS supports 9 languages, 90 ms TTFA, and custom voices from under 5 seconds of audio.

Model Releases/Apr 3
Google's Gemini 3.1 Flash Live Targets Real-Time Voice AI
Gemini 3.1 Flash Live brings low-latency audio, video, and tool use to Google’s Live API, with 90.8% on ComplexFuncBench Audio.

Model Releases/Apr 3
Gemini Live gets a major upgrade with 3.1 Flash Live
Google’s Gemini 3.1 Flash Live cuts latency, improves noise handling, and expands Search Live to 200+ countries.