Today we’re introducing Gemini 3.1 Flash TTS, the latest text-to-speech model that delivers improved control, expression and quality – enabling developers, enterprises and everyday users to create the next generation of AI-powered speech applications.
As of today, version 3.1 of Flash TTS is being introduced:
Improved speech quality and controllability
We’ve improved the overall speech quality of Gemini 3.1 Flash TTS, making it our most natural and expressive model ever. On Artificial TTS Analysis Leaderboarda benchmark that takes thousands of blind people’s preferences into account, 3.1 Flash TTS achieved an impressive Elo score of 1211.
