Gemini 3.1 Flash TTS: the next generation of expressive AI speech

Share

Today we’re introducing Gemini 3.1 Flash TTS, the latest text-to-speech model that delivers improved control, expression and quality – enabling developers, enterprises and everyday users to create the next generation of AI-powered speech applications.

As of today, version 3.1 of Flash TTS is being introduced:

Improved speech quality and controllability

We’ve improved the overall speech quality of Gemini 3.1 Flash TTS, making it our most natural and expressive model ever. On Artificial TTS Analysis Leaderboarda benchmark that takes thousands of blind people’s preferences into account, 3.1 Flash TTS achieved an impressive Elo score of 1211.

The AI Sckool

Categories

Gemini 3.1 Flash TTS: the next generation of expressive AI speech

Improved speech quality and controllability

China’s open AI models challenge Silicon Valley’s playbook

Even subtle sleep deprivation can lead to weight gain

Accelerating the frontiers of scientific discovery: Google’s $40 million commitment to the Genesis mission

Run the Mythos enhanced encoding model locally using llama.cpp and Pi files

Introducing Gemini 3.6 Flash, 3.5 Flash-Lite and 3.5 Flash Cyber

More News

Accelerating the frontiers of scientific discovery: Google’s $40 million commitment to the Genesis mission

Introducing Gemini 3.6 Flash, 3.5 Flash-Lite and 3.5 Flash Cyber

Our approach to bioresistance

Empowering India’s next generation of innovators with ATL Saathi

China’s open AI models challenge Silicon Valley’s playbook

Even subtle sleep deprivation can lead to weight gain

Accelerating the frontiers of scientific discovery: Google’s $40 million commitment to the Genesis mission