Question 1

What is Gemini 3.1 Flash TTS?

Accepted Answer

Gemini 3.1 Flash TTS is Google's text-to-speech model, announced on April 15, 2026. It produces expressive, natural narration that you direct with plain-language instructions and inline audio tags, supports multi-speaker dialogue, and watermarks every clip with SynthID.

Question 2

What can I create with it on Morphic?

Accepted Answer

Use Gemini 3.1 Flash TTS for voice-over, narration, character dialogue, localized reads, and expressive ad reads. Generate the audio on Morphic, then drop it into Canvas alongside your video clips in the same workflow.

Question 3

How do I direct the voice?

Accepted Answer

There are two ways, and you can combine them. First, write a plain-language instruction before your line, like 'Say this warmly and slowly:'. Second, add inline cues in square brackets, like [laughs] or [whispering], at the points where you want them. Gemini performs each cue instead of reading it aloud.
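As a rough illustration of combining the two, the Python sketch below assembles a prompt string from a direction and a tagged script, then lists the bracketed cues so you can check them before generating. The helper names build_prompt and find_tags are hypothetical. They are not part of Morphic or any Google SDK, and the sketch only prepares text you would paste into the prompt bar.

```python
import re


def build_prompt(direction: str, script: str) -> str:
    """Prefix a script with a plain-language direction, ending in a colon.

    Hypothetical helper for illustration only; it just builds a string.
    """
    direction = direction.strip().rstrip(":")
    return f"{direction}: {script.strip()}"


def find_tags(script: str) -> list[str]:
    """Return the square-bracket cues in a script, in order of appearance."""
    return re.findall(r"\[([^\[\]]+)\]", script)


prompt = build_prompt(
    "Say this warmly and slowly",
    "Welcome back. [laughs] I didn't expect you so soon. [whispering] Come in.",
)
print(prompt)
print(find_tags(prompt))
```

Listing the cues up front is a cheap way to catch a typo such as an unclosed bracket, which would otherwise be read aloud as ordinary text.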

Question 4

Does it support multiple speakers?

Accepted Answer

Yes. Gemini 3.1 Flash TTS can voice a back-and-forth between two speakers in a single generation, giving each speaker a distinct voice. Label each line with the speaker's name and assign a voice to each one before you generate.
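A minimal sketch of preparing such a script in Python, assuming the "Name: line" labeling described above. DialogueLine and format_dialogue are hypothetical names, and VOICE_A and VOICE_B are placeholders rather than real voice names. The two-speaker check mirrors the wording of this answer and is an assumption, not a documented limit.

```python
from dataclasses import dataclass


@dataclass
class DialogueLine:
    speaker: str
    text: str


def format_dialogue(lines: list[DialogueLine], voices: dict[str, str]) -> str:
    """Render lines as 'Speaker: text', checking every speaker has a voice."""
    # Unique speaker names, in order of first appearance.
    speakers = list(dict.fromkeys(line.speaker for line in lines))
    if len(speakers) > 2:
        # Assumption: one generation covers a two-speaker exchange.
        raise ValueError(f"expected at most two speakers, got {speakers}")
    missing = [name for name in speakers if name not in voices]
    if missing:
        raise ValueError(f"no voice assigned to: {missing}")
    return "\n".join(f"{line.speaker}: {line.text}" for line in lines)


lines = [
    DialogueLine("Maya", "[whispering] Did you hear that?"),
    DialogueLine("Jonas", "Hear what? [laughs]"),
    DialogueLine("Maya", "Never mind."),
]
# Placeholder voice names; pick real ones from the voice list on Morphic.
voices = {"Maya": "VOICE_A", "Jonas": "VOICE_B"}
script = format_dialogue(lines, voices)
print(script)
```

Validating the voice assignment before generating avoids spending a generation on a script where one speaker has no voice.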

Question 5

How many languages does it support?

Accepted Answer

Gemini 3.1 Flash TTS narrates across many languages, with control over accent, pacing, and style in each. Pick the voice and language that suit your script before generating.

Question 6

How is it different from ElevenLabs on Morphic?

Accepted Answer

Both produce human-quality voice on Morphic. ElevenLabs is a full audio suite spanning speech, music, and sound effects, with fine voice-tuning controls. Gemini 3.1 Flash TTS focuses on expressive, directable speech: plain-language direction, inline audio tags, and multi-speaker dialogue. Many creators use both, with Gemini 3.1 Flash TTS for voice and ElevenLabs for music and effects.

Question 7

Does it watermark the audio?

Accepted Answer

Yes. Every clip generated by Gemini 3.1 Flash TTS carries Google's SynthID watermark for AI provenance. The watermark is inaudible to listeners and built to survive common edits like re-encoding.

Question 8

How do I use Gemini 3.1 Flash TTS on Morphic?

Accepted Answer

Open Morphic, switch the prompt bar to Audio, and choose Speech. Pick Gemini 3.1 Flash TTS as the audio model, write your script with any direction or tags, choose a voice and language, then generate.

Gemini 3.1 Flash TTS

Key features

Expressive narration

Natural-language direction

Inline audio tags

Multi-speaker dialogue

Multilingual and accent control

SynthID watermarking

Technical specifications

Use cases

Video narration and voice-over

Character dialogue

Localized voice-over

Audiobook and long-form

Explainers and tutorials

Ad reads and promos

Prompt examples

Warm narration

Inline reaction

Whisper to normal

Accent control

Dramatic pacing

Two-speaker scene
