Copy of Voice emotion control
Add emotion, reactions, and delivery direction to your generated speech. How you write your prompt directly controls how the voice performs.
Morphic supports two speech models — ElevenLabs and MiniMax. Each uses a different syntax for emotion control. Select your model, then write your prompt using the format below.
ElevenLabs
ElevenLabs uses bracket tags written directly in your prompt. Wrap any emotion, reaction, or direction in square brackets and the model will interpret it as a performance cue — not spoken text.
How it works
[tag] Your dialogue text here.Tags affect everything after them until a new tag appears. You can combine multiple tags and place them anywhere in your text.
[excited] We just hit a million users!
[whispers][tense] Don't let them hear you.
[tired] I've been working for 14 hours straight. [sigh] I can't even feel my hands anymore.You can use any tag
ElevenLabs is open-ended. There is no fixed list — write any emotion or direction inside brackets and the model will try to interpret it. Tags like [jealous], [romantic], [awkward], [suspicious tone], or [continues after a beat] all work.
The tags below are commonly used and reliably effective, but you're not limited to them.
Emotions
[excited] · [happy] · [cheerfully] · [sad] · [sorrowful] · [angry] · [nervous] · [frustrated] · [calm] · [tired] · [curious] · [sarcastic] · [playful] · [mischievously] · [deadpan] · [flatly]
Nuance: [hesitant] · [relieved] · [tense] · [warm] · [resigned tone] · [stammers] · [regretful] · [sympathetic] · [reassuring] · [professional] · [questioning]
Reactions
Non-verbal sounds that add realism.
[laughs] · [giggles] · [sigh] · [gasps] · [gulps] · [crying] · [clears throat] · [light chuckle]
Delivery
[whispers] · [shouts] · [quietly] · [loudly] · [rushed] · [drawn out] · [dramatic tone]
Accents
Switch accents without changing voice.
[American accent] · [British accent] · [French accent] · [Southern US accent] · [strong Russian accent] · [strong X accent]
Characters
[pirate voice] · [old man voice] · [robot voice]
Multi-character dialogue
[interrupting] · [overlapping]
Pauses
ElevenLabs does not support explicit pause durations. Pause length is inferred from context, tags, and punctuation:
[pause]
Dramatic silence (model decides length)
...
Hesitant trailing pause
—
Short, sharp pause
ALL CAPS
Emphasis on the word
New paragraph
Clear pause + intonation reset
Tips
Tags work best when they match the text.
"No... please [crying] don't go."works better than adding[crying]to neutral text.Combine tags:
[whispers][tense]or[hesitant][nervous].The voice you choose matters — a calm voice won't shout well, and a hyped voice won't whisper well.
Use Creative or Natural stability for best tag responsiveness. Robust is more consistent but less expressive.
Prompts under ~250 characters may produce inconsistent results. Longer is better.
MiniMax
MiniMax uses parenthetical sound tags in your prompt and a separate emotion selector in Morphic's UI.
Emotion
Select the emotion from the dropdown when generating. This sets the overall tone of the entire output.
Auto
Model reads the text and picks the best emotion (default)
Happy
Upbeat, positive
Sad
Downcast, melancholic
Angry
Forceful, aggressive
Fearful
Anxious, scared
Disgusted
Repulsed, averse
Surprised
Startled, astonished
Calm
Relaxed, serene
Fluent
Clean, broadcast-style — ideal for news or technical narration
Neutral
No emotional bias
Sound tags
Add non-verbal sounds directly in your prompt using parentheses. These are preset only — only the tags listed below are supported.
(laughs)
(chuckle)
(coughs)
(clear-throat)
(groans)
(breath)
(pant)
(inhale)
(exhale)
(gasps)
(sniffs)
(sighs)
(snorts)
(burps)
(lip-smacking)
(humming)
(hissing)
(emm)
(whistles)
(sneezes)
(crying)
(applause)
(yawns)
Unlike ElevenLabs, you cannot invent custom tags. Writing
(nervous)or(jealous)won't work — the model will speak them as text. Use the emotion selector for emotional tone.
Pauses
Insert timed silences using <#x#> where x is seconds (0.01–99.99).
Tips
Use sound tags sparingly — too many can sound unnatural.
Set emotion to Auto for most cases. Override manually when you need consistent tone across long text.
Punctuation matters — commas and periods guide the model's pacing and intonation.
Last updated