Video generation
Available now

Happy Horse 1.0

by Alibaba

Happy Horse 1.0 by Alibaba's Taotian Future Life Lab generates video and synchronized audio in a single forward pass. It produces 1080p output in 3–15 second clips, with native lip-sync across English, Mandarin, Cantonese, Japanese, Korean, German, and French, and reference-driven control across text, image, and video inputs.

Text-to-video, Image-to-video, Reference-to-video, Video editing, Native audio generation, Multilingual lip-sync, Character consistency

Key features

What makes Happy Horse 1.0 stand out from other AI models.

Technical specifications

Key specs and capabilities at a glance.

Resolution: 1080p. Full HD output at 24 fps.

Duration: 3–15s per generation.

Aspect ratios: 5 (16:9, 9:16, 1:1, 4:3, 3:4).

Languages: 7, with native lip-sync in each.

Reference images: up to 5, for reference-to-video and video editing.

Parameters: 15B, in a unified 40-layer transformer.
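
If you script your generations, it can help to check settings against these limits before submitting. The sketch below is illustrative only: `GenerationSettings` and `check_settings` are hypothetical names invented for this example, not part of any Morphic or Alibaba API, and the limits are simply copied from the specifications listed above.

```python
from dataclasses import dataclass, field
from typing import List, Optional

# Limits copied from the published specs; adjust if they change.
MIN_SECONDS, MAX_SECONDS = 3, 15
ASPECT_RATIOS = {"16:9", "9:16", "1:1", "4:3", "3:4"}
LIP_SYNC_LANGUAGES = {
    "English", "Mandarin", "Cantonese", "Japanese", "Korean", "German", "French",
}
MAX_REFERENCE_IMAGES = 5


@dataclass
class GenerationSettings:
    """Hypothetical container for one generation request (not a real API type)."""
    prompt: str
    seconds: int = 5
    aspect_ratio: str = "16:9"
    language: Optional[str] = None  # None means no spoken dialogue
    reference_images: List[str] = field(default_factory=list)


def check_settings(settings: GenerationSettings) -> List[str]:
    """Return a list of human-readable problems; an empty list means OK."""
    problems = []
    if not MIN_SECONDS <= settings.seconds <= MAX_SECONDS:
        problems.append(
            f"duration must be {MIN_SECONDS}-{MAX_SECONDS}s, got {settings.seconds}s"
        )
    if settings.aspect_ratio not in ASPECT_RATIOS:
        problems.append(f"unsupported aspect ratio: {settings.aspect_ratio}")
    if settings.language is not None and settings.language not in LIP_SYNC_LANGUAGES:
        problems.append(f"no native lip-sync for language: {settings.language}")
    if len(settings.reference_images) > MAX_REFERENCE_IMAGES:
        problems.append(
            f"at most {MAX_REFERENCE_IMAGES} reference images, "
            f"got {len(settings.reference_images)}"
        )
    return problems


if __name__ == "__main__":
    draft = GenerationSettings(
        prompt="Two friends laughing in a Paris café, French dialogue, handheld",
        seconds=20,
        aspect_ratio="9:16",
        language="French",
    )
    for problem in check_settings(draft):
        print(problem)
```

Returning a list of problems rather than raising on the first one lets you report every out-of-range setting at once.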

Use cases

How creators and businesses use Happy Horse 1.0 on Morphic.

Dialogue-driven scenes

Scenes where characters speak in any of 7 languages with synced lip movement, ambient sound, and timing.

Music videos and performance clips

Because video and audio are generated together, motion lands on the beat from the first pass, with no manual sync work needed.

Ad and campaign spots

Reference-driven control keeps product, talent, and brand visuals consistent across multiple shots.

Character-consistent storytelling

Lock in characters with reference images and carry them across multiple scenes for narrative video work.

Multilingual content localization

Same scene, same characters, dialogue swapped across languages with native lip-sync, suited for global campaigns.

Video editing without full re-renders

Adjust details, swap elements, or restyle existing footage through text instructions instead of starting over.

Prompt examples

Get started with these prompts. Paste them into Morphic and hit generate.

Dialogue scene

Two friends laughing in a Paris café, French dialogue, handheld

Performance clip

Cellist on a rooftop at sunset, sweeping orchestral score

Product spot

Sneakers spin on glossy floor, hip-hop beat, macro lens

Frequently asked questions

What is Happy Horse 1.0?
Happy Horse 1.0 is Alibaba's video generation model from the Taotian Future Life Lab, released in April 2026. It generates video and synchronized audio together in a single pass and held the #1 Elo rating on the Artificial Analysis Video Arena at launch.
Which languages does Happy Horse support for lip-sync?
Seven languages with native lip-sync: English, Mandarin, Cantonese, Japanese, Korean, German, and French.
How long can a Happy Horse 1.0 video be?
Each generation is 3 to 15 seconds at 1080p, across five aspect ratios including 16:9, 9:16, and 1:1.
How is it different from Seedance 2.0 or Veo 3?
Happy Horse generates video and audio jointly in one pass, with native multilingual lip-sync across 7 languages. Seedance 2.0 emphasizes multimodal inputs and music beat sync. Veo 3 focuses on cinematic photorealism.
Can I edit existing video with Happy Horse?
Yes. The video-edit endpoint accepts natural language instructions and up to 5 reference images to modify existing footage without a full re-render.

Try Happy Horse 1.0 on Morphic

Sign up for Morphic to start creating with Happy Horse 1.0. No downloads and no setup: just describe what you want and generate.