Multi-modal
Available now

Grok Imagine

by xAI

Cross-modal AI from xAI. Text-to-image, image-to-image, text-to-video, image-to-video, and video-to-video from one model, on Morphic.

Text-to-imageImage-to-imageText-to-videoImage-to-videoVideo-to-video

Grok Imagine

by xAI

Key features

What makes Grok Imagine stand out from other AI models

Technical specifications

Key specs and capabilities at a glance

1080p

Resolution

Full HD output

6–10s

Duration

6–10 seconds for video

5

Modes

TTI, ITI, TTV, ITV, VTV

24 fps

Frame rate

Standard frame rate

Use cases

How creators and businesses use Grok Imagine on Morphic

Multi-format campaigns

Create matching images and videos from a single concept. Design a still hero image, then animate it to video, maintaining perfect visual consistency.

Video transformation

Restyle existing footage with AI. Transform the look and feel of video clips while preserving the original motion, timing, and structure.

Image-to-video pipeline

Design a still image, refine it with image editing, then animate it to video. The seamless cross-modal flow enables precise creative control.

Creative exploration

Experiment across image and video formats without switching models. Rapidly explore ideas in both stills and motion from a single creative brief.

Style transfer

Apply new visual styles to existing images and videos, transform photos into paintings, convert live footage into animation, or restyle content for different audiences.

Content repurposing

Transform existing visual assets across formats. Turn product photos into promotional videos, animate illustrations, or convert video clips into stylized versions.

Prompt examples

Open any of these to tweak and generate

Image generation

A cyberpunk city at night with holographic advertisements floating between buildings, reflections on wet streets, detailed neon colors, cinematic wide angle

Edit prompt
Video from image

Upload a landscape photo and describe: Bring this scene to life with gentle wind through trees, moving clouds, and soft light changes

Edit prompt
Video transformation

Upload a video clip and describe: Transform to watercolor painting style, maintain all original motion, soft pastel colors

Edit prompt

FAQs

What is Grok Imagine?
Grok Imagine is xAI's multimodal AI model that generates both images and videos. It supports five modes: text-to-image, image-to-image editing, text-to-video, image-to-video animation, and video-to-video transformation.
Can Grok Imagine create both images and videos?
Yes. Grok Imagine is one of the few models that spans both image and video generation from a single model, making it uniquely versatile for creators who work across formats.
What is video-to-video?
Video-to-video lets you transform existing video clips, change visual styles, edit aesthetics, or reimagine footage while preserving the original motion, structure, and timing.
How long are Grok Imagine videos?
Grok Imagine generates videos between 6 and 10 seconds at 1080p, suitable for social media clips, creative shorts, and promotional content.
What makes Grok Imagine unique?
Its cross-modal versatility. While most models specialize in either image or video, Grok Imagine excels at both plus offers video-to-video transformation, five modes from a single model.
How do I use Grok Imagine on Morphic?
Open Copilot, describe what you want, and pick Grok Imagine. You can generate stills, animate an image, or transform an existing video clip, all from one model, which keeps cross-format work in a single conversation.

Try Grok Imagine on Morphic

Sign up for Morphic to start creating with Grok Imagine. No downloads, no setup, just describe what you want and generate.