Ideogram 4.0
by Ideogram
Ideogram's open-weight image model. Frontier in-image text, layout control, and 2K output.

Key features
Technical specifications
Open
Open weights under a commercial license
0.97 OCR
X-Omni English OCR for in-image text
16 colors
Condition output on up to 16 hex colors
Up to 2K
256 to 2048 px per side, flexible ratios
Use cases
Posters and packaging
Design where the title, tagline, and small print all have to read correctly. Text renders legibly, not as shapes.
Multilingual campaigns
Localize one visual across markets by swapping the text per language while layout and palette stay fixed.
Brand-locked visuals
Feed the brand's hex palette into the prompt and every generation stays on-brand, tile to banner.
Unusual formats
One set of weights covers square thumbnails, widescreen, 2048 by 768 ultrawide banners, and social headers.
Programmatic generation
JSON prompts are built for code. Generate catalogs or ad variants from a script, each element typed and validated.
Self-hosted pipelines
Teams that can't use a third-party API can fine-tune the open weights and run them in their own infrastructure.
Prompt examples





Multilingual sign
Tokyo storefront with accurate Japanese signage, soft rain, evening glow
Edit prompt
Simple pricing
Get started for free today, with the option to upgrade or cancel anytime.
Basic
900 monthly credits
1 user only
All models
Workflows
Standard
3200 monthly credits
1 user only
All models
Workflows
Pro
6200 shared monthly credits
1 user
All models
Workflows
Pro Max
24000 shared monthly credits
1 user
All models
Workflows
Enterprise
For higher limits
Custom
pricing and billing terms

Free
For playing around
$0
forever free
FAQs
Reve 2.0
Reve AI
Reve AI's layout-first image model. Place every element by hand, edit the result like a design file, and render crisp text at up to 4K.
Bernini
ByteDance
ByteDance's open-source video model for instruction-based editing, with the rest of the frame locked and subject identity held.
Grok Imagine v1.5
xAI
xAI's image-to-video model with native synchronized audio. Animate any still into a clip with sound, dialogue, and music.
Veo 4
Google DeepMind
Google DeepMind's next video model. Native 4K, longer clips, multi-shot character consistency, and a cinematic camera language in a single prompt.