DALL-E
What is DALL-E?
DALL-E is OpenAI's first AI model that could generate images from text descriptions, proving that a computer could create new pictures from written instructions.
At a glance
- Type of model
- Text-to-image generation model
- Developed by
- OpenAI
- Key capability
- Generating coherent images from natural language prompts, including novel combinations of concepts not seen during training
- How it fits in AI workflow
- The original DALL-E established text-to-image generation as a practical modality and is the ancestor of DALL-E 2 and DALL-E 3, which are the versions currently used in production creative workflows
Ready to create?
Direct scenes, design characters, and ship full films
All-in-one AI creative platform with simple, transparent pricing, no speed throttles, and an infinite Canvas for max creativity.
How it compares
DALL-E is a proprietary model developed and controlled by OpenAI, accessed through their API and products. Stable Diffusion is an open-source model whose weights are publicly available, enabling community customization, local deployment, and a wide ecosystem of fine-tuned variants. DALL-E prioritizes commercial safety and ease of use; Stable Diffusion prioritizes openness, flexibility, and community extension.
Pro tip
Understanding DALL-E's historical role helps contextualize the entire text-to-image generation field. When encountering literature, tutorials, or discussions about AI image generation from 2021 and 2022, DALL-E references typically mean the original model or DALL-E 2. Distinguishing between the three generations by their release context avoids confusion when evaluating older capability claims against current model performance.
Types and variations
- The original DALL-E used a transformer-based autoregressive architecture and produced lower-resolution outputs relative to its successors.
- DALL-E 2 replaced the architecture with a diffusion-based approach, significantly improving quality and enabling inpainting and outpainting.
- DALL-E 3 further advanced prompt adherence, text rendering, and compositional sophistication.
- Each version represents a distinct model with different capabilities, though they share the same founding concept and naming lineage.
Ready to make your first scene in Morphic?
Try MorphicCommon use cases
- Research and education contexts where the original model's historical significance and foundational capabilities are the subject of study.
- Early commercial creative workflows where DALL-E outputs were used for concept exploration and ideation before higher-quality successors were available.
- Demonstrations of AI creative capability to audiences unfamiliar with text-to-image generation.
- The original DALL-E is less commonly used for current production work, which typically relies on DALL-E 2, DALL-E 3, or third-party models.
Ready to create?
Direct scenes, design characters, and ship full films
All-in-one AI creative platform with simple, transparent pricing, no speed throttles, and an infinite Canvas for max creativity.