Stable Diffusion
What is Stable Diffusion?
Stable Diffusion is a free, open-source AI model that generates images from text descriptions, and because anyone can download and modify it, it has become the basis for a huge number of AI creative tools.
At a glance
- Type of model
- Open-source latent diffusion text-to-image generation model
- Developed by
- Stability AI, with research contributions from LMU Munich and Runway ML
- Key capability
- High-quality text-to-image generation, img2img, inpainting, and outpainting; foundational architecture for a large ecosystem of fine-tuned models and extensions
- How it fits in AI workflow
- Used for image generation, concept art, character and environment design, img2img refinement, compositing support, and as the base architecture for many specialised image and video generation tools
- Related terms
- Diffusion modelCLIPLoRAControlNetLatent spaceMidjourneyAUTOMATIC1111
Ready to create?
Direct scenes, design characters, and ship full films
All-in-one AI creative platform with simple, transparent pricing, no speed throttles, and an infinite Canvas for max creativity.
How it compares
Stable Diffusion is open-source, can be run locally, and offers deep customisation through fine-tuning and extensions, while Midjourney is a hosted proprietary service with no local deployment. Midjourney typically produces more aesthetically polished results out of the box with less prompting effort, while Stable Diffusion provides far greater technical control, customisability, and flexibility for professional and research workflows.
Pro tip
For consistent character generation across a production, train a LoRA on ten to twenty images of your character using Stable Diffusion, then use that LoRA across all image generations: this gives you far more reliable character identity than prompt descriptions alone and is the standard technique for AI character consistency workflows.
Types and variations
- Stable Diffusion has been released in several major versions: SD 1.
- 4, SD 1.
- 5, SD 2.
- 0, SD 2.
- 1, SDXL (Stable Diffusion XL), and Stable Diffusion 3.
- Each version brought improvements in resolution, prompt adherence, and image quality.
- The community has produced thousands of fine-tuned checkpoints specialised for photorealism, anime, concept art, and many other aesthetics.
- LoRA adaptors allow lightweight fine-tuning for specific characters, styles, and subjects.
- ControlNet adds spatial conditioning using edge maps, depth maps, and pose inputs for greater compositional control.
Ready to make your first scene in Morphic?
Try MorphicCommon use cases
Stable Diffusion is used for generating concept art and visual development assets, creating consistent AI characters through LoRA training, producing background and environment imagery, img2img refinement of rough sketches or reference images, inpainting and outpainting for image editing and extension, generating storyboard frames, producing textures and assets for 3D and compositing workflows, and as a foundation layer for custom AI image pipelines.
Ready to create?
Direct scenes, design characters, and ship full films
All-in-one AI creative platform with simple, transparent pricing, no speed throttles, and an infinite Canvas for max creativity.