DreamBooth

What is DreamBooth?

DreamBooth is a technique for training an AI image model on a small set of photos of a specific subject so it can generate that subject in new situations, styles, and contexts.

At a glance

Type of model
Fine-tuning technique for personalizing existing text-to-image diffusion models
Developed by
Google Research
Key capability
Training an AI image generation model on three to thirty images of a specific subject to enable generation of that subject in new contexts, poses, and styles
How it fits in AI workflow
Used to create custom character models, brand-consistent visual tools, and personalized generators within AI production pipelines; typically applied to Stable Diffusion-based models and workflows

Ready to create?

Direct scenes, design characters, and ship full films

All-in-one AI creative platform with simple, transparent pricing, no speed throttles, and an infinite Canvas for max creativity.

How it compares

How it compares

DreamBooth produces a full fine-tuned model checkpoint and typically achieves strong, comprehensive personalization of the subject across diverse prompt contexts. LoRA is a more computationally efficient fine-tuning approach that trains a small set of additional weights rather than the full model, requiring less storage and training time while achieving strong but sometimes less comprehensive personalization. In practice, DreamBooth with LoRA combines both approaches, using the DreamBooth training methodology with the LoRA efficiency framework to balance quality against resource requirements.


Pro tip

Image curation for DreamBooth training has a disproportionate impact on output quality. Rather than collecting as many images as possible, prioritize ten to twenty diverse, high-quality images that show the subject from varied angles, in different lighting conditions, and with different backgrounds. Including near-duplicate images, multiple very similar frames, or images with other visually dominant elements teaches the model the wrong patterns. Variety within a small, well-curated set consistently outperforms large sets of redundant images.

Types and variations

  • Full DreamBooth fine-tuning updates all or most of the model's weights on the subject dataset, producing comprehensive and flexible personalization but requiring more storage as a full model checkpoint is produced.
  • DreamBooth with LoRA integrates the DreamBooth approach with the LoRA efficient fine-tuning framework, reducing storage requirements and training time while maintaining strong personalization results.
  • Class-specific DreamBooth training uses prior preservation loss, training the model with additional generic class images to prevent the fine-tuning from degrading the model's general capability while it learns the specific subject.

Ready to make your first scene in Morphic?

Try Morphic

Common use cases

  • Training a custom character model from a small set of reference images to generate that character consistently across many different prompts and scenes.
  • Creating a brand-specific generation model trained on product imagery, enabling consistent product visualization in any context described in a prompt.
  • Personalizing an image generation model with a specific artistic style by training on a curated set of stylistically consistent reference images.
  • Building a recurring AI spokesperson or avatar from a photograph set for use across marketing, educational, and communications content.
  • Fine-tuning models for domain-specific creative applications where the default base model does not perform well on the specific subjects or styles required.

Ready to create?

Direct scenes, design characters, and ship full films

All-in-one AI creative platform with simple, transparent pricing, no speed throttles, and an infinite Canvas for max creativity.

FAQs

What is DreamBooth?

DreamBooth is a fine-tuning technique that trains an existing AI image generation model on a small set of images, typically three to thirty, depicting a specific subject. The trained model can then generate that subject in any context, style, or pose described in a prompt.

How many images do you need to train a DreamBooth model?

DreamBooth can work with as few as three to five images for basic results, but ten to thirty carefully curated, diverse images typically produce more flexible and consistent output. Image quality and variety matter more than volume.

Who developed DreamBooth?

DreamBooth was developed by researchers at Google and described in a paper published in 2022. It has since been widely adopted and adapted by the open-source AI image generation community.

What is the difference between DreamBooth and LoRA?

DreamBooth produces a full fine-tuned model checkpoint and typically achieves comprehensive personalization. LoRA trains a smaller set of additional weights that are overlaid on the base model, requiring less storage and training time. DreamBooth with LoRA combines both approaches for a balance of quality and efficiency.

What can DreamBooth be used to train?

DreamBooth can train models on specific people, characters, products, artistic styles, pets, objects, and any other subject with distinctive visual features that need to be reproducible across diverse generated contexts.

Does DreamBooth work with any AI image model?

DreamBooth is most commonly applied to Stable Diffusion-based models and their variants, where the open-source model weights can be fine-tuned locally or through cloud training services. It is not applicable to proprietary closed models where the underlying weights are not accessible.

How long does DreamBooth training take?

Training time varies with hardware, dataset size, and training parameters, but a standard DreamBooth run typically takes between fifteen minutes and several hours on consumer-grade or cloud GPU hardware. DreamBooth with LoRA generally trains faster than full weight DreamBooth.

What is prior preservation loss in DreamBooth training?

Prior preservation loss is a technique used during DreamBooth training where additional generic class images are included alongside the subject images to prevent the fine-tuning from degrading the model's ability to generate the general class of subject. For example, when training on a specific person, generic portrait images are included to prevent the model from forgetting what general portraits look like.

Can't find what you are looking for?
Contact us and let us know.
bg