ControlNet is a neural network architecture that adds precise spatial control to image generation models by conditioning generation on input images that define structural, compositional, or stylistic constraints. Creators can guide AI image generation with reference images such as edge maps, depth maps, pose structures, or segmentation masks, gaining far more control over the output than text prompts alone provide.
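The control images listed above are ordinary images, usually produced by running a preprocessor over a reference photo. As an illustration, here is a minimal numpy sketch of a gradient-magnitude edge map, a simplified stand-in for the Canny detector commonly used to prepare edge-map conditioning (the function name and the toy image are illustrative, not part of any ControlNet library):

```python
import numpy as np

def edge_map(img: np.ndarray) -> np.ndarray:
    """Gradient-magnitude edge map (a simplified stand-in for Canny).

    img: 2-D grayscale array with values in [0, 1].
    Returns an array of the same shape; large values mark edges.
    """
    gx = np.zeros_like(img)
    gy = np.zeros_like(img)
    # Central differences in x and y; borders stay zero.
    gx[:, 1:-1] = (img[:, 2:] - img[:, :-2]) / 2.0
    gy[1:-1, :] = (img[2:, :] - img[:-2, :]) / 2.0
    return np.hypot(gx, gy)

# Toy 8x8 image: dark left half, bright right half -> one vertical edge.
img = np.zeros((8, 8))
img[:, 4:] = 1.0
edges = edge_map(img)
```

In practice this role is filled by a real detector (Canny, MiDaS depth, OpenPose), and the resulting image is what ControlNet consumes as the conditioning input.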
The system works by training additional neural network modules that sit alongside a frozen base diffusion model such as Stable Diffusion, processing control images that define the spatial structure the generated output should follow. For example, feeding a pose skeleton into ControlNet constrains the generated character to match that pose, and providing an edge-detected line drawing keeps the final image close to those structural boundaries. Multiple ControlNet modules can be combined and used simultaneously, allowing creators to specify pose, depth, and composition all at once for highly directed generation.
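A key detail of how the added modules coexist with the base model is the zero-convolution trick from the ControlNet paper: each trainable copy of a base block feeds its output through a convolution initialized to all zeros, so at the start of training the combined network behaves exactly like the untouched base model, and control only grows in as those weights learn. A minimal numpy sketch of the idea, with simple linear layers standing in for the real convolutional blocks (all names here are illustrative):

```python
import numpy as np

rng = np.random.default_rng(0)

# Frozen base block: a fixed linear layer standing in for a U-Net block.
W_base = rng.standard_normal((16, 16))

# Trainable copy of the block, plus a "zero convolution" (here a plain
# linear projection) initialized to all zeros, as in the ControlNet paper.
W_ctrl = W_base.copy()          # trainable copy starts from base weights
W_zero = np.zeros((16, 16))     # zero-initialized output projection

def base_block(x):
    return x @ W_base

def controlled_block(x, c):
    # The control branch sees the features plus the encoded control
    # image c; its output passes through the zero-initialized projection
    # before being added back to the frozen base path.
    return base_block(x) + (x + c) @ W_ctrl @ W_zero

x = rng.standard_normal((4, 16))   # stand-in for intermediate features
c = rng.standard_normal((4, 16))   # stand-in for the encoded control image
```

Because `W_zero` starts at zero, `controlled_block(x, c)` initially equals `base_block(x)` for any control input, which is what lets ControlNet be trained without degrading the pretrained model.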
ControlNet has become one of the most influential tools in the AI image generation community since its release, particularly for creators who need reliable control over composition and structure rather than leaving those elements up to the model's interpretation. It bridges the gap between the creative freedom of AI generation and the precision required for professional workflows where specific compositional requirements must be met.