Question 1

What is a reference image in AI generation?

Accepted Answer

A reference image is a visual input provided to an AI generation model to guide aspects of the generated output: style, character appearance, composition, colour palette, or other visual qualities. It communicates visual information that text prompts cannot fully specify, providing a direct visual anchor for the model to extract and apply to its generation.

Question 2

What is the difference between IP-Adapter and ControlNet for reference images?

Accepted Answer

IP-Adapter encodes the overall visual features of a reference image ( aesthetic qualities, colour relationships, visual style ) and uses them to influence the generation without requiring spatial alignment between reference and output. ControlNet extracts specific structural information ( pose, edges, depth ) from a reference and uses it to constrain the spatial arrangement of the generated output while allowing visual re-styling. IP-Adapter guides aesthetic; ControlNet guides structure.

Question 3

Can I use any image as a reference?

Accepted Answer

Any image can in principle serve as a reference, but the quality and clarity of the reference directly affects the quality and precision of the conditioning. Clear, unambiguous images that prominently feature the specific quality you want to extract: the character face for character consistency, the distinctive colour palette for style guidance, the specific pose for pose conditioning: produce better conditioning results than cluttered, ambiguous, or low-quality references. Choose references that clearly and unambiguously show what you want the model to pick up.

Question 4

How do reference images help with character consistency?

Accepted Answer

Character reference images provide the model with a specific visual specification of a character's appearance ( their face, proportions, hair, and distinctive features ) that text description alone cannot precisely anchor. By conditioning each generation on the same character reference through IP-Adapter or platform-specific consistency features, the model produces outputs that reflect the reference character's appearance rather than generating a new variation of the described type for each output.

Question 5

What is a style reference image?

Accepted Answer

A style reference image guides the overall aesthetic, colour palette, tone, and visual character of the generation: communicating a desired look and feel rather than specific subject content. It tells the model how to render the scene, not what to render. Style references are particularly effective for establishing consistent visual identity across a body of generated work and for communicating aesthetic directions that are difficult to fully specify in text.

Question 6

What is a mood board and how does it relate to reference images?

Accepted Answer

A mood board is a curated collection of reference images that collectively define the visual direction, aesthetic sensibility, and tonal character for a project or production. In AI generation, mood board images serve as style references that guide the overall visual identity of generated content. Some platforms support multiple reference images simultaneously; others require selecting the single most representative reference. A well-curated mood board distils complex aesthetic vision into concrete visual examples the model can respond to.

Question 7

Can reference images override text prompts?

Accepted Answer

The balance between reference image conditioning and text prompt influence depends on the technical approach used and its strength settings. Strong reference conditioning (high IP-Adapter weight, strong ControlNet guidance) can dominate the generation, with text prompt guidance playing a secondary role. Lighter conditioning allows more text prompt influence. In practice, the most effective approach is to set conditioning strength so that both reference and text contribute meaningfully: the reference anchoring the visual quality or structure while the text prompt guides content and context.

Question 8

Is using a copyrighted image as a reference legal?

Accepted Answer

The legal status of using copyrighted images as references in AI generation is an area of active legal development and genuine uncertainty. Providing a reference image to condition generation is technically distinct from reproducing the image, but the outputs may reflect the style or visual character of the reference in ways that could be considered legally relevant, depending on jurisdiction and specific circumstances. When in doubt about commercial use of reference-conditioned generations, consult relevant legal guidance and consider using original, owned, or licence-cleared images as references.

Reference Image

What is Reference Image?

Direct scenes, design characters, and ship full films

Types and variations

Ready to make your first scene in Morphic?

Common use cases

Direct scenes, design characters, and ship full films

FAQs