Question 1

How does an AI model use a style reference image?

Accepted Answer

Different AI systems handle style references in different ways, but most encode the reference image into a vector representation that captures its visual characteristics (colour distribution, texture, spatial frequency, lighting quality) and then use this representation to condition the generation process alongside the text prompt. How strongly the reference influences the output relative to the text prompt is often controlled by a strength or weight parameter, allowing creators to blend reference conditioning with prompt direction at different ratios.
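As a rough illustration of the strength parameter described above, the sketch below blends a text embedding and a reference embedding by linear interpolation. This is a simplifying assumption made for clarity: real systems differ in how they inject reference conditioning, and the function name `blend_conditioning` and the toy two-dimensional vectors are hypothetical.

```python
from typing import List


def blend_conditioning(
    text_emb: List[float], ref_emb: List[float], strength: float
) -> List[float]:
    """Toy blend of two conditioning vectors (hypothetical helper).

    strength = 0.0 uses only the text embedding,
    strength = 1.0 uses only the reference embedding.
    """
    if len(text_emb) != len(ref_emb):
        raise ValueError("embeddings must have the same length")
    if not 0.0 <= strength <= 1.0:
        raise ValueError("strength must be between 0.0 and 1.0")
    # Linear interpolation between the two vectors, element by element.
    return [(1.0 - strength) * t + strength * r for t, r in zip(text_emb, ref_emb)]


# Illustrative values only; real embeddings have hundreds of dimensions.
text_vector = [1.0, 0.0]
reference_vector = [0.0, 1.0]
print(blend_conditioning(text_vector, reference_vector, 0.25))  # [0.75, 0.25]
```

A low strength keeps the result close to what the prompt alone would produce, while a high strength lets the reference dominate.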

Question 2

What makes a good style reference image?

Accepted Answer

A good style reference clearly represents the target aesthetic without containing competing visual styles or distracting content. It should be technically clean: sharp, well-exposed, and free of compression artefacts. It should be relevant to the type of content being generated: a reference from a closely related genre, medium, or production context will condition outputs more effectively than one with a different visual language. High-contrast, stylistically distinctive references tend to produce stronger conditioning effects than images with neutral or averaged aesthetics.

Question 3

Can you use a film still as a style reference?

Accepted Answer

Film stills are among the most commonly used style references in AI generation workflows, as they efficiently communicate cinematographic language including colour grade, lighting quality, lens characteristics, and compositional approach. A well-chosen frame from a film whose aesthetic matches the target visual direction can condition AI generation outputs toward that cinematic look more precisely than an extended text description. When using film stills, be aware that the content of the frame (its characters, environment, and staging) may also influence the generated content, not only the visual style.

Question 4

How many style references should you use at once?

Accepted Answer

The optimal number depends on the generation system and the complexity of the target aesthetic. Single references work well when the target style is unified and clearly represented by one image. Multiple references allow different visual dimensions to be specified separately (colour from one reference, lighting from another, texture from a third) but increase the risk that conflicting visual information produces an averaged or incoherent result. Most generation tools support two to four simultaneous references effectively; beyond this, the conditioning signals tend to interfere with each other.
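The averaging risk mentioned above can be sketched with a toy weighted combination of reference embeddings. This assumes, purely for illustration, that references are merged by a normalised weighted average; actual tools may combine them differently, and `combine_references` is a hypothetical name.

```python
from typing import List


def combine_references(
    ref_embs: List[List[float]], weights: List[float]
) -> List[float]:
    """Weighted average of several reference embeddings (hypothetical helper)."""
    if not ref_embs or len(ref_embs) != len(weights):
        raise ValueError("need one weight per reference embedding")
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    dim = len(ref_embs[0])
    if any(len(emb) != dim for emb in ref_embs):
        raise ValueError("all embeddings must have the same length")
    # Normalise the weights so they sum to 1, then average each dimension.
    return [
        sum((w / total) * emb[i] for w, emb in zip(weights, ref_embs))
        for i in range(dim)
    ]


# Two references that disagree along the first dimension cancel each other out.
print(combine_references([[1.0, 0.0], [-1.0, 0.0]], [1.0, 1.0]))  # [0.0, 0.0]
# Weighting one reference more heavily keeps its signal dominant.
print(combine_references([[1.0, 0.0], [0.0, 1.0]], [3.0, 1.0]))  # [0.75, 0.25]
```

In this toy model, equally weighted references that pull in opposite directions leave no usable signal, which is one way to picture why adding more references can weaken rather than strengthen the result.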

Question 5

How do style references relate to LoRA models?

Accepted Answer

A style reference conditions a single generation session by providing visual information at inference time. A LoRA is a fine-tuned model component trained on a set of style examples that encodes that style into the model's weights, affecting every generation without needing a reference image at each session. LoRAs produce stronger and more consistent style conditioning than reference images for well-defined styles, but require a training process and a sufficient body of training examples. Style references are more flexible and require no training, making them the default approach for style conditioning and LoRAs the appropriate tool when a specific style needs to be applied consistently at production scale.
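To make the contrast concrete, the sketch below shows the low-rank update that gives LoRA its name: a weight matrix W is adjusted by the product of two small matrices, W' = W + scale * (B x A), so the style lives in the weights rather than in a per-session input. The tiny matrices and the helper names `matmul` and `apply_lora` are illustrative assumptions, not any particular library's API.

```python
from typing import List

Matrix = List[List[float]]


def matmul(a: Matrix, b: Matrix) -> Matrix:
    """Plain-Python matrix product, adequate for small examples."""
    return [
        [sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
        for i in range(len(a))
    ]


def apply_lora(base: Matrix, lora_b: Matrix, lora_a: Matrix, scale: float) -> Matrix:
    """Return base + scale * (lora_b @ lora_a), the low-rank weight update."""
    delta = matmul(lora_b, lora_a)
    return [
        [w + scale * d for w, d in zip(w_row, d_row)]
        for w_row, d_row in zip(base, delta)
    ]


# A 2x2 base weight with a rank-1 update (B is 2x1, A is 1x2); values are made up.
base_weight = [[1.0, 0.0], [0.0, 1.0]]
lora_b = [[1.0], [2.0]]
lora_a = [[3.0, 4.0]]
print(apply_lora(base_weight, lora_b, lora_a, 0.5))  # [[2.5, 2.0], [3.0, 5.0]]
```

Because the adjusted weights are used on every forward pass, the style applies to every generation without supplying a reference image, which matches the trade-off described above.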

Question 6

Can style references be used in video generation as well as image generation?

Accepted Answer

Yes, and style references are particularly valuable in video generation because maintaining a consistent visual aesthetic across multiple clips in a production is more challenging than applying it to a single image. Providing the same style reference across all generation sessions for a project anchors the visual language in a way that text prompts alone cannot reliably sustain. Some video generation platforms allow style references to condition not only the colour and light quality of the output but also the motion character and camera movement aesthetic, extending style conditioning beyond the static visual treatment into the temporal dimension of the content.

Question 7

Where does Morphic store style references?

Accepted Answer

Morphic stores style references in the Assets tab of a project, alongside character references, location references, and other input materials. Organising all style references in the project's Assets tab at the outset of a production ensures they are available consistently across all generation sessions within the project, and that all team members working on the project have access to the same reference materials. Naming and annotating reference images in the Assets tab helps maintain clarity about which reference communicates which aspect of the visual direction as the project grows.

Question 8

What is the difference between a style reference and a prompt describing a style?

Accepted Answer

A text prompt describing a style communicates aesthetic qualities through language, which the model interprets based on associations learned during training. A style reference communicates visual qualities directly through the actual visual data of the image. Text descriptions are imprecise, since different people mean different things by words like "cinematic," "moody," or "painterly," while a reference image communicates exact colour relationships, contrast ratios, and textural qualities without ambiguity. The most effective approach combines both: a style reference anchors the visual treatment while text prompts add specificity about subject, context, and aspects of the style that the reference alone cannot communicate.

Style Reference

What is Style Reference?

Types and variations

Common use cases

FAQs