Question 1

What is guidance scale in AI image generation?

Accepted Answer

Guidance scale is a parameter that controls how closely a diffusion model's output adheres to the text prompt. Higher values cause the model to follow the prompt more strictly; lower values give the model more creative freedom to draw on its own learned aesthetic sense, which can produce more visually natural but less literally accurate results.

Question 2

What does CFG stand for in AI image generation?

Accepted Answer

CFG stands for classifier-free guidance, the technical mechanism underlying guidance scale in diffusion models. It works by amplifying the difference between the model's conditioned output (following the prompt) and its unconditioned output (generating without direction), steering the generation toward the prompted content without requiring a separate classifier model.

Question 3

What happens if guidance scale is too high?

Accepted Answer

At very high guidance scale values, outputs tend to become over-saturated, visually harsh, and artificially sharp, with a quality sometimes described as burnt. The model overcommits to each element of the prompt independently without balancing them naturally, often producing images that feel hyperreal or plasticky rather than cohesive.

Question 4

What happens if guidance scale is too low?

Accepted Answer

At very low values, the model largely ignores the prompt and generates images based on its own learned prior, which may be aesthetically pleasing but will not match the described content. Specific subjects, objects, or compositional elements called for in the prompt may be absent or ambiguous in the output.

Question 5

What is a good guidance scale to start with?

Accepted Answer

A value between 7 and 12 is a reasonable starting point for most Stable Diffusion-based models, while newer architectures like Flux often perform better at lower values in the 2–5 range. The optimal value depends on the specific model, prompt complexity, and desired aesthetic, so experimentation within the effective range of the model being used is the most reliable approach.

Question 6

Does guidance scale affect video generation as well as images?

Accepted Answer

Guidance scale applies to video diffusion models in the same way it does to image models, controlling how closely the generated video follows the text prompt at each denoising step. The interaction between guidance scale and temporal coherence in video generation can be more complex than in still image work, and different video models may have narrower effective guidance ranges.

Question 7

Is guidance scale the same in all AI generation tools?

Accepted Answer

The underlying concept is consistent across diffusion-based models, but the effective numerical range, the default value, and how the parameter is labelled varies between tools and model architectures. What reads as a high guidance scale in one model may behave differently in another, so understanding the specific behaviour of the model being used is more useful than applying a universal rule.

Question 8

Can guidance scale fix a bad prompt?

Accepted Answer

Guidance scale modulates how strongly the model follows the prompt but cannot compensate for a prompt that is unclear, contradictory, or outside the model's capability. If the concept described is not well represented in the model's training data, increasing guidance scale will only force a more committed but still incorrect interpretation. Improving the prompt itself is always more effective than adjusting guidance scale alone.

Guidance Scale

What is Guidance Scale?

Direct scenes, design characters, and ship full films

Types and variations

Ready to make your first scene in Morphic?

Common use cases

Direct scenes, design characters, and ship full films

FAQs