Noise / Noise Level
What is Noise / Noise Level?
In AI generation, noise is the random static that diffusion models start from and progressively clean up to create images. The noise level controls how much the model departs from an input image when doing image-to-image generation.
At a glance
- Also known as
- Denoising strength (in img2img context)Noise strengthGaussian noise (technical term for the specific noise type used)
- Used for
- The starting point of all diffusion model generation (pure noise → image)Controlling the degree of transformation in image-to-image generationUnderstanding the relationship between randomness, seed, and generation output
- Common tools
- Stable diffusionComfyUIAutomatic1111All diffusion-based generation platforms
- Related terms
- Diffusion modelDenoisingSamplingCFG scaleSeedImage-to-imageLatent space
- How it works in simple terms
- A diffusion model is trained by learning how to reverse a process of adding random noise to images. At generation time, it starts with random noise and uses the prompt to guide its removal of noise step by step, producing a structured image as the noise is progressively resolved.
- Where you encounter this
- Noise level appears as a denoising strength or img2img strength parameter in image-to-image generation workflows. Every generation from a text prompt begins with noise, but the starting noise is usually automatically managed; the noise level parameter is directly user-controllable primarily in img2img and inpainting contexts.
Ready to create?
Direct scenes, design characters, and ship full films
All-in-one AI creative platform with simple, transparent pricing, no speed throttles, and an infinite Canvas for max creativity.
How it compares
Compared with related concepts
Noise level (denoising strength) and CFG scale are the two primary parameters that control how strongly a diffusion model follows its conditioning signals. CFG scale controls how strongly the model follows the text prompt relative to producing a generic, prompt-free output. Noise level in img2img controls how strongly the model follows the prompt relative to preserving the input image. High CFG scale produces outputs that more aggressively match the prompt. High noise level produces outputs that more aggressively diverge from the input image. Both parameters shape the balance between conditioning strength and generation freedom, but at different points in the conditioning hierarchy.
Think of it like…
Noise level in image-to-image generation is like a sculptor deciding how much clay to remove from an existing sculpture before reshaping it: low noise level is like making small, refinement-focused changes to the original form; high noise level is like removing so much clay that only the rough general mass remains, then reshaping it almost from scratch according to new intentions.
Pro tip
In image-to-image workflows, use noise level as a creative parameter rather than treating it as a binary 'strong' or 'weak' transformation setting. A noise level of 0.4–0.6 is often the most productive creative range: enough freedom for the model to meaningfully reinterpret the input according to the prompt, but enough structural continuity with the input that the composition, lighting, and spatial relationships are preserved as a useful foundation. Very high noise levels (0.8+) are appropriate when using the input image primarily as a loose reference for composition; very low levels (below 0.3) are appropriate for light stylistic adjustments while preserving the original image almost intact.
Types and variations
- Gaussian noise is the specific random noise type used in standard diffusion model processes: values that follow a normal (bell-curve) statistical distribution.
- In text-to-image generation, the starting noise is pure Gaussian noise.
- In image-to-image generation, a controlled amount of Gaussian noise is blended with the encoded input image before denoising.
- Different noise schedules (linear, cosine, others) define how noise is distributed across the denoising timesteps and produce different generation characteristics.
- Film grain is a visually related but technically distinct form of image noise introduced as an aesthetic element in photographic and cinematic imagery, and can be requested in AI generation prompts as a stylistic element separate from the functional noise of the diffusion process.
Ready to make your first scene in Morphic?
Try MorphicCommon use cases
- Understanding noise level is most directly relevant in image-to-image (img2img) generation workflows, where the denoising strength parameter directly controls the degree of transformation of the input image.
- It is also relevant in inpainting, where the noise added to masked regions determines how liberally the model fills them.
- In text-to-image generation, the seed ( the specific random noise pattern used as starting point ) is the primary noise-related user parameter, as it controls generation reproducibility.
- In advanced workflows, custom noise injection and noise schedule manipulation are used to achieve specific stylistic or compositional effects.
Ready to create?
Direct scenes, design characters, and ship full films
All-in-one AI creative platform with simple, transparent pricing, no speed throttles, and an infinite Canvas for max creativity.
FAQs
In AI image generation, noise refers to random statistical variation ( similar to television static ) that diffusion models use as the starting point for generation. The model is trained to progressively remove noise from a random starting tensor, guided by a text prompt or other conditioning signal, until a coherent image emerges. Every generation from a text prompt begins with pure noise and ends with a structured image.
Noise level (often called denoising strength in image-to-image contexts) is a parameter that controls how much noise is added to an input image before the denoising generation process begins. A high noise level adds more noise, giving the model more freedom to diverge from the original image. A low noise level adds less noise, keeping the output closer to the input. It is the primary parameter for controlling the degree of transformation in img2img generation workflows.
The seed is a number that determines the specific pattern of random noise used as the starting point for generation. Because diffusion models begin with noise and the model's denoising path depends on the specific noise it starts with, different seeds produce different noise patterns, which lead the denoising process along different paths and produce different outputs, even with identical prompts and settings. This is why the same prompt can produce very different images depending on the seed used.
Generation steps refer to the number of denoising iterations the model performs, progressively removing noise from the starting image at each step. More steps allow finer, more incremental noise removal, generally producing higher quality, more detailed outputs but requiring more computation time. Fewer steps produce faster results that may lack fine detail. The noise level decreases with each step: full noise at step one, near-zero noise at the final step.
For meaningful creative transformation while preserving the input's general composition and spatial structure, a range of 0.4–0.65 is typically most productive. Below 0.3 makes only light stylistic changes, useful for subtle adjustments. Above 0.75 produces strong divergence from the input, treating it primarily as a loose compositional reference. The ideal value depends on how much you want the output to reflect the input versus the prompt: it is worth experimenting across this range to understand how a specific model responds.
No. Film grain is a visual aesthetic element: a pattern of visible texture that results from the silver halide crystals in photographic film or is synthetically introduced in digital imaging for aesthetic effect. Generation noise in diffusion models is a mathematical construct ( Gaussian random values ) used as the starting material for the denoising generation process. The two are conceptually related in that both involve random variation in image values, but film grain is a visible aesthetic choice; generation noise is an internal technical mechanism that is resolved away during the generation process.
A noise schedule defines how noise is distributed and reduced across the denoising steps of the generation process: how much noise is removed at each step, from maximum noise at the start to minimum noise at the end. Different noise schedules (linear, cosine, exponential) produce different distributions of the denoising work across steps, affecting generation quality and the character of the output. The noise schedule is typically managed by the sampler algorithm selected for the generation and is not usually directly user-controlled in consumer-facing interfaces.
Yes, in several ways. In img2img workflows, noise level is a direct creative parameter: varying it produces outputs that range from close refinements to radical reinterpretations of the input. Different seed values produce creative variation from a single prompt, which can be exploited systematically by generating multiple seeds and selecting the most interesting output. Some advanced workflows inject custom noise patterns or use specialised noise types to achieve specific stylistic effects. Film grain as an aesthetic element can also be requested in prompts as a distinct visual quality of the output.