Model Architecture
What is Model Architecture?
Model architecture is the blueprint of an AI's brain: it describes how many layers it has, what type of calculations each layer performs, and how information travels from one end to the other. Different blueprints make AI better at different tasks.
At a glance
- Also known as
- Network architectureNeural network architectureModel design
- Used for
- Defining AI capabilitiesImage and video generationLanguage understandingModel selection and evaluation
- Common tools
- PyTorchTensorFlowHugging face transformersJAX
- Related terms
- TransformerDiffusion modelGANModel trainingLatent space
Ready to create?
Direct scenes, design characters, and ship full films
All-in-one AI creative platform with simple, transparent pricing, no speed throttles, and an infinite Canvas for max creativity.
How it compares
Architecture is the fixed blueprint: the arrangement of layers and operations. Weights are the numerical values learned during training that fill in that blueprint. You can have two models with identical architectures but completely different weights (and therefore completely different behaviours), just as two buildings with the same floor plan can be furnished and decorated entirely differently.
Think of it like…
Think of model architecture like the design of a factory. The architecture specifies how many assembly lines there are, what machines sit on each line, and in what order materials pass through them. The specific settings and calibrations of those machines ( learned through training ) are like the model weights. The factory design (architecture) determines what it's capable of making; the calibration (weights) determines how well it makes it.
Pro tip
When evaluating AI tools for a specific task, look beyond marketing and check which architectural family the underlying model belongs to: diffusion models, transformers, and GANs have meaningfully different trade-offs in terms of inference speed, output diversity, and fine-tuning flexibility that will affect your production workflow.
Types and variations
- The major architectural families relevant to AI media tools include convolutional neural networks (CNNs), which dominated image recognition and early generative tasks; generative adversarial networks (GANs), which pair a generator and discriminator in an adversarial training loop; variational autoencoders (VAEs), which learn compressed latent representations of data; transformer architectures, which use self-attention mechanisms and form the backbone of most modern language and multimodal models; and diffusion architectures, which model data generation as a learned denoising process.
- Hybrid architectures that combine elements of these families: such as the latent diffusion models used in Stable Diffusion: are increasingly common.
Ready to make your first scene in Morphic?
Try MorphicCommon use cases
- Model architecture is a consideration whenever selecting or comparing AI tools for image generation, video synthesis, audio processing, or language tasks.
- Understanding that Stable Diffusion uses a latent diffusion architecture, for instance, explains why it can be run on consumer GPUs (the diffusion process operates in a compressed latent space rather than full pixel space).
- Architecture also matters when fine-tuning models: different architectures accept different fine-tuning methods, and techniques like LoRA (Low-Rank Adaptation) are designed around the specific structure of transformer layers.
Ready to create?
Direct scenes, design characters, and ship full films
All-in-one AI creative platform with simple, transparent pricing, no speed throttles, and an infinite Canvas for max creativity.