Shot

What is Shot?

A shot is a single unbroken clip from start to stop: the basic building block of every film and AI video. A scene is built from multiple shots; a sequence from multiple scenes.

At a glance

Also known as
ClipTakeCamera angleGeneration (in AI video context)
Used for
Providing the fundamental unit from which scenes, sequences, and complete productions are assembledCommunicating specific emotional, spatial, and narrative information through framing, angle, and movementDefining the output unit of each AI video generationPlanning production coverage through shot lists that enumerate every required capture
Common tools
AI video generation platforms (each generation produces one shot)Camera and lens systems (for physical production)Shot list tools (for pre-production planning)Non-linear editing software (for assembling shots into sequences)
Related terms
SceneSequenceCoverageShot listFramingCamera movement

Ready to create?

Direct scenes, design characters, and ship full films

All-in-one AI creative platform with simple, transparent pricing, no speed throttles, and an infinite Canvas for max creativity.

How it compares

How it compares

Compared with related concepts

A shot and a scene are often confused by creators new to film structure. A shot is a single unbroken recording; a scene is a dramatic unit made up of multiple shots. A single conversation might be represented by ten or fifteen shots: establishing wide, medium coverage of each participant, close-ups, reaction shots: all assembled into one scene. The distinction matters practically for AI generation planning: each shot requires a separate generation; each scene requires planning which shots it needs before generation begins.


Think of it like…

A shot is the basic sentence of the visual language of film: just as a paragraph is built from sentences and a chapter from paragraphs, a scene is built from shots and a film from scenes. Just as a well-formed sentence serves a clear communicative purpose, a well-conceived shot serves a clear narrative or emotional function within the scene it belongs to.


Pro tip

Before generating any AI video clip, define the shot's specific function within its scene: what information does it establish, what emotion does it convey, what happens after it in the edit, and what came before? A shot generated with a clear functional purpose produces more useful output than a shot generated as footage without an assigned role. The most common efficiency problem in AI video production is generating appealing but functionally undefined footage that does not edit together into coherent scenes.

Types and variations

  • Shots are categorised primarily by their framing: close-up (CU) and extreme close-up (ECU) for intimate emotional detail; medium close-up (MCU) and medium shot (MS) for the balance of character and context used in most dialogue coverage; wide shot (WS) and long shot (LS) for establishing geography and scale; and establishing shot (ES) for scene-opening orientation.
  • By movement: static shots, pans, tilts, dolly shots, push ins, pull outs, tracking shots, crane shots, and handheld shots each produce different spatial and emotional qualities.
  • By angle: eye level, low angle, high angle, Dutch angle, bird's-eye, and worm's-eye each communicate different psychological relationships between viewer and subject.

Ready to make your first scene in Morphic?

Try Morphic

Common use cases

  • Shot planning is the foundation of every production: the process of identifying what shots a script, brief, or creative vision requires before generation or filming begins.
  • Shot-by-shot generation is the primary AI video production workflow: generating each planned shot individually, then assembling them in editing.
  • Shot analysis is applied in film study and creative research, breaking down how specific directors and cinematographers use shot choices to tell stories and shape viewer experience.
  • Shot communication is the language through which directors, DPs, and AI prompt designers specify their creative intentions: the shared vocabulary of framing, angle, and movement that translates a creative vision into a concrete visual instruction.

Ready to create?

Direct scenes, design characters, and ship full films

All-in-one AI creative platform with simple, transparent pricing, no speed throttles, and an infinite Canvas for max creativity.

FAQs

What is a shot in filmmaking?

A shot is a single unbroken recording from when the camera starts capturing until it stops: the fundamental unit of filmmaking and AI video generation. All visual content is composed of shots assembled in editing into scenes, sequences, and complete productions. Every generation in an AI video workflow produces one shot.

What is the difference between a shot and a scene?

A shot is a single unbroken recording. A scene is a dramatic unit set in a single location and time period, made up of multiple shots that together cover the scene's action from different angles, distances, and perspectives. A dialogue scene might require ten to fifteen shots; an action scene might require more. The scene is constructed in editing by assembling individual shots.

What are the main types of shots defined by framing?

The primary shot types by framing are the extreme close-up (ECU), close-up (CU), medium close-up (MCU), medium shot (MS), medium long shot (MLS), long shot (LS), and extreme long shot (ELS). Each brings the viewer closer to or further from the subject, communicating different levels of emotional intimacy and spatial context. The choice of framing is one of the most fundamental creative decisions in shot design.

What information does camera angle communicate in a shot?

Camera angle communicates the psychological relationship between the viewer and the subject. Eye level implies equality and neutral observation. Low angle ( camera below the subject looking up ) makes the subject appear powerful, dominant, or threatening. High angle ( camera above looking down ) makes the subject appear smaller, vulnerable, or observed. Extreme angles like bird's-eye (directly overhead) and worm's-eye (directly below) create dramatic, unusual perspectives that call attention to themselves as deliberate expressive choices.

How does shot duration affect its function?

Shot duration directly shapes the viewer's experience of time and pacing. Brief shots of one to two seconds create urgency and energy in action sequences. Standard coverage shots of four to eight seconds allow action and performance to develop. Extended shots of thirty seconds or more ( long takes ) place the viewer in real time with the action, creating a sense of presence and continuous experience impossible through editing. The appropriate duration depends on the scene's emotional register and the pacing of the surrounding edit.

Why does thinking in shots improve AI generation results?

Generating with a clear shot definition: specific framing, camera movement, angle, and narrative function: produces outputs with defined purpose and editorial utility. Generating footage without specifying what type of shot it is and what function it serves tends to produce generically composed, editorially ambiguous clips that may look appealing individually but do not assemble into coherent scenes. Shot-specific prompting translates directly into generation outputs with defined compositional intent.

What is a master shot?

A master shot is a wide or medium wide shot that captures the complete action of a scene from a single camera position, providing coverage of the full spatial and dramatic event without emphasising any specific element. Directors often shoot the master first to establish the geography and action of the scene, then move in for closer coverage. In AI generation, generating a master shot of a scene provides the spatial reference against which all other coverage shots should be visually coherent.

How many shots does a typical scene require?

The number of shots per scene varies considerably by scene type and complexity. A simple two-person dialogue scene might require eight to fifteen shots for complete coverage: an establishing wide, a two-shot, singles on each participant, close-ups for emotional beats, and reaction shots. An action scene might require twenty or more shots from multiple angles to convey geography, choreography, and energy. A simple single-character moment might need only two or three. In AI generation, each shot requires a separate generation, so coverage planning directly determines the number of generation runs a scene requires.

Can't find what you are looking for?
Contact us and let us know.
bg