A medium wide shot frames a subject from approximately the knees up, sitting between the medium shot that cuts at the waist and the full wide shot that shows the entire figure in its environment. It captures more of the body and lower limbs than a standard medium shot while retaining more environmental context than a closer framing, making it a versatile middle ground that shows physical action and movement without losing the subject in their surroundings.
The medium wide shot is particularly useful for scenes where the subject's full body language and physical movement matter but the environment is also an important part of the visual narrative. It allows gesture, gait, and physical positioning to read clearly while still establishing spatial context - a character's relationship to a room, their position relative to other people, or the physical environment they are moving through. In television drama and film, medium wide shots often serve as the default coverage position for scenes involving movement, physical interaction, or spatial relationships that closer framings would fragment.
When prompting AI video generation for medium wide shots, describing the subject from knee level upward within a visible environment communicates the intended framing. Specifying both what the subject is doing physically and something about the surrounding space helps generate footage that uses the full value of this framing - showing enough of the body for physical action to read while grounding the subject in a visible, contextual environment.