← Back to blog

What Is a Dolly Shot? Definition + Examples

A dolly shot is a camera move where the entire camera physically travels toward or away from the subject on a track or wheeled platform. Plus how it works, examples, and where to use it in AI workflows.

What Is a Dolly Shot?

A dolly shot is a camera move where the entire camera physically travels toward or away from a subject on a track, wheeled platform, or slider, producing smooth, continuous motion that no pan or zoom can replicate.

The difference between a dolly and a zoom matters more than it sounds. A zoom changes the focal length; the background scales but the camera stays put. A dolly moves the whole camera through space; the parallax shift between foreground and background layers changes, giving the viewer a genuine sense of depth and presence. That quality is why directors reach for a dolly when they want the audience to feel like they're entering a scene rather than watching it through a window.

How a dolly shot works

A dolly move has two basic directions: dolly-in (camera travels toward the subject) and dolly-out (camera moves away, also called a pull-back). Both are smooth and motivated by the subject's emotional weight in the scene.

Dolly-in creates intimacy. The frame tightens, the subject grows, background elements drift apart. It's used to signal that something important is about to happen or to draw the viewer into a character's emotional state.

Dolly-out creates distance or revelation. The camera retreats, context expands around the subject, and the viewer understands their place in a larger world. Pull-backs often end scenes, or open them to reveal scale.

There's a third technique that combines the dolly with a simultaneous zoom in the opposite direction: the dolly-zoom, or Vertigo effect, named for Hitchcock's use of it in 1958. The camera dollies out while the lens zooms in (or vice versa), keeping the subject the same size in the frame while the background warps. The result is spatial disorientation used to signal psychological unease or sudden dread.

In traditional filmmaking, a dolly shot requires a physical track, a wheeled camera platform, and a dolly grip coordinating with the camera operator. In AI video generation, you describe the move in the prompt and the model synthesizes it from training data.

When you use a dolly shot

Use a dolly-in when you need to signal emotional escalation without a cut. A character receiving difficult news, a product reveal where the final frame should feel earned, an opening title sequence that wants to feel cinematic rather than static.

Use a dolly-out (pull-back) when you want to establish scale, end a moment, or leave the audience with context after a tight emotional scene. A pull-back that reveals a crowd around a lone figure, or a product in the environment it belongs to, communicates more than a static wide shot.

Use a dolly-zoom when you want deliberate visual unease. It's a recognized cinematic shorthand that audiences read immediately, so reserve it for moments that justify the effect.

Examples in AI prompts

Veo 3.1 dolly-in. Veo 3.1 responds well to explicit motion language. A prompt like "slow dolly-in toward a woman reading at a cafe window, late afternoon light, shallow depth of field, 8 seconds" produces a move that feels physically grounded rather than generated. The model handles the parallax shift between the figure and the background window, which is the visual tell that distinguishes a real dolly from a simulated zoom. See Veo 3 prompt guide for how to structure motion language alongside lighting and lens descriptors.

Veo 3.1 dolly-zoom. For the Vertigo effect, the prompt needs both directions stated clearly: "dolly-out zoom-in, subject stays centered in frame, background stretches, 4 seconds, psychological tension." Veo 3.1 at standard settings can approximate this. The background distortion won't be pixel-perfect, but the spatial disorientation reads in the output. Learn how to chain this with other cinematic moves in Veo 3 prompts for cinematic shots.

Kling 3.0 pull-back reveal. Kling 3.0 handles slow pull-backs with strong environmental coherence. A prompt like "slow pull-back from a coffee mug on a table, camera retreats to reveal a morning kitchen, natural light, cinematic color grade, 6 seconds" gives you a product-in-context reveal that works as an ad opener without needing a physical shoot.

Related concepts


Ready to generate a dolly shot? Open the canvas on 8frame and run a Veo 3.1 prompt with a slow dolly-in.

Related articles

glossaryWhat Is a Creative Canvas? Definition + ExamplesglossaryWhat Is a Hero Shot? Definition + ExamplesglossaryWhat Is a Reference Image in AI? Definition + Examples

Your frames start here

Watch the canvas power your creative flow in real time

Stay in the loop

Be the first to hear about our launch and get product updates