Veo 3.1 Cinematic Prompt Pack
Ten field-tested Veo 3.1 prompts you can paste straight into Flow, Vertex, fal, or ShortsFast — and a JSON file you can pipe into your own scripts.
What's in the pack: each prompt builds on Google's recommended five-part structure (cinematography, subject, action, context, style), stays within Veo 3.1's 8-second ceiling, and quotes dialogue inline so the model lip-syncs instead of mumbling. The Prompt structure list below breaks this into seven fields, calling out lighting and audio separately.
Download as JSON
10 prompts · structured · pipe into any pipeline
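As a rough sketch of piping the JSON file into a script, the Python below parses the pack and rejects any prompt that declares more than 8 seconds. The schema is an assumption: the pack's actual layout is not shown here, so the list-of-objects shape and the `title` and `prompt` keys are hypothetical, as are the function names.

```python
import json
import re
from typing import Dict, List, Optional

MAX_SECONDS = 8  # Veo 3.1 clip ceiling, as stated in this pack


def declared_seconds(prompt: str) -> Optional[int]:
    """Return the last numeric 'N seconds' duration in a prompt, or None."""
    matches = re.findall(r"(\d+)\s+seconds\b", prompt)
    return int(matches[-1]) if matches else None


def load_pack(raw_json: str) -> List[Dict[str, str]]:
    """Parse the pack and reject prompts that declare more than MAX_SECONDS."""
    entries = json.loads(raw_json)
    for entry in entries:
        # The "prompt" and "title" keys are assumed, not taken from the real file.
        seconds = declared_seconds(entry["prompt"])
        if seconds is not None and seconds > MAX_SECONDS:
            title = entry.get("title", "untitled")
            raise ValueError(f"{title}: {seconds}s exceeds the {MAX_SECONDS}s ceiling")
    return entries


if __name__ == "__main__":
    sample = '[{"title": "Demo", "prompt": "Static wide shot. No music. 8 seconds."}]'
    for entry in load_pack(sample):
        print(entry["title"], declared_seconds(entry["prompt"]))
```

If the real file uses different key names, only the two dictionary lookups need to change.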
Prompt structure
- Subject — specific noun phrase with distinctive detail
- Action — precise verb chain with a clear motion endpoint
- Environment — 3-4 concrete elements, not adjective dumps
- Camera — one shot type and one movement, never two
- Lighting / mood — direction + quality + emotional word
- Audio — quoted dialogue, ambient bed, and sound effects
- Style — one or two film references over generic adjectives
For a full walkthrough, see the prompt structure post and the Veo 3.1 fact sheet.
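The seven fields above can be sketched as a small Python helper that assembles one prompt string in the listed order. This is an illustration only: `ShotSpec`, `to_prompt`, and the lower bound on duration are hypothetical choices, not part of any Veo tooling. Only the 8-second ceiling comes from this pack.

```python
from dataclasses import dataclass

MAX_SECONDS = 8  # Veo 3.1 clip ceiling, as stated in this pack


def _sentence(text: str) -> str:
    """Trim a field and make sure it ends like a sentence."""
    text = text.strip()
    return text if text.endswith((".", "!", "?", '"')) else text + "."


@dataclass
class ShotSpec:
    subject: str
    action: str
    environment: str
    camera: str
    lighting: str
    audio: str
    style: str
    seconds: int = MAX_SECONDS

    def to_prompt(self) -> str:
        """Join the seven fields in list order and append the duration."""
        if not 0 < self.seconds <= MAX_SECONDS:
            raise ValueError(f"duration must be 1 to {MAX_SECONDS} seconds")
        fields = [
            self.subject,
            self.action,
            self.environment,
            self.camera,
            self.lighting,
            "Audio: " + self.audio,
            self.style,
        ]
        body = " ".join(_sentence(field) for field in fields)
        return f"{body} {self.seconds} seconds."


if __name__ == "__main__":
    spec = ShotSpec(
        subject="A matte black coffee grinder on a stainless steel counter",
        action="Two gloved hands pour whole beans into the hopper and press the button",
        environment="Industrial kitchen, large window on the left",
        camera="Handheld slow push-in, 35mm feel",
        lighting="Overcast daylight, soft shadows",
        audio="pour of beans, grinder whir, no music",
        style="Shot like a product film",
    )
    print(spec.to_prompt())
```

Keeping each field as a separate attribute makes it easy to swap only the camera or audio line when iterating on a shot.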
Paste-ready recipes
1 — Talking-head confession (dialogue + sync audio)
A man in his early 40s in a grey hoodie sits alone in a cramped home studio lit only by a monitor. He looks directly into the camera and says, "I shipped three products this year. Two of them failed, and I am so grateful for it." Locked 50mm medium shot, shallow depth of field. Cool monitor glow from the front, warm key light from a small lamp on the right. Audio: only the subject's voice and a soft room tone, no music. Shot like a video diary, 8 seconds.
Note: Quote the line verbatim. Veo 3.1 lip-syncs quoted speech far more reliably than described speech.
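A quick way to confirm a dialogue prompt actually carries quoted speech is a small lint like the sketch below. It only checks that double-quoted text is present (straight or curly quotes); it says nothing about how Veo 3.1 will render the line, and the function names are hypothetical.

```python
import re
from typing import List

# Matches text between straight or curly double quotes.
_QUOTED = re.compile(r'["\u201c]([^"\u201c\u201d]+)["\u201d]')


def quoted_lines(prompt: str) -> List[str]:
    """Return every double-quoted line of dialogue found in the prompt."""
    return [match.strip() for match in _QUOTED.findall(prompt)]


def has_dialogue(prompt: str) -> bool:
    """True when the prompt contains at least one quoted line."""
    return bool(quoted_lines(prompt))


if __name__ == "__main__":
    prompt = 'He looks into the camera and says, "I shipped three products this year." Locked 50mm medium shot.'
    print(quoted_lines(prompt))
```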
2 — Product reveal with diegetic sound
A matte black coffee grinder sits on a stainless steel counter in an industrial kitchen. Two gloved hands enter frame, pour whole beans into the hopper, and press the button. The grinder whirs for two seconds, then stops. Handheld slow push-in, 35mm feel. Overcast daylight from a large window on the left, soft shadows. Audio: the pour of beans, the grinder whir, distant rain on the skylight, no music. 8 seconds.
3 — Two-person bar conversation (multi-speaker)
Two women in their late 30s lean on a dark wooden bar, both holding amber cocktails. The first says, "So what's your actual plan?" The second laughs softly, then answers, "I don't have one yet." Locked medium two-shot, 35mm, shallow focus. Warm tungsten edge light from behind the bar, cool backlight from a street window. Audio: both voices clear, quiet jazz bed, glassware clink. 6 seconds.
4 — Kinetic POV street shot (faceless narrator)
First-person POV. A leather boot steps off a curb into a wet crosswalk in Seoul. The camera rises and holds on a red-and-yellow taxi passing left to right, then tilts up to a neon storefront. Handheld POV, 28mm wide. Overcast cool blue hour, pink neon fill. Audio: wet footsteps, taxi hiss, distant traffic, one far-off horn. No music. 8 seconds.
5 — Intimate close-up emotional beat
A locked extreme close-up on a woman's hand holding a folded letter. Her thumb traces the crease twice, then her hand slowly lowers out of frame. Static 85mm macro, shallow depth. Late-afternoon window light from the right, soft golden fall-off. Audio: rustle of paper, slow exhale, faint vinyl crackle, no music. 6 seconds.
6 — Food top-down beauty shot
Overhead locked shot. Two hands lower a round of fresh dough onto a flour-dusted marble counter, press it flat, then sprinkle torn basil across the top. 50mm overhead, faint motion from the hand movement. Warm key from a pendant lamp above, no harsh shadows. Audio: press of dough, rustle of basil, soft kitchen ambience, light piano bed. 8 seconds.
7 — Reference-image continuation (Extend workflow)
Using the reference image as the subject: the man walks three steps forward, stops, and glances over his right shoulder, then exits frame right. Tracking handheld 35mm, following from behind. Lighting consistent with the reference (low sun from the left). Audio: gravel crunch, wind in dry grass, one distant bird call. No music. 6 seconds.
Note: When using a reference image, describe what the subject does — not their face. Redescription fights the reference and produces morph.
8 — Architecture establishing shot (no people)
A slow crane-up shot starting at street level on a rain-slick concrete plaza, rising to reveal a brutalist library facade in the distance. Pre-dawn blue hour, single warm sodium streetlamp on the right. Smooth crane arm, 24mm wide. Audio: light rain on stone, distant traffic, one car door closing offscreen. No music. 8 seconds.
9 — Hands-only tutorial beat
Top-down medium shot of two hands on a wooden desk. The right hand holds a black fountain pen and writes the words 'ship it' on a cream index card, then taps the card twice with the back of the pen. Locked 50mm overhead, soft warm key from a desk lamp on the left, gentle shadow on the right. Audio: pen scratch on paper, two soft taps, paper rustle, no music. 6 seconds.
10 — Ambient transition shot (B-roll)
Static wide shot of an empty cafe at 6am — chairs upturned on tables, a single barista wiping down the counter in soft focus far background. Cold daylight from the storefront window, warm pendant lamp above the counter. 35mm locked, no camera movement. Audio: distant cloth wipe, espresso machine hiss, one gentle door bell offscreen. No music. 8 seconds.
Note: Static-locked B-roll wins where in-clip movement would clash. Pair with a Kling 2.5 Turbo kinetic shot in your edit.
Run every recipe in one account
ShortsFast bundles Veo 3.1 with every other frontier model under one flat $20/mo plan. Paste any recipe, pick Veo 3.1, render.
Last updated 2026-04-30. Recipes are updated when the model is updated. ShortsFast has no affiliation with the model vendors listed.