← Back to Work
Project 04 Cinematic Commercial · Helados Himmel · Appenzell, Switzerland

HIMMEL
APPENZELL

A cinema-grade commercial — eighteen shots, a full alpine world, one hero character — directed end to end by a single person. No actors. No location scout. No crew. I'm documenting it here for one reason: to show where the value actually sits now. Everyone has the same tools. Almost no one knows how to aim them.

The finished commercial. Eighteen shots, cut to one film.
// What this project is really about

The era of creative directors · 2026

For a hundred years, the thing that kept commercials like this out of reach wasn't talent. It was the bill — and now the bill is gone, the tools are everywhere, and what's left standing is the director.

A film-grade commercial used to mean a budget you had to respect: actors and their day rates, a location you fly to and pay for, a crew of a dozen — DP, gaffer, art department, grips, a post house. That cost structure is exactly what made this level of work the property of brands with deep pockets, and the reason a small ice cream maker in Appenzell could never have afforded what you're about to see.

I directed this entire commercial alone, and not one of those line items existed. No actor was hired — the Senn is generated and held identical across the film. No location was rented or traveled to — the hütte, the barn, the valley, the cellar, all summoned. No wrangler, no crew, no shoot day. The functions of a full production house were all there. They simply lived in one head that knew how to direct them.

This is the part most people miss about this moment. The tools are now everywhere and nearly free; what's scarce is the judgment to direct them. The budget was never the real constraint. The direction was. That's the whole point of documenting this shot by shot — to make the difference visible.

// The archer, not the arrow

I directed. The tools executed.

Here is how it was actually made, with the credit where it belongs. I built the prompt system together with Claude — refining the structure, the photo-role logic, and the anti-deformation defenses shot by shot. I iterated the winning frames in Nano Banana Pro until each still held. The motion came from Seedance, Kling 3.0, and Cinema Studio 3.5, each chosen for what it does best. The grade and the by-hand fixes were mine, in Photoshop and Lightroom. The final voice was made in ElevenLabs.

These tools are extraordinary, and I'll name them every time. But that's exactly the point of this case study: anyone reading this can open the same software tonight. The tools are not the advantage. They are arrows — and arrows hit nothing on their own.

Everyone can buy the same bow now. Coordinating it with direction and taste until it lands? Few can.

What the tools cannot do is decide. Direction, judgment, the order of operations, knowing what's wrong in a frame and how to fix it — that is the work, and it doesn't come in a subscription. Take the director out and you get pretty noise. That is the thing worth paying for, and it's the thing this whole project exists to demonstrate.

// Step one — the vision

The storyboard, first

Before a single shot was generated, the whole film had to exist on one page. The storyboard is eighteen 16:9 panels on cream paper, loose graphite, director's-sketch style: two or three strokes per form, the irregular relief of a human hand. No anime, no vector, no heavy rendering. The same Senn carried across every frame so they read as one artist's hand, not eighteen.

It matters that this came first, and it matters how it was asked for. The storyboard prompt is the seed of the entire system — it dictates each panel explicitly: the lone hut under the Säntis, the vest going on, the milking from behind, the dripping comb, the copper cauldron, the spoon to the mouth at dusk. A generator told to "imagine a commercial" improvises and drifts. Dictate every frame, lock the character, and you get a coherent vision you can then execute shot by shot. The storyboard didn't just illustrate the film — it proved the prompting method that built it.

Storyboard — lámina completa, 18 viñetas
The storyboard — eighteen panels, one hand, pencil style.
El Senn / casting
The cast of one — never auditioned, never paid, held identical across 18 shots.
// The workflow

How eighteen shots got made

The pipeline was the same for every shot, and it was anything but improvised: storyboard → a written prompt → a still image → a video built from that still.

  1. The storyboard set the vision — all eighteen frames, locked.
  2. Eighteen image prompts were written — one per shot, each a rigorous, structured document.
  3. Eighteen stills were generated — iterated in Nano Banana Pro until each frame held: the photographic truth of the shot, the hero image.
  4. Each still became a video — the motion was generated from the still, so the look was locked before anything moved. The engine was chosen by the mechanics of the shot, not by habit.

Three shots refused this clean path and demanded their own technique. They get their own section — they're the best proof that the shot dictates the tool, never the reverse.

The anatomy of a prompt

This is where the expertise lives, and it's the opposite of a one-line wish. Every image prompt followed eleven fixed blocks, in order: Main concept · Subject · Location · Framing · Camera/Lens · Lighting · Color · Visual direction · Realism · Avoid · Final result. Nothing was left to the model's taste.

On top of that ran a scalable system of photo roles. Each reference image was numbered and given a job, with an explicit instruction of what to preserve and what to change: LOCATION (the place), CHARACTER (the Senn — face, morphology, Tracht), PROP (a hero object), ANIMAL (the breed — Braunvieh, Appenzeller dog), POSE/ACTION (a specific posture, when needed). A simple shot used two references; a complex one, four. It scaled cleanly.

// Surgical precision — real prompt
Shot 08 · The Milking
Create a realistic cinematic still based on the established Appenzell Senn character, the alpine stable, and the Braunvieh cow reference. Photo 01 location · Photo 02 character · Photo 03 animal · Photo 04 prop. Medium shot, three-quarter rear angle: the Senn on a low three-legged Melkstuhl, hand-milking, forehead against the cow's flank, the red vest the single point of color…

The campaign signature — locked on every shot

Camera / lens: Hasselblad X2D 100C · 75mm · f/4 · ISO 200 · 1/800s · 8K look · cinematic micro-contrast · no CGI look, no AI artifacts.
Color (Tarantino): high contrast · red vest as saturated hero accent · yellow Gääle as secondary note · near-black dark woods as contrast ground.
Audio (golden rule): real diegetic Alpine ambience + the sound of the action. Never music.
Duration: 2–3 s per shot (exception: the freezer, 4 s).

The anti-deformation doctrine

Central principle: minimum action = minimum risk. The less movement asked for, the less room the AI had to deform. Negatives were specific by living zone, never generic — cows (exactly four legs, correct joints, stable herd count, no fusing), hands (five fingers, natural grip), liquids (real behavior, no gelatin, no frozen splashes), steam and fire (natural rise, no strobing), and the face (stable identity across the clip, a smile with real muscle movement, never "creepy").

And each engine was written for differently: Seedance takes long negative lists split by zone (cheap, good for risky shots); Kling 3.0 prefers dense cinematic prose with physics stated as a positive ("exactly four legs with correct joints"), strongest on motion, fluids and multiple living elements; Cinema Studio 3.5 uses labeled call-sheet blocks for the best authorial finish, but choked on complex motion and had to hand a clapping gesture over to Kling.

// The premise of the film

Schwiizer Glace —
so schmeckt Himmel.

"Swiss ice cream — this is what heaven tastes like." After a hard morning's work, the z'morge is a reward well earned — and for an Appenzell farmer who has milked, drawn the honey, and made the cheese with his own hands, a spoon of his own ice cream at dusk is the line that closes the day. That's the whole film in one sentence: the pleasure is earned, and it tastes like Himmel.

// The creative concept

Einfach Himmel

The premise — Schwiizer Glace, so schmeckt Himmel. The farmer works his craft all day and, as evening falls, eats his own ice cream, made from what his hands harvested. The product doesn't interrupt the scene. It closes it. The ice cream at the end rhymes with the mid-morning breakfast (z'morge): pleasure arrives after the work, and it tastes like heaven.

The mother rules

// Truth as foundation

The Senn's real day

The day was built on official Swiss sources, not guesswork. The real morning order of an Appenzell Senn: wake at 3:30–4:00; tend yesterday's cheese (unmold, salt bath) and skim the evening milk before milking; bring the herd in; milk around 5:30–7:00; return the cows; make the cheese (käsen); clean the barn — and only then the z'morge, the mid-morning breakfast.

Breakfast doesn't open the day. It's the mid-morning reward — and that rhymes, structurally, with the ice cream at the end.

The same rule held for everything in frame: the dog is the Appenzeller Sennenhund (the local working breed, lying back, never in the bees); the cow is the Braunvieh (alpine brown), declared by breed in every prompt it appears in. An element entered only if it had a real root in the territory.

// Brand first, then the film

A brand, not a piece

It's worth remembering what this actually is: a practical exercise in Brand Elevation, tied to Himmel. We didn't start with a commercial. We started with the brand — the name, the positioning, the flavors, the territory, the world Himmel lives in. Only once that universe existed did we move it into a cinematic commercial, built scene by scene.

Anyone can make a piece. Few build the brand the piece comes from.

That's the difference that matters. A single ad is a piece — disposable, isolated, true only to itself. A brand is an ecosystem: a coherent world where every shot, every prop, every color already belongs because the world was defined first. This commercial reads as inevitable precisely because Himmel existed before the camera did. The film didn't invent the universe — it visited one that was already real.

And inside that universe, the shots followed a discipline. A shot is not an action — it's three lenses on the same action. "Farmer milking" isn't a shot; it's an idea. It breaks into Location (where we are — the establishing), Context (what the subject does — the medium), and Detail (the insert that drops you inside — the close-up). The milking, worked out: Location = barn interior with the row of cows; Context = the Senn from behind on the stool, facing the flank; Detail = the stream of milk hitting the steel bucket.

El ordeño
Shot 08 — milking. Three lenses, one action.
// The principle behind the pipeline

The shot dictates
the tool.
Never the reverse.

Engine, structure and technique changed with the risk and mechanics of each take. A "postcard" shot — subject nearly still — went to Cinema Studio for the finish. A "dangerous" shot — body movement, fluids, animals, bees — went to Kling or Seedance with armored negatives. Choosing the tool for the mechanics, not the habit, is what kept eighteen shots from four different generators reading as one film.

// Where mechanics beat aesthetics

Three impossible shots

Three shots broke the clean still→video path and demanded a technique used nowhere else in the film. Each one carries its real prompt — open them.

// Special case · motion transfer
Shot 17 · The Freezer
The most sophisticated case. I physically shot the take with my own iPhone, in my own kitchen — phone into the freezer, the real movement: door opening, the angle from inside, the timing. The base video carried the true mechanics; the prompt transferred that take onto the farmer and the hütte via Kling Motion. When a shot is hard to generate clean, act it out yourself and let the AI dress it.
// Special case · focus lock
Shot 09 · The Bees
The hives stay sharp; the subject enters from the right, from behind, and never comes into focus. The only risk was the engine "helpfully" racking focus toward him — so the prompt is built around forbidding a behavior (no focus pull, fixed entry direction), not requesting one.
// Special case · first→last frame
Shot 16 · The Fridge
Two photos given (closed fridge / nosing inside) and the engine interpolates the bridge between them, plus one extra photo as an identity anchor. Triple lock: back always turned, face never revealed. The prompt doesn't describe a free action — it describes a bridge between two stills.
Colmenas / sacando la miel
Shots 09–10 — the hives. The subject stays out of focus, on purpose.
// The voice

No cinematic spot lives without a voice

There's no cinematic product commercial without an enveloping voice that does its quiet magic — the warm, grounded narration that lands one potent line and lets the image breathe. Picture alone informs; the right voice is what makes a film felt.

The final voice for Himmel was made in ElevenLabs — a warm, weathered Swiss-German timbre that matches the Senn and the territory, closing on the line that carries the whole film: "Schwiizer Glace — so schmeckt Himmel." No music underneath, true to the golden rule; only the diegetic alpine ambience and that single, earned sentence.

// The result

The eighteen shots

Constants on all: Hasselblad X2D 100C · 75mm · f/4 · ISO 200 · 1/800s · 8K · Tarantino color · diegetic audio, no music · 2–3 s (except shot 17, 4 s).
01
House from afar

Wide establishing: hütte + barn under the Säntis, one lit window; the slowest possible push.

Blue hourSeedance
02
The vest

Medium from behind: the red vest goes on over the white shirt.

Pre-dawnSeedance
03
The earring

Close-up profile: the gold spoon-earring set into the right ear.

Oil-lampSeedance
04
Opening the door

Medium: the sliding barn door pushed to the left, weight visible.

Blue hourSeedance
05
Cows coming out

Medium: Braunvieh exit the gate toward camera, one hero cow in focus.

Pre-dawnSeedance
06
Feet & orbit

Extreme close-up: white socks + buckle shoe wet with dew, cows behind; camera orbits.

Misty morningSeedance
07
Looking at the horizon

Wide: the Senn in profile contemplates the valley; Braunvieh grazing, pink Alps.

SunriseKling 3.0
08
Milking

Medium 3/4 rear: on the stool, facing the flank; stream of milk to the bucket.

5:30 amSeedance
09
Reaching the hives

Locked-off f/1.8: hives in focus, the Senn enters from the right, from behind, out of focus.

MorningSeedance
10
Drawing the honey

Medium backlit: lifts the dripping comb, smoker, few bees, dog lying down.

Golden hourKling 3.0
11
The cheese boiling

Wide interior: copper cauldron over the fire, a column of steam backlit. No people.

Mid-morningKling 3.0
12
Stirring (the harp)

Top/shoulder close-up: hands with the Käseharfe cutting the curd, steam veiling the face.

Mixed lightKling 3.0
13
Inspecting the wheel

Medium: inspects the cheese wheel, two firm pats; rows of cheeses behind.

Cold cellarKling 3.0
14
The cured cheese

Extreme close-up: brush with brine in circles over the rind, little water; enters already moving.

CellarKling 3.0
15
Feeling the hunger

Medium, slight low angle: hand to the stomach, tired and sweaty, a contained gesture.

Golden afternoonKling 3.0
16
Raiding the fridge

Locked-off (first→last frame): enters from behind, opens, noses around. No face.

Warm + coldSeedance
17
Opening the freezer

Motion transfer, 4s, from my own iPhone: door opens → he looks → he smiles at what he finds.

4 secondsKling Motion
18
Eating the ice cream

Medium: leaning on the hütte door, spoon to mouth, Himmel tub in hand — the pleasure, finally, on the face.

Golden afternoonKling 3.0
El queso hirviendo en caldero de cobre
Shot 11 — the copper cauldron. No people, pure atmosphere.
El curado con cepillo y salmuera
Shot 14 — curing the rind. The brush enters already in motion.
// Post-production

The technical hand

Direction didn't end at the prompt. Where the tool failed at coherence or finish, technical knowledge stepped in. This is the point that defines the new workflow: creative direction fixes the what; applied technical skill, used where needed, guarantees the how. One without the other isn't enough.

Detail coherence (Photoshop)

AI is inconsistent on small details between takes. They were fixed by hand to keep the character continuous across the film: the embroidered Tracht showed a different cow in each photo — unified to one motif everywhere; the gold earring appeared, vanished, or jumped ears — locked to the right ear, as Appenzell Innerrhoden demands. Done only where necessary, never for gratuitous perfectionism.

Cinematography & grade (Lightroom)

The cinema feel didn't come whole out of the generator — it was directed in post, photo by photo: which lens each shot wanted (the 75mm signature, the right f-stops, and why — compression, separation, depth); a color grade per image to hold the Tarantino palette; grain, texture, and selective sharpening to sell a real 8K feel and pull it away from AI's flat finish.

The final cut

The finished film is eighteen separate videos, joined and edited into one — cut, sequenced, and harmonized with a unifying color grade across all of them so the shots from four different engines read as a single piece shot by one camera. Over the diegetic ambience sits a backing track that, in the end, is what gives the whole thing its soul: the score is what turns eighteen clips into a commercial that feels like something. Picture is the body; the music is the breath.

AI accelerates. It doesn't replace the eye that knows which lens, which color, which texture an image needs to look filmed.

// The close

It's not the bow.
It's the archer.

This project doesn't prove that AI makes commercials. It proves something narrower and more useful: that a director with judgment can coordinate, alone, what used to demand a whole team and a budget to match — direction, casting, art, photography, voice, post — with no actors, no location, no crew.

Order matters: judgment first, the tool second. Without knowledge of creative direction, photography, color, and brand coherence, these tools hand you flashy, useless material. With it, every prompt becomes a precise instruction, every engine is chosen by the mechanics of the shot, and every failure is caught and fixed before it reaches the screen.

The tools are extraordinary, and they're available to everyone — that's exactly why they're not the advantage. Anyone can draw the bow now. The work, the scarce and valuable part, is knowing where to aim. That's what this film is here to show, one shot at a time.

Comiendo el helado — clímax, tarde dorada
Shot 18 — the pleasure, finally. The reward.
Shot 08 · The Milking — Full Prompt
Create a realistic cinematic still based on the established Appenzell Senn character, the alpine stable location, and the Braunvieh cow reference. Photo 01 is the stable interior (location reference). Photo 02 is the Appenzell Senn (subject/character reference). Photo 03 is the Braunvieh cow (animal reference). Photo 04 is the milking stool and metal pail (prop reference). MAIN CONCEPT This is a medium shot from behind and slightly to the side — a three-quarter rear angle. The Appenzell Senn is seated on a low traditional three-legged milking stool (Melkstuhl), hand-milking a Braunvieh cow. His forehead rests against the cow's flank. His shoulders are working in a steady milking rhythm. We see his broad back; the red Tracht vest is the single point of color in a cool, dim scene. — the body is engaged in the rhythmic motion of milking — the head leans into the warm flank of the cow — the moment is quiet, physical, authentic — captured, not posed VERY IMPORTANT: angle is three-quarter rear (we see his back and the side of the cow); we do NOT see his face clearly (this is a "hands and labor" shot); the red vest reads as the only saturated accent. SUBJECT Use the Senn from Photo 02 as the identity reference. Preserve the authentic Sennentracht: yellow leather breeches (Gääle), red vest, white shirt; the broad, strong build of a working herdsman; same morphology and proportions. Seen mostly from the back — wardrobe and posture carry the identity, not the face. LOCATION Use Photo 01 as the exact stable interior. Preserve the row of tied Braunvieh cattle, wooden beams and posts, stone floor with realistic straw and wear, the rustic Anbindestall character, a single hanging bulb and cool blue light from an open door. Do not redesign it. Do not make it decorative. ANIMALS Use Photo 03 for the Braunvieh: realistic brown-gray coat, the hero cow presented from the flank where the Senn leans his forehead, believable dairy-cow morphology and scale. Do not stylize. Do not exaggerate anatomy. PROP Use Photo 04: a low traditional three-legged Melkstuhl beneath the Senn, a steel pail under the udder. Realistic, worn, tactile. FRAMING Medium shot, three-quarter rear angle. The Senn's broad back and working shoulders anchor the foreground; the flank of the cow he leans into; the steel pail below catching milk; the row of cattle falling out of focus behind. The red vest is the compositional focal accent. CAMERA / LENS Hasselblad X2D 100C · 75mm · f/4 · ISO 200 · 1/800s · 8K look · clinical sharp rendering · elegant compression · shallow depth of field · premium subject separation · rich cinematic micro-contrast · no CGI look, no AI-looking artifacts. LIGHTING / COLOR Cool, dim stable interior; blue light from the open door against a warm bulb. Tarantino-inspired bold contrast: the red vest a saturated edge accent, warm skin, deep soft background shadow. Heightened but believable realism. REALISM Anatomically correct back and arms in milking motion; believable Braunvieh (four legs, correct joints); coherent shallow-focus falloff; stable identity across the clip. No deformed hands, no warped animal anatomy, no surreal lighting. AVOID Clear frontal face in focus · bright flat daylight · washed-out colors · posed studio look disconnected from the scene · CGI appearance · extra or missing limbs on the cow. AUDIO (diegetic only — NO MUSIC) Cool stable ambience: the rhythmic hiss of milk striking the steel pail, low cow breaths, the shift of hooves on stone, distant cowbells, faint morning wind through the open door. Absolutely no music. FINAL RESULT A realistic cinematic three-quarter rear medium of the Appenzell Senn milking a Braunvieh by hand in a cool alpine stable, forehead to the flank, milk striking the steel pail, the red vest the single accent of color — continuity with Photo 01, character from Photo 02, animal from Photo 03, prop from Photo 04. Full campaign signature: Hasselblad X2D 100C, 75mm, f/4, ISO 200, 1/800s, 8K realism, bold Tarantino contrast.
Shot 17 · The Freezer — Motion Control (4s)
METHOD Base video shot physically on my own iPhone, inside my own kitchen freezer — the door opening, the angle from inside, the real timing. That take carries the true mechanics. The prompt below transfers it onto the Appenzell Senn and the hütte world via Kling Motion. PROMPT MOTION CONTROL — 4 seconds. Single continuous shot, locked-off camera positioned inside/at the freezer looking back out toward the subject. The motion is the freezer door opening and the subject's reaction. No camera movement. START STATE (0s) The white freezer door fills most of the frame, nearly closed, blocking almost all the light — only a thin sliver of warm kitchen light at the edge. Dim, cool. BEAT 1 (0–1.5s) — the door opens The white freezer door swings/pulls open smoothly and steadily, sweeping out of frame and letting warm kitchen light flood in. The Appenzell herdsman (red Tracht vest, white shirt) is revealed beyond, leaning toward the freezer. BEAT 2 (1.5–3s) — he looks in He leans in and looks down into the freezer, scanning its contents, calm and curious, his face now lit by the cool freezer light mixed with warm kitchen light. BEAT 3 (3–4s) — he finds it and smiles His eyes settle on something; a slow, warm, genuine smile spreads across his face as he finds what he was looking for. CRITICAL DEFENSE The final smile is the gesture most prone to coming out "creepy" — natural muscle movement, real warmth, stable identity across the whole clip, no distortion.
Shot 09 · The Bees — Focus Lock
PRINCIPLE The hives stay sharp; the subject enters and never comes into focus. The prompt is built around forbidding a behavior (no focus pull), not requesting one. PROMPT Cinematic shot. 2-3 seconds. Locked-off static camera, focus fixed on the bee houses. The only motion is the subject entering, plus the smallest natural life in the scene. SCENE A row of traditional Swiss painted wooden bee houses (Bienenhäuschen) at the edge of an alpine meadow, sharp and in crisp focus in the foreground/mid-ground. Soft daytime alpine light, green meadow and mountains soft behind. Use the uploaded photo as the exact location and framing reference (the bee houses stay the hero of the frame). Use Photo 02 as the subject identity reference (the Appenzell Senn: red Tracht vest, white shirt, yellow Gääle breeches). THE ACTION (clear and simple) — the shot begins clean: only the bee houses, in sharp focus, no person — the Appenzell Senn enters from the RIGHT side of the frame, walking in slowly, his BACK to the camera, looking toward the bee houses — he stays in the foreground/edge and remains COMPLETELY OUT OF FOCUS the entire time — a soft, blurred presence — the focus NEVER shifts to him; it stays locked on the sharp bee houses throughout (no focus pull, no rack focus) — he enters quietly, without demanding attention — faint natural life otherwise: grass barely moving, maybe a few bees drifting near the hives FOCUS / LENS Hasselblad X2D 100C, 75mm, at f/1.8 — very shallow depth of field. The bee houses are tack-sharp; the entering subject is a soft out-of-focus shape in the foreground. Focus remains fixed on the hives for the entire clip. LOOK 8K, crisp premium detail on the in-focus bee houses (painted wood, weathered texture), rich cinematic micro-contrast, no CGI look. Tarantino-inspired bold contrast: faded painted hive colors, rich alpine greens, the soft blurred red of the vest as a gentle color note.
Shot 16 · The Fridge — First→Last Frame
METHOD — interpolation between two stills This shot isn't described as a free action — it's described as a bridge between two fixed images. The engine interpolates the motion between a start frame and an end frame. INPUTS — FIRST FRAME: the kitchen with the fridge closed, the Senn approaching from behind (back to camera). — LAST FRAME: the Senn leaning into the open fridge, nosing through its contents, still from behind. — ANCHOR: one extra reference photo of the Senn for identity stability (Tracht, morphology). INSTRUCTION Generate the natural in-between motion that bridges the first frame to the last frame: the approach, the hand to the door, the door opening, the lean-in. Locked-off camera, no camera movement. TRIPLE LOCK — the subject's back is to the camera the entire time — the face is never revealed — identity stays stable across the bridge (no drift in build, Tracht, or hair) AVOID Any rotation that reveals the face · warped hands on the door handle · identity drift between the two frames · extra limbs · CGI look.
Next Project
FORMA
STUDIO
View Project →