The Process · Will Sartorius · AI Creative Consulting

CHAPTER 0 · HOW TO READ THIS

Three colors, three roles in the system.

Every file you'll see has one of three tags. The tag tells you whether the file is a fixed part of the engine, a piece I customize for your brand at install, or an output that gets refreshed every batch from your Meta account. Knowing which is which is the difference between "this thing runs" and "this thing runs for your brand."

SHARED ENGINE

Same for every brand.

Universal pipeline. Pipeline doc, the 9 copy agents, agent 10, the audit template, the safe-zone block. I don't rewrite these per client. They've already been hardened across every brand the system runs, and yours inherits all of it on day one.

BRAND-SPECIFIC

Customized for your brand at install.

Brand bible, voice agent, brand spec card, visual style card, persona library, banned-word list. Built once during month one, then read by every batch forever. This is the part that makes the system speak your brand, not generic DTC.

DATA-DRIVEN

Refreshed every batch from your Meta.

The data brief. EXTEND / RETIRE / FILL / NET-NEW buckets. On-screen text from winners and losers. Trending hooks. This is the file that makes round 5 different from round 4, because round 4's results are inputs to round 5.

TL;DR: The engine is shared. The brand layer is yours. The data layer feeds itself. Three months of work installs all three. After that, the system can run a batch a week with light calibration, by any seat on your team.

CHAPTER I · PHASE A · BRAND SETUP

One-time install + per-batch data pull.

Two jobs in this phase. First, the one-time-per-brand install: scrape the site, build the brand bible, lock the voice agent, render the spec cards. Run this once in month one, never again. Second, the per-batch data pull: ask Meta what worked and what didn't, get back a structured brief that drives every slot in the next slate.

The one-time install lands in week 1. The per-batch data pull lives forever, runs at the start of every batch, and is where every year of historical Meta data finally pays you back.

SHARED ENGINE

A0 — SETUP_NEW_CLIENT.md

The onboarding flow.

When Claude reads this file, it enters new-client install mode. It asks for the brand name, shortcode, website URL, category, vertical, and any extra materials (founder interviews, prior briefs, brand guidelines). Then it walks the install in order: scrape the site, build the brand bible, draft the voice agent, render the brand spec card and visual style card, populate the product catalog.

Run once per brand, in week 1 of month one. Everything downstream reads from the artifacts this file produces. Skip it and the rest of the pipeline has nothing brand-specific to anchor to.

▶ HOW IT GETS CUSTOMIZED Inputs: your brand name, your shortcode, your e-commerce URL, your vertical (closest category bucket: Beauty / Fashion / Food & Beverage / Wellness / Fitness / Pet / Home / Tech), plus any founder content, brand guidelines, or prior winning campaigns you want fed in. One sitting.

SHARED ENGINE

A1 — Product and Logo Scraper.md

Scrapes every product, every logo.

Targets Shopify out of the box. Pulls every product (title, price, description, ingredients, variants, all photos) into Product Assets/{handle}/, and every logo + favicon + apple-touch-icon into Brand Kit/logo/. Each product folder has its photos sitting next to a description.md and meta.json, plus a master catalog.md that links the whole thing together.

This is the file that means every future ad uses real product photography, not stock, not invented bottles, not a generic substitute.

▶ HOW IT GETS CUSTOMIZED Run against your e-commerce site once. Re-run any time you launch a new SKU. The product catalog Phase D pulls reference photos from is whatever this scraper last produced.

SHARED ENGINE

A2 — Color and Font Scraper.md

Pulls your exact hex, your exact type.

Fires up a real headless Chromium so it can read JS-rendered SPAs and resolve CSS variables to actual values. Output is a JSON of every color used on the site (with frequency), every font stack, the type hierarchy, and the spacing tokens. Saved into Reference/ as design tokens.

Why it matters: the brand spec card in Phase B is built from these tokens, not from guesses. The proposal site, the spec ads, the visual style card all reference your real hex codes, not a designer's read of "looks about right."

▶ HOW IT GETS CUSTOMIZED One run against your site, output saved into your brand folder. From then on, every ad knows your real palette and typography.

DATA-DRIVEN

Data Pull / pull-meta-data.mjs

The script that asks Meta what worked.

One round-trip to your Meta account. Returns the top 20 ads by spend, bottom 20 by ROAS, action-plan recommendations (extend / retire / test / fill), per-dimension learnings (format / persona / angle / emotion), the on-screen text from every winner and loser, plus a Voice-of-Customer rollup from Reddit and a calendar-moment lookahead.

Then it writes DATA_BRIEF.md: the markdown brief Phase B reads to allocate the next 14 slots. EXTEND / RETIRE / FILL / NET-NEW buckets, each row with a "Pair with" companion and (when relevant) a "Trending hook" with a ship-by deadline.

▶ THE MONTH-ONE BUILD Month one builds your version of this script: it hits Meta's Marketing API directly with your token, ingests every available month of historical ads (the last ~37 months via API, plus older data via one-time CSV export), classifies them on the same taxonomy, and writes DATA_BRIEF.md into your repo. The output stays the same. The source is yours.

What you've installed by the end of Phase A: your brand's product catalog, brand kit, design tokens, and a recurring data brief. Every year of ad history is now legible to the system, not buried in your account.

CHAPTER II · PHASE B · 14-CONCEPT SLATE

Where the data brief turns into 14 specific ads.

This is the brain. Phase A delivers the data brief. Phase B turns that brief into 14 concrete ad slots, each with a format, a persona, an angle, an emotion, and a specific copy hook. Then it audits the slate before a single image is generated, because catching a clustering problem here costs nothing, and catching it after Fal renders 14 images costs real dollars.

The slate is allocated by buckets. Extend (3-4 winners to riff on), Fill (3-4 vertical-gap tests), Net-New (3-4 mandatory untested combos), Open (the rest). The Net-New floor is what stops the pipeline from converging on a local maximum after 6 batches.
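The bucket math above can be sketched in a few lines of Python. This is an illustration only: the function name, the dictionary keys, and the choice to raise an error on an out-of-range bucket are assumptions, not the pipeline's actual code.

```python
SLATE_SIZE = 14
BUCKET_RANGE = (3, 4)  # Extend, Fill and Net-New each take 3-4 slots


def allocate_slate(extend: int, fill: int, net_new: int) -> dict[str, int]:
    """Return slot counts per bucket; Open absorbs whatever is left of the 14."""
    counts = {"EXTEND": extend, "FILL": fill, "NET-NEW": net_new}
    low, high = BUCKET_RANGE
    for bucket, slots in counts.items():
        if not low <= slots <= high:
            raise ValueError(f"{bucket} needs {low}-{high} slots, got {slots}")
    counts["OPEN"] = SLATE_SIZE - sum(counts.values())
    return counts
```

With 4 Extend, 3 Fill and 3 Net-New slots, Open gets the remaining 4. A Net-New count of 2 is rejected outright, which is the floor doing its job.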

SHARED ENGINE

B1 — BATCH_GENERATION_PIPELINE.md

The full pipeline doc. Phases 1 through 7 inside.

The end-to-end playbook for shipping a graded batch of statics. Phase 1 is concept slate (the 14-row table). Phase 2 is variant selection (which style of UGC, which style of headline). Phase 3 is the visual diversity audit. Phase 4 is the 9-agent copy refinement. Phase 5 is generation via Fal. Phase 6 is Agent 10 grading. Phase 7 is iteration on anything below 90.

This file is what Claude reads to know what comes next. Every other file in Phases B-G is referenced by this one. If you only opened one file in the whole system, this is the one.

▶ HOW IT GETS CUSTOMIZED Untouched. Inherited as-is. The brand-specific layer (voice agent, brand bible) plugs in at the read points, but the playbook itself is the same playbook used to ship 60+ statics a month for every brand running on the engine.

SHARED ENGINE

B2 — Asset Types _README.md

The diversity quotas.

Every batch of 14 must satisfy: ≥2 platform-native UGC, ≥1 macro close-up, ≥1 outdoor location, ≥1 type-only, ≥1 chaotic tablescape. Without these quotas, every batch comes out 14 variations of the same warm editorial mood, because the model and the writer both default to whatever the brand's "house style" already is.

The quotas are the antidote. They force at least 6 of the 14 slots into visually distinct registers, even when the brief and the brand voice would naturally cluster. Plus the variant-rotation rule: same archetype can't run two batches in a row at the same Style letter.

▶ HOW IT GETS CUSTOMIZED Same quotas apply for every brand. Especially at scale, where cluster-fatigue is a real risk if 60+ ads/month all default to the same mood.
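A rough sketch of how the quotas and the variant-rotation rule could be checked, in Python. The register labels and both function names are made up for illustration; only the minimum counts and the rotation rule come from the description above.

```python
from collections import Counter

# Minimum slots per visual register in a batch of 14 (labels are hypothetical).
QUOTAS = {
    "platform_native_ugc": 2,
    "macro_close_up": 1,
    "outdoor_location": 1,
    "type_only": 1,
    "chaotic_tablescape": 1,
}


def quota_shortfalls(slot_registers: list[str]) -> dict[str, int]:
    """Map each under-filled register to the number of slots it still needs."""
    counts = Counter(slot_registers)
    return {
        register: need - counts[register]
        for register, need in QUOTAS.items()
        if counts[register] < need
    }


def repeated_styles(
    previous: set[tuple[str, str]], current: list[tuple[str, str]]
) -> list[tuple[str, str]]:
    """(archetype, style letter) pairs that also ran in the previous batch."""
    return sorted(set(current) & previous)
```

An empty result from both functions means the slate clears the quotas and the rotation rule. Anything else names the register or pair to fix.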

SHARED ENGINE

B3 — AUDIT_TEMPLATE.md

The slate-level audit. Two passes.

Copied into every batch as AUDIT.md. Filled in twice. §1 Visual Diversity Audit runs at the end of Phase B: 14-row table mapping each slot's surface bucket, lighting bucket, primary subject, color cast, plus cluster-cap checks (max 3 per surface, max 4 per lighting, max 5 single-can-hero) and an adjacent-pair check. §2 Voice Diversity Audit runs at the end of Phase C: same shape, but for voice register, anchor type, and source citations.

The reason this exists: the per-ad rubrics catch per-ad problems. Slate-level cluster failure is a different bug, where 14 ads each pass their own rubric but render at thumbnail scale as the same warm-editorial mood. The audit template is the file that surfaces that pattern before generation runs.

▶ HOW IT GETS CUSTOMIZED Used every batch. The audit fills itself in once Claude has the slate locked.
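The §1 cluster caps lend themselves to a mechanical check. Below is a minimal Python sketch, assuming each slate row is a dict with hypothetical keys "surface", "lighting" and "subject". The caps come from the text above. The adjacent-pair rule is not spelled out there, so the version here (neighbouring slots must not share both surface and lighting) is a guess.

```python
from collections import Counter

CAPS = {"surface": 3, "lighting": 4}  # max slots sharing one bucket value
MAX_SINGLE_CAN_HERO = 5


def audit_visual(slate: list[dict]) -> list[str]:
    """Return human-readable findings; an empty list means the slate passes."""
    findings = []
    for field, cap in CAPS.items():
        for value, used in Counter(row[field] for row in slate).items():
            if used > cap:
                findings.append(f"{field} '{value}' used {used}x (cap {cap})")
    heroes = sum(1 for row in slate if row["subject"] == "single_can_hero")
    if heroes > MAX_SINGLE_CAN_HERO:
        findings.append(f"single-can-hero used {heroes}x (cap {MAX_SINGLE_CAN_HERO})")
    # Assumed adjacent-pair rule: neighbours must differ in surface or lighting.
    for slot, (a, b) in enumerate(zip(slate, slate[1:]), start=1):
        if (a["surface"], a["lighting"]) == (b["surface"], b["lighting"]):
            findings.append(f"slots {slot} and {slot + 1} share surface and lighting")
    return findings
```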

SHARED ENGINE

Asset Types Library / 16 archetype files

The format library. One file per archetype.

Each file has a Style Selection Matrix with 4-9 styles inside (Style A, B, C, …) plus rules for when to use which. Phase 2 of the pipeline reads these to pick a specific variant per ad. The 16 archetypes:

▶ UGC

Platform-native creator-style. iMessage, Reddit, Notes, Slack composites + retro-photo register.

▶ BEFORE / AFTER

The transformation arc. Setup, payoff, contrast forced to read at thumbnail scale.

▶ FOUNDER'S STORY

First-person voice with attribution line. Quote marks + dash, never buried in subhead.

▶ SOCIAL PROOF

Reviews, ratings, customer voices. Real specific numbers, never round invented ones.

▶ STATISTICS

One number, big, defended by a citation. Treatment-claim risk gates apply.

▶ HEADLINE

Type-as-the-message. The wordmark IS the brand mark. No can required.

▶ TESTIMONIALS

Single voice, full quote, attribution. Different rhythm from social-proof aggregations.

▶ FEATURES & BENEFITS

The classic. Bullet rhythm, parallel structure, cut weak verbs.

▶ BULLET POINTS

Tight list register. 3-5 items, short, no commentary.

▶ HANDWRITING

Caveat-style, casual, friend-to-friend. Use sparingly. Big tonal shift.

▶ NEWS

Editorial article skin. Native-camouflage. CTA pill OFF.

▶ PRESS

Magazine clipping. Pull quotes, masthead lockup, body copy. Long-copy register.

▶ NEGATIVE MARKETING

What we don't do, what we won't sell. Earned negation, not edgy for edgy's sake.

▶ OFFER / PROMOTION

The offer pill carries the deal. CTA + Logo both ON. Different rules from default.

▶ US VS THEM

Comparison fulcrum word on its own line. "Vs." stacked, italic, ~50% size.

▶ RETRO UGC

Retro-photo carve-out for illustrated brands. Late-90s digital-camera character.

▶ HOW IT GETS CUSTOMIZED You inherit all 16. Month one customizes which Style letters in each archetype map to your brand's visual conventions, what your "before / after" should and shouldn't look like (claim language that triggers FDA / FTC risk gets ruled out), and which archetypes get over-indexed for your category.

What runs in Phase B: data brief in, 14-row slate out, visual diversity audit (§1) passed. The voice diversity audit (§2) follows at the end of Phase C. Nothing has been generated yet. Every problem caught here is caught for free.

CHAPTER III · PHASE C · COPY REFINEMENT

Nine sequential agents. Every ad scores ≥90 on every rubric.

Each agent is a markdown file with a role, a rubric, and a scoring loop. Claude reads the file, takes on that role, scores the ad, returns a numeric score plus specific rewrites. Iterate until the ad clears 90. Then hand to the next agent. Nine passes total. This is the cheap end of the pipeline. Catching a fabricated stat or a banned word here costs pennies. Catching it after Fal renders the image costs dollars. Catching it after the ad goes live costs the campaign.
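The score-rewrite-rescore loop can be sketched as below, in Python. In the real pipeline each agent is Claude reading a markdown file. Here an agent is stood in for by any function that takes copy and returns a score plus a rewrite. The `Scorer` type, the function name and the `max_rounds` limit are all assumptions for illustration.

```python
from typing import Callable

PASS_MARK = 90

# Stand-in for one agent: takes the current copy, returns (score, rewritten copy).
Scorer = Callable[[str], tuple[int, str]]


def run_copy_gate(
    copy: str, agents: list[tuple[str, Scorer]], max_rounds: int = 5
) -> tuple[str, dict[str, int]]:
    """Pass the copy through each agent in order, iterating until it clears 90."""
    scores: dict[str, int] = {}
    for name, score_fn in agents:
        for _ in range(max_rounds):
            score, rewrite = score_fn(copy)
            if score >= PASS_MARK:
                break
            copy = rewrite  # adopt the agent's rewrite and score again
        else:
            raise RuntimeError(f"{name} never reached {PASS_MARK}")
        scores[name] = score
    return copy, scores
```

The copy that leaves one agent is the copy the next agent sees, so later passes never grade a version an earlier pass already rejected.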

SHARED ENGINE

01_persona_fit.md

Does this speak to the persona?

A consumer-psychologist agent. Inputs a persona definition (age, role, pain points, language patterns, worldview) and the ad brief. Scores 1-100 across 6 dimensions: language fit, pain-point fit, worldview match, evidence type, social context, register. Returns specific rewritten copy where the score lags.

▶ HOW IT GETS CUSTOMIZED Month one builds your persona library: your buyers segmented by use-case, prior-experience level, age cohort, value language. Agent 01 reads from that library, not generic DTC personas.

SHARED ENGINE

02_angle.md

Is the angle sharp, or a category cliché?

Scores whether the ad's angle (the specific argument it's making) is differentiated, defensible, and matches the awareness level of the target traffic. Catches "natural ingredients" and "made for you" type drift back into category-default messaging.

▶ HOW IT GETS CUSTOMIZED Tuned to your category's defaults so it knows what to push against. The agent gets the list of category clichés as part of the brand bible.

SHARED ENGINE

03_emotion.md

Does the ad earn the emotion it's evoking?

Maps the ad to a target emotion (frustration, relief, validation, curiosity, FOMO, pride) and grades whether the copy + scene + persona combination actually generates that emotion or just claims it. The "claim vs. earn" distinction is the whole rubric.

▶ HOW IT GETS CUSTOMIZED Pulls from your brand voice agent for which emotional registers your brand has earned the right to play in.

SHARED ENGINE

04_copy_excellence.md

Is the copy actually well-written?

Pure craft pass. Active verbs, parallel structure, cut filler, cut hedges, cut weak modifiers, headline rhythm, line-break placement. Independent of brand and persona. Catches the "this technically scans but reads flat" problem.

▶ HOW IT GETS CUSTOMIZED Universal. Same rubric whether the brand is scalp-care or supplements or footwear.

SHARED ENGINE

05_format_compliance.md

Does the copy match the chosen format?

If Phase 2 picked "Style C of UGC archetype," does the copy actually read as platform-native UGC, or is it editorial copy crammed into a UGC visual? Cross-checks the chosen Style's rules against the actual copy block. Rejects mismatches.

▶ HOW IT GETS CUSTOMIZED Engine-level. Same rubric for everyone.

BRAND-SPECIFIC

06_brand_compliance.md

Banned words, hard rules, brand voice.

The agent that reads from your brand voice agent file, not a universal rubric. Banned words (yours, not mine). Hard rules ("never claim treatment of a medical condition"). Voice register (warm-clinical vs. casual-friendly vs. founder-podcast). Brand-swap test: would this ad read as your brand or could a competitor have shipped it?

This is the most brand-specific agent in the system. Month one spends real time getting it right, because it gates every ad downstream.

▶ HOW IT GETS CUSTOMIZED Built in month one from your founder voice, your past winners, your category constraints (FDA / FTC / category-specific compliance), and your team's input on what's true to your brand and what isn't. Custom banned-word list. Custom voice registers. Custom hard rules.

SHARED ENGINE

07_kahneman_heuristics.md

System 1 vs. System 2. Does it land in the first second?

Behavioral-econ pass. Anchoring, loss aversion, social proof weight, framing effects, availability heuristic. Grades whether the ad pulls a System-1 reaction in the first 1-2 seconds (the only window Meta gives you), or whether it relies on System-2 cognitive work the viewer won't do.

▶ HOW IT GETS CUSTOMIZED Universal. The cognitive science doesn't care what category you're in.

SHARED ENGINE

08_static_conversion.md

Will it actually convert?

The hardest pass. Grades against a checklist of conversion-killers: weak hook, missing offer, unclear CTA, scene-format mismatch, social-proof drop, awareness-mismatch (cold traffic getting Most-Aware copy). Returns specific fixes, not just scores.

▶ HOW IT GETS CUSTOMIZED Same rubric, but the pass is informed by your historical winners, so "what conversion looks like for this brand" is concrete, not abstract.

SHARED ENGINE

09_ad_reviewer.md

The final-pass ad reviewer.

Composite pass that holds the ad against the previous 8 scores and asks: would a senior creative director ship this? Catches the "every individual rubric passed but the ad still feels off" problem. Final gate before Phase D burns Fal credits.

▶ HOW IT GETS CUSTOMIZED Engine-level. The CD voice is universal.

The hard gate: every ad clears ≥90 across all 9 rubrics, or it doesn't go to Phase D. Most ads go through 2-3 iteration rounds. Some need 5. The 9-agent loop is what makes the per-ad copy quality independent of who's running the batch.

CHAPTER IV · PHASE D · IMAGE GENERATION

Fal calls. Two models. Three blocks pasted into every prompt.

Once the slate is locked and the copy has cleared the 9-agent gate, Phase D actually renders the images. The pipeline picks between GPT Image 2 (default, photographic credibility) and Nano Banana 2 (override, illustrated / halftone / multi-product). Each prompt is assembled from three universal blocks plus the per-ad scene language.

SHARED ENGINE

GPT_IMAGE_2_PIPELINE.md

The end-to-end generation playbook.

Brief input → prompt assembly → Fal API call → finished PNG. Single source of truth for how Claude builds a generation prompt. Includes the model picker (GPT2 vs. NB2 by visual register), the multi-SKU rendering ceiling (both models top out at ~2 distinct SKUs in one frame), the aspect-ratio pre-check (reference images must be ≤3:1 or Fal hard-fails), and the 4-attempt retry pattern.

Why this exists: the same brief assembled two different ways produces wildly different outputs. This file locks the assembly so the variance is in the brief, not in how the brief was prompted.

▶ HOW IT GETS CUSTOMIZED Inherited as-is. The decision tree applies to your category like any other.
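Two of the guards named above, the aspect-ratio pre-check and the 4-attempt retry, are simple enough to sketch in Python. Nothing here calls Fal: `render` is a placeholder for whatever function performs the actual request. Applying the 3:1 limit in both orientations is an assumption, and so is retrying with an unchanged prompt.

```python
from typing import Callable

MAX_ASPECT = 3.0   # reference images must be at most 3:1
MAX_ATTEMPTS = 4


def aspect_ok(width: int, height: int) -> bool:
    """True if the longer side is at most 3x the shorter side."""
    long_side, short_side = max(width, height), min(width, height)
    if short_side <= 0:
        raise ValueError("image dimensions must be positive")
    return long_side / short_side <= MAX_ASPECT


def generate_with_retry(
    render: Callable[[str], bytes], prompt: str, attempts: int = MAX_ATTEMPTS
) -> bytes:
    """Call render(prompt) up to `attempts` times; re-raise if every call fails."""
    last_error = None
    for _ in range(attempts):
        try:
            return render(prompt)
        except Exception as exc:  # placeholder: real code would narrow this
            last_error = exc
    raise RuntimeError(f"all {attempts} attempts failed") from last_error
```

Running `aspect_ok` on every reference photo before the first call turns a hard failure at generation time into a cheap rejection up front.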

SHARED ENGINE

SAFE_ZONE_BLOCK.md

Keeps headlines out of Instagram's UI overlay.

Pasted verbatim at the top of every prompt. Defines the 840×1350 safe rectangle inside a 1080×1920 frame, with explicit pixel coordinates, plus a "rows 1-2 and 9-10 must be empty" simple version for the model to fall back on. Without this block, GPT Image 2 routinely places text in the top 400px Instagram-overlay zone. With it, the headline stays legible across Stories, Reels, and Feed.

▶ HOW IT GETS CUSTOMIZED Universal. Pasted into every prompt.

SHARED ENGINE

PRODUCT_FIDELITY_BLOCK.md

Forces real-world product proportions.

Models default to making 12oz cans look like 16oz tallboys, supplement bottles look chunkier than they are, pouches float weirdly. This block locks proportions to the uploaded reference photo and provides explicit real-world dimensions. Pasted after the safe-zone block, before the per-ad scene language.

▶ HOW IT GETS CUSTOMIZED Tuned to your product proportions during month one, then inherited by every batch. The reference photo is whatever Phase D picked from your product catalog.

What lands at the end of Phase D: 14 PNGs. Not graded yet. Nothing leaves Phase D for the client until Phase E says ≥90 across the board.

CHAPTER V · PHASE E · AGENT 10 HARD GATE

The single biggest quality lever in the system.

Agent 10 reads every rendered PNG, scores it across 11 gates and 58 dimensions, and flags anything below 90 for re-roll. Nothing ships to your Meta account without all 14 ads clearing this gate. This is the file that turns "we generated 14 ads" into "we generated 14 ads that actually perform."

SHARED ENGINE

10_creative_grader.md

11 gates, 58 dimensions, 0-100 composite.

The agent simulates the actual cognitive journey of a person being served the ad: thumb-stop, hook recognition, claim digestion, social-proof weighting, action engineering, brand recall. 11 gates in sequence. Gates 0-9 are universal (native camouflage, hook strength, copy clarity, format compliance, credibility, persuasion, action, brand fit, conversion math, polish). Gate 10 is the brand-performance overlay: scores the ad's classification (format × persona × angle × emotion) against the action plan from Phase A. An ad whose dimensions the brand has already proven don't work gets capped at 75, no matter how good the craft is.

This is the gate that catches: safe-zone violations, dropped headlines, paper-inset artifacts, fabricated stats that survived Phase 4, banned-word leakage, brand-fit failures the per-ad rubrics missed, AND ads built on dimensions the data has already retired.

▶ WHY GATE 10 MATTERS MOST Gate 10 is what your historical mining feeds into. Every winner from your Meta history is a row in your action plan. Every loser is a row in the avoid list. Gate 10 reads from there. The longer the system runs, the smarter Gate 10 gets, because every shipped ad updates the action plan.
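The Gate 10 cap reduces to one rule: a retired classification cannot score above 75. Here is a minimal Python sketch, assuming a classification is a (format, persona, angle, emotion) tuple and the retired combinations are held in a set. How Agent 10 actually computes the underlying composite is not shown.

```python
RETIRED_CAP = 75

Classification = tuple[str, str, str, str]  # (format, persona, angle, emotion)


def apply_gate10_cap(
    composite: float, classification: Classification, retired: set[Classification]
) -> float:
    """Cap the composite at 75 when the ad's dimensions are already retired."""
    if classification in retired:
        return min(composite, RETIRED_CAP)
    return composite
```

Because the pass mark is 90, a capped ad can never ship. It has to be rebuilt on different dimensions.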

SHARED ENGINE

PERFORMANCE_FEEDBACK_LOOP.md

How shipped ads get their performance back into the system.

The closed loop, in plain English. Every ad that ships generates Meta performance data. This file defines what gets logged per ad (Brief ID, brand, format, style, persona, angle, headline hook, scene, the actual rendered copy), how it gets logged, and where it goes back into the action plan so future briefs are informed by what actually worked, not just what looked good in generation.

This is the self-iteration mechanism. Without this file, the pipeline ships pretty ads. With it, the pipeline gets smarter every batch.

▶ THE MONTH-THREE HANDOFF MOMENT The feedback loop installs in month three. By the end of the engagement, every ad that runs on your Meta is being read back into the system within 24 hours, automatically classified, and pushed into the action plan that drives the next batch's data brief. The system literally cannot ship the same batch twice.
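The per-ad log entry described above maps naturally onto a small record type. A sketch in Python follows. The field list comes from the text, but the class name, the snake_case field names and the one-JSON-object-per-line format are assumptions.

```python
import json
from dataclasses import asdict, dataclass


@dataclass(frozen=True)
class AdLogRecord:
    """One shipped ad, classified on the same dimensions the data brief uses."""
    brief_id: str
    brand: str
    format: str
    style: str
    persona: str
    angle: str
    headline_hook: str
    scene: str
    rendered_copy: str

    def to_json_line(self) -> str:
        """Serialize as a single JSON line, ready to append to a log file."""
        return json.dumps(asdict(self), ensure_ascii=False)
```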

SHARED ENGINE

QC_CHECKLIST.md

30-second human eyes-on pass.

Run on every generated image before Agent 10 scores it. Hard-fail items: instruction leakage (font names, hex codes, "Hook:" labels visible on the image), safe-zone violations, multi-panel collages, product fidelity errors, label misspellings. Any single fail blocks the ad. Designed to take 30 seconds per image.

Why a human pass when Agent 10 exists: because Agent 10 is comprehensive but slow. The QC checklist is fast and catches the 80% of obvious failures before Agent 10 spends cycles on them.

▶ HOW IT GETS CUSTOMIZED Run by whoever owns the batch that week. Universal checklist, doesn't change per brand.

The hard gate restated: 14 ads, all ≥90 on Agent 10, no caps triggered. If even one fails, the batch isn't done. Iterate, regenerate, re-grade. Non-negotiable. This is the rule that makes "shipped 14 statics" mean the same thing every time.

CHAPTER VI · PHASE F + G · SHIP

Final delivery + resize. The audit trail that closes the loop.

Once all 14 ads clear Agent 10, Phase F moves them into the final-delivery folder structure. Phase G resizes to 1:1, then re-grades through Agent 10 (because aspect-ratio shifts can drift typography or trim CTAs). What ships to your Meta account is two files per concept: the 9:16 source-of-truth and a 1:1, both ≥90.

SHARED ENGINE

PHASE F · Deliver (folder structure)

Final delivery + audit trail.

No standalone playbook, just file moves into Final Delivery/9x16/ with the canonical naming convention ([BRAND]_[Concept]_V[1-N]_9x16.png). Alongside the PNGs, four artifacts always go into the same folder: the locked generation script, the Agent 10 report, the filled-in audit, the data brief that drove the slate. That four-artifact set is the audit trail. Next batch reads them.

▶ HOW IT GETS CUSTOMIZED Identical structure regardless of source. Every batch leaves the same four-artifact trail.
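The naming convention can be enforced with a short helper. A Python sketch follows. The pattern is taken from the convention above, but the allowed characters (letters and digits for the brand, plus hyphens for the concept) are an assumption, and "ACME" in the usage note is a placeholder brand.

```python
import re

# [BRAND]_[Concept]_V[1-N]_9x16.png, with assumed character classes.
FILENAME_RE = re.compile(
    r"^(?P<brand>[A-Za-z0-9]+)_(?P<concept>[A-Za-z0-9-]+)_V(?P<version>[1-9]\d*)_9x16\.png$"
)


def delivery_name(brand: str, concept: str, version: int) -> str:
    """Build a delivery filename and reject anything that breaks the pattern."""
    name = f"{brand}_{concept}_V{version}_9x16.png"
    if not FILENAME_RE.match(name):
        raise ValueError(f"not a valid delivery filename: {name}")
    return name


def parse_delivery_name(name: str) -> dict[str, str]:
    """Split a delivery filename back into brand, concept and version."""
    match = FILENAME_RE.match(name)
    if match is None:
        raise ValueError(f"not a valid delivery filename: {name}")
    return match.groupdict()
```

For example, `delivery_name("ACME", "FounderStory", 2)` gives `ACME_FounderStory_V2_9x16.png`, and parsing that name returns the three parts as strings.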

SHARED ENGINE

RESIZE_WORKFLOW.md

9:16 source → 1:1, re-graded.

Universal resize playbook, single rulebook for getting approved 9:16 ads into 1:1 sizes. Reads from Final Delivery/9x16/ verbatim, runs each PNG through Fal's gpt-image-2/edit with NB2 fallback, drops outputs into Final Delivery/1x1/. Refuses to overwrite without a --force flag, so re-running is safe. Critically: every 1:1 output gets re-graded through Agent 10. Aspect changes re-shape the safe-zone constraints (1:1 has different blocked rows than 9:16), and resize models occasionally drift typography or trim a CTA. Anything below 90 falls back to manual.

▶ HOW IT GETS CUSTOMIZED Used every batch. 9:16 + 1:1 ship by default. (4:5 only when a specific batch needs it, documented as exception.)
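The overwrite guard is the part of the resize step that makes re-runs safe, and it can be sketched without touching any image model. This Python version only plans the work. The `_1x1.png` output suffix, the function names, and the choice to skip existing files rather than abort are assumptions.

```python
import argparse
from pathlib import Path


def build_parser() -> argparse.ArgumentParser:
    parser = argparse.ArgumentParser(description="Plan 9:16 to 1:1 resizes")
    parser.add_argument("--force", action="store_true",
                        help="overwrite 1:1 outputs that already exist")
    return parser


def plan_resize(src_dir: Path, dst_dir: Path, force: bool) -> list[tuple[Path, Path]]:
    """List (source, destination) pairs, skipping existing outputs unless forced."""
    jobs = []
    for src in sorted(src_dir.glob("*_9x16.png")):
        dst = dst_dir / src.name.replace("_9x16.png", "_1x1.png")
        if dst.exists() and not force:
            continue  # already resized; leave it alone
        jobs.append((src, dst))
    return jobs
```

Each planned pair would then go to the resize model and on to Agent 10 for re-grading, as described above.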

End state, every batch: 14 concepts × 2 aspect ratios = 28 ad files, all ≥90 on Agent 10, plus a four-artifact audit trail that the next batch reads from. This is what "shipped" means.

CHAPTER VII · WHY THIS HOLDS UP

Five things to take away.

DATA-DRIVEN

1 · The data brief is the brain.

Without it, every batch is a creative judgment in a vacuum. With it, every slot maps to a specific signal: extend a winner, kill a loser, fill a vertical gap, test a new bet. Your Meta history feeds the brief on day one.

SHARED ENGINE

2 · The audits run before images are generated.

Visual diversity (§1) and voice diversity (§2) get caught at the slate level, not after $50 of Fal credits is burned on a clustered batch. Cheap end of the pipeline.

SHARED ENGINE

3 · There are two hard gates.

9-agent copy review (Phase C, ≥90) gates Phase D. Agent 10 creative grader (Phase E, ≥90) gates delivery. Either fails, the batch doesn't ship. This is the rule that makes quality independent of who's running the batch.

DATA-DRIVEN

4 · The Net-New bucket is non-negotiable.

3-4 of every 14 slots must be combos the brand has never tested. Skip it and the slate just optimizes what already worked, which is how every brand's creative converges into one mood after 6 batches. The Net-New floor is what stops your creative from going stale at scale.

SHARED ENGINE

5 · Every batch leaves an audit trail.

Data brief + Agent 10 report + audit + locked generation script. Next batch reads them. The system literally cannot ship two identical batches, because round N reads round N-1's results before deciding what's worth making.

The agents are the system. The audits are the system. The feedback loop is the system. This page is the system, file by file. No abstractions. No slides. Just the actual files Claude reads when it runs.

EVERY DOC. EVERY AGENT. EVERY AUDIT.

If this looks like the system your team is missing.
