BLOG OVERVIEW

The YouTube Script Template We Use on Every Client Video (With Framework)

In

YouTube

by

Edward Wood

Most YouTube videos don't fail because the information is bad. They fail because the script doesn't earn the viewer's attention sentence by sentence. I've reviewed hundreds of video scripts across client channels at Humble&Brag, and the pattern is remarkably consistent: the topic is right, the production is decent, the presenter knows their subject, but the script bleeds viewers because it doesn't follow the structure that retention demands.

The good news is that structure is learnable. The best-performing creators on YouTube follow the same underlying framework, and once you see it, you can't unsee it. This guide gives you that framework as a practical template you can use to write or brief your next video.

Before You Write: The Story Spine

Before opening a document and outlining sections, write five lines. Just five. These aren't your script. They're the emotional core that holds everything together: the thread you return to when the project starts to feel directionless.

Situation: Where are we? What's the starting reality? ("Most adults who want to learn piano never start.")

Desire: What does the viewer want to achieve? ("They want to sit down and play a song they love, start to finish.")

Conflict: What gets in the way? ("Every time they try, they hit a wall of scales, theory, and the feeling that it's too late.")

Change: What shifts? What's the insight or turning point? ("When they stop learning piano and start learning songs, something clicks.")

Result: How does it end? How is the world different? ("Twenty minutes a day later, they're playing Clocks by Coldplay and wondering why they waited so long.")

If you can write these five lines clearly, you have a script. Expand to ten lines, then twenty, then the full video. The core doesn't change; it just gets richer. If you can't write them, the idea isn't ready yet, and it's better to know that before you've booked a filming day.

This also works as a rescue tool mid-project. When a script starts to sprawl, come back to the five lines and ask whether each section connects to one of them. If it doesn't, cut it.

Lock Your Title Before You Write

Your title is a promise. The script's job is to fulfil it and then exceed it, which is why you need to finalise the title before writing a single sentence. Every decision in the script, what to include, what to cut, how to structure the hook, flows from the expectation the title sets. If the title changes after the script is written, the script is broken.

Strong title formats include pain-point-led titles that name a situation the viewer is in ("How Long Does It Really Take to Learn Piano as an Adult?"), contrarian titles that challenge a common belief ("Why Most People Learn Piano the Hard Way"), outcome-led titles that promise a specific result ("How to Play Your First Song on Piano With No Experience"), and number titles that signal structured value ("4 Piano Songs That Sound Impressive But Are Surprisingly Easy"). We've tested these patterns extensively across client channels, and our title formulas guide covers the data on which formats perform best.

After locking the title, write out the four to six questions your viewer will have after reading it. Your hook and introduction should answer all of them directly and early.

How to Write a Script: The Four-Hat Process

Most people sit down and try to write a script from start to finish. This is why scripts take 10 hours and feel painful. The better approach, one advocated by George Blackman (the scriptwriter behind Ali Abdaal's biggest videos) and one we use at Humble&Brag, is to separate the writing process into four distinct passes. Each pass has a different job, and you should not try to do two at once.

Hat 1: The Artist (Idea Dump). Brainstorm every possible point, example, anecdote, and visual you could include. Don't filter. The goal is to find your "Grand Payoff," the single most satisfying moment the viewer clicked to see. If you don't know what that moment is, the script isn't ready.

Hat 2: The Architect (Structure). Build the skeleton using Setup-Tension-Payoff loops (more on these below). A well-structured 10 to 15 minute video will have five to seven of these loops. Critically, write the payoffs first, not the setups. This is counterintuitive, but it forces you to confirm that the video actually delivers value before you write a single word of the setup that promises it. Once your payoffs are mapped, the setups become much easier to write because you know exactly what you're building toward. Check the overall flow at this stage. It's much cheaper to restructure a skeleton than a full draft.

Hat 3: The Writer (Draft). Now connect the dots. Fill in the tension sections between each setup and payoff. Don't worry about perfect phrasing. Just get a complete draft on paper. Momentum matters more than polish at this stage. If you get stuck on a word, leave a placeholder and keep going.

Hat 4: The Wizard (Retention Edit). This is the final pass where you optimise for retention. Cut jargon. Simplify sentences. Check that curiosity gaps aren't being closed too early or left open too long. Read the script aloud and cut anything you'd never say in conversation. This is where you add visual cues for your editor: on-screen text, B-roll notes, and chapter markers.

The four-hat process typically cuts writing time by 40 to 50 per cent compared to linear writing because each pass has a single, clear objective. You're never trying to brainstorm, structure, write, and edit simultaneously.

The Five-Part Script Structure

1. The Hook (5 to 30 Seconds)

The hook has roughly 20 seconds to do three things: validate the click (confirm the viewer is in the right place), raise the stakes (why this matters right now), and open the first curiosity loop (hint at the Grand Payoff without giving it away).

Use this three-part formula.

Context lean-in. Establish what the video is about and connect it to something the viewer already cares about. The right viewers will recognise themselves in it. "Most adults who want to learn piano never start. Not because they're not musical, but because everything they've tried made them feel like the problem was them."

Scroll stop. A single line using a contrast word (but, however, here's the thing) that signals something unexpected. "But here's what no one tells you..."

Contrarian snapback. A statement that goes in an unexpected direction. The bigger the gap between what they expected and what you say, the more they want to keep watching. "The problem isn't that you're a slow learner. It's that you've been learning piano the wrong way round."

Keep hook sentences short. Nothing longer than ten words. Don't open with credentials, "Hey guys, welcome back," or vague teases. If your retention curve shows a steep drop in the first 15 seconds, the hook is where to focus your rewriting.

One more principle on hooks: don't try to write them first. Write them last, after the body is complete. Blackman's reasoning, which matches our experience, is compelling: writing a hook that perfectly sells the value of a script that doesn't exist yet is nearly impossible. You usually end up rewriting it anyway. When the body is done, the hook almost writes itself because you know exactly what the video delivers.

2. The Introduction (30 to 60 Seconds)

The intro earns the viewer's commitment to watch the full video. It should validate their experience ("If you've ever felt like..."), name the problem precisely rather than generally, promise the payoff ("By the end of this video, you'll have..."), and establish credibility briefly, led by relevant experience rather than job title.

If you have a standard brand introduction (presenter name, role, company), put it at the end of the introduction rather than the beginning. Viewers are more likely to care about credentials once they're already engaged.

This is also where you can establish the video's visual structure: showing the three to five topics you'll cover as on-screen text. Research from the psychology of waiting lines confirms that people tolerate longer waits when they know the structure and progress. The same applies to video: viewers stay longer when they can see where the video is headed.

3. The Body (Setup-Tension-Payoff Loops)

Each body segment follows a three-beat structure called a Setup-Tension-Payoff loop. A 10 to 15 minute video should contain five to seven of these loops.

Setup: Open with a specific claim that creates a curiosity gap and establishes recognisable stakes. Tell the viewer what this segment will address, then tell them what it costs them to ignore it. Compare these two openings: "Next, let's talk about your evening routine. Most sleep experts recommend avoiding screens before bed" versus "Next, let's talk about your evening routine. Because you might be doing something in the two hours before bed that single-handedly causes you to feel exhausted the next day." The first announces a topic. The second creates stakes. The viewer stays because they need to find out if they're making that mistake.

Tension: This is the section between the setup and the payoff, and it's where most scripts fail. The mistake is treating tension as withholding information. It isn't. Tension is the process of teaching the viewer new information that builds understanding without revealing the payoff prematurely. Blackman's three-step formula for tension (which he discusses in detail on the Creator Science podcast with Jay Clouse): first, show the viewer their current behaviour (the wrong approach); second, explain why that behaviour doesn't work (the mechanism behind the problem); third, gradually reveal what they should do instead through contrasting examples. Each step is genuinely educational, but the specific payoff, the concrete takeaway, comes only at the end.

Payoff: Deliver the concrete answer, then connect it to the larger journey. "When you start with a song instead of a scale, the first session stops feeling like a test and starts feeling like music. That's the shift that keeps people going." The payoff catches viewers up and prepares them for the next loop.

Put your second-best point first and your best point last, establishing a pattern of ascending value. Between loops, create a brief transition hook: "That's the thing that makes the first week click. But there's a second shift most beginners never make..."

Around 60 to 70 per cent of the way through, add a mid-video re-hook: "Before I get to the last piece, which is the most counterintuitive part..." Viewer attention tends to dip in the final third. A reminder of what's still coming pulls it back.

4. The Summary (30 Seconds)

Briefly recap three to five key takeaways. End with something empathetic: "This isn't about talent. It never was. You just needed the right starting point." Don't introduce new information here.

5. The Call to Action (15 to 30 Seconds)

Most end-screen CTAs are wasted. "If you liked this video, check out this one" gives the viewer no concrete reason to click. Blackman's three-step CTA formula is far more effective.

Link: Refer back to something specific covered in this video. "We covered why most beginners give up after the first month."

Curiosity gap: Open a new question connected to that topic. "But there's a second wall that hits around month three, and it's the reason people who started strong never actually finish a full song."

Promise: Give the specific transformation the next video delivers. "So check out this video where I show you the one practice change that gets you past it."

This formula turns the end screen into a setup for a new segment, except the segment lives in the next video. It's the same structure as every Setup-Tension-Payoff loop in the body, just pointed at the next piece of content.

For product CTAs, frame the product as the natural next step in solving the problem the video addressed, not as a separate pitch. We covered how to structure CTAs in descriptions and through lead magnet positioning in detail.

The Psychology That Makes It Work

Everything in this framework is grounded in how the brain processes information while watching video.

Dopamine and the curiosity loop. The brain releases dopamine when it anticipates an answer to an open question, and again when the answer arrives. The hook opens the first loop. Each body segment opens and closes subsequent loops. The viewer stays because each resolved question is immediately replaced by a new one. Retention data from Retention Rabbit's 2025 benchmark report confirms that 55 per cent of viewers are lost within the first 60 seconds when the opening fails to create this loop effectively.

Anticipation beats information. Viewers don't leave because they're bored of your topic. They leave when they stop anticipating what's coming next. Every sentence should make the next one feel worth hearing.

Value ascending, not descending. Viewers are constantly, and largely unconsciously, assessing whether the video is worth their time. If each point is better than the last, they stay. If value peaks in the first minute and declines, they drift, and your retention curve will show it as a steady downward slope.

Tone is trust. The more conversational your delivery feels, the more trust you earn. The best YouTube scripts don't read like articles. They read like a knowledgeable friend talking across a table. Read your script aloud. If you'd never say it that way in conversation, rewrite it.

Common Mistakes

Starting with the channel intro. Earn the viewer's attention before introducing yourself. Credentials before engagement is the fastest way to lose viewers in the first 10 seconds.

Vague opening lines. "Today we're going to talk about something really important" is not a hook. State the problem or the surprising angle immediately.

Padding for length. Every line should earn its place. A tighter 8-minute video nearly always outperforms a padded 15-minute one. We see this consistently in retention benchmarks: shorter, denser videos hold higher average percentage viewed. AIR Media-Tech's research on retention editing confirms that pacing needs to be tightest in the first three minutes and can gradually relax once the viewer is committed.

Lists instead of stories. A list of tips is easy to forget. The same tips delivered through contrast, conflict, and consequence stick. Use the story spine for every example, not just the video's overall arc.

Multiple CTAs. One per video. A viewer with two options to click tends to choose neither.

The Template: Copy and Adapt

Writing order: Payoffs first, then setups, then tension, then hook last.

Hook (5-30s): Context lean-in + scroll stop + contrarian snapback. Validate the click, raise the stakes, open one curiosity loop. Written last.

Intro (30-60s): Validate experience, name the problem, promise the payoff, credentials (brief, at the end). Show visual structure (3-5 topics on screen).

Body (5-7min): 5-7 Setup-Tension-Payoff loops. Ascending value order. Setups with recognisable stakes. Tension that teaches without revealing. Payoffs that deliver and connect. Transition hooks between loops. Mid-video re-hook at 60-70%.

Summary (30s): 3-5 key actions or phrases. Empathetic close. No new information.

CTA (15-30s): Link back to this video + new curiosity gap + specific promise for next video.

If you want help scripting videos for your channel, or want to understand how script structure connects to your broader content strategy, get in touch.

Join our Humbleweed Community

Oh, and you’re very welcome to join our Humbleweed Community of YouTube experts and aspiring experts. It’s free, fun, and packed full of the kind of cutting-edge social video chat you’ll love.

Join our Humbleweed Community

Oh, and you’re very welcome to join our Humbleweed Community of YouTube experts and aspiring experts. It’s free, fun, and packed full of the kind of cutting-edge social video chat you’ll love.

Join our Humbleweed Community

Oh, and you’re very welcome to join our Humbleweed Community of YouTube experts and aspiring experts. It’s free, fun, and packed full of the kind of cutting-edge social video chat you’ll love.