Editor Training · Instagram Reels & Short-Form Only

From Okay
to Great — Fast

The CreatorGrow editing framework for Instagram Reels. Not theory — the actual process used to grow creators to 70K+ followers and 40M+ views. Work through each module in order.

⚠️ This training is for Instagram Reels & short-form only. YouTube editing and OnlyFans content editing are different workflows with separate training — check with coordination if your scope is different.
🎬 7 Modules
Short-form focus
Interactive checklists
Your progress 0 / 7 modules completed
01
👁️
Think Like the Viewer, Not the Editor
The mindset that separates average editors from ones who drive real results
Mindset
Before you touch a single cut, your first job is to watch the raw footage and ask one question: who is watching this, and what would stop their scroll? Everything else flows from that answer. Editors who skip this step produce technically clean videos that nobody watches.
🎯
The core question to ask before every edit"If I was scrolling at 11pm and this came up — would I stop? What format would make this land hardest?" Answer this before you open your editor.
🎬
Most of our content is NOT talking head — think in real scenariosA lot of editor training assumes the creator is sitting in front of a camera speaking to it. That's not usually what we're working with. You might receive:

· A 50-second clip of a creator walking in public, with a guy turning to look at her 15 seconds in
· An interview where the creator says something wild at the 1-minute mark
· B-roll of a travel moment with no dialogue at all
· A reaction, a reveal, a candid moment caught on camera

Your job is to find the best moment in that footage and build around it — not to treat it like a talking head edit. The reaction at 15 seconds? That's your hook. Start there. The crazy moment at minute 1? Cut everything before it or use it as a teaser. The footage tells you where the gold is — your job is to find it.
Heavy editing isn't always the answerSometimes the footage is already compelling — a look from the creator, a specific expression, a quiet moment that hits. You don't need to bury it in effects and cuts. If the first frame already stops you, let it breathe. The edit should serve the footage — not overpower it. Know when to get out of the way.
📱
Imagine the scroll
Your viewer is moving fast. You have 0.5–1.5 seconds to earn their attention. Edit knowing this every single time.
🎭
What's the format this wants to be?
Raw footage can become many things. Your job is to spot the best version of that footage — talking head, reaction, reveal, list, story. Choose intentionally.
🔄
Think in outcomes
You're not making a video. You're trying to get this creator more followers, more views, more fans. Every edit decision should serve that.
✓ Do this
  • Watch the full raw footage before cutting anything
  • Ask "what format suits this content?" before opening the timeline
  • Think about what the viewer gets out of watching
  • Consider whether text overlay or a reframe would change the impact
✗ Not this
  • Jump straight into cutting without a plan
  • Edit for the creator's preferences over the audience's experience
  • Assume the footage structure tells you what format to use
  • Make it look nice without thinking about retention
Quick Check
You receive raw footage of a creator explaining why she left her corporate job. What's your first move?
02
🪝
The Hook — The Only Part That Matters First
No hook = no views. How to engineer the first 1–3 seconds so they can't scroll past
Technique
The hook isn't just the first line of dialogue — and it's not always dialogue at all. It's the first visual, the first energy, the first feeling the viewer receives. A specific look from the creator. A reaction caught on camera. A moment of tension. If the first frame already stops you cold, that is the hook — don't edit over it. If it doesn't, your job is to find what does and move it to the front.
1
Identify the most compelling moment in the footage
Scan all your footage. What's the most shocking, funny, emotional, or surprising moment? That's often where your hook comes from — and sometimes it means starting in the middle of the story, not the beginning.
2
Open with action or conflict, never with setup
"Hey guys, so today I wanted to talk about..." is dead. Start mid-sentence, mid-motion, or with an immediate visual tension. The viewer's brain needs something to resolve.
3
Layer in a text overlay hook if the visual alone isn't enough
Text on screen in the first 2 seconds gives a second chance to capture scrollers who missed the audio or need one more reason to stay. Use it as a curiosity gap — the text should create a question in the viewer's mind that only the rest of the video can answer.

Strong curiosity gap examples:
· "I can't believe this actually happened"
· "She had no idea I was filming this"
· "Nobody expected what came next"
· "This changed everything for her"
· "I had to share this"
· "Wait for the reaction 👀"
· "This is why she went viral"

Notice: none of these describe what happens. They all create a gap the viewer has to close by watching. Describe the feeling or the stakes — never the content.
4
Watch the first 3 seconds back — cold
After editing the hook, close your eyes, wait 5 seconds, then hit play. Is your first reaction curiosity? Does it make you want to see what comes next? If you feel nothing, the hook needs work.
💡
Hook formats that work on our creatorsA visual reveal (hide then show), a bold statement before context, a question that creates a gap ("would you do this for $500?"), starting mid-action, or a raw emotional moment opening cold. Study what stops YOUR scroll and reverse-engineer it.
📱
The "stolen hook" technique — screen record + jumpscareIf the raw footage doesn't have a strong enough opening on its own, we sometimes screen-record a hook from a high-performing video on another account (one that already proven it stops scrolls), then use it as the opening 2–4 seconds in the Edits app. Our video then comes in right after — like a jumpscare cut — creating a jarring, attention-grabbing transition.

How to do it:
1. Find a video with a hook that already works (proven by views/engagement)
2. Screen-record just the hook moment (2–4 seconds max)
3. In the Edits app, place the screen-recorded hook first
4. Cut hard into our creator's footage — abrupt is better than smooth here
5. The contrast between the two clips is the hook. It should feel like a pattern interrupt.

⚠️ Use this when: the footage is strong but the opening few seconds aren't doing enough work on their own. Don't force it if the original hook is already solid.
✂️
Editing app stackWe primarily use the Edits app (Meta's native editor) — it's seamless with Instagram and makes publishing frictionless. CapCut and Final Cut Pro are also fine depending on what you're comfortable with. For most day-to-day Reels, Edits is the default. Use FCP when you need more precision or the edit is complex.
Did I start with action or tension — not with setup or a greeting?
Is there something in the first 1–3 seconds that makes you want to see what happens next?
Did I consider whether a text overlay in the first 2 seconds would strengthen the hook?
Have I watched back the first 3 seconds cold and felt genuine curiosity?
03
✂️
Pacing, Dead Space & The Cut
Every frame should earn its place. How to edit for momentum and viewer retention
Technique
Dead space kills videos. Any moment where nothing is happening — a breath held too long, a pause, a reaction shot that runs a second too long — gives the viewer permission to leave. Your job is to make every frame feel intentional and keep the energy moving forward.
The core rule: zero dead spaceIf nothing is happening — cut it. If the energy dips — cut it. If there's a pause that feels longer than it should — cut it. No mercy. You can always test if you cut too tight by watching it back at 1.25x speed.
🎬
Cut on beats, not on sentences
Cut at the natural end of a thought or action — not necessarily where the sentence ends. Find the rhythm. If there's music, cuts should feel like they land on the beat.
🔍
Zoom in/out as a retention tool
Zooming in or pulling out is a visual cut without a hard edit. Use it to re-engage attention when the pacing of a shot is starting to feel flat — typically every 3–5 seconds in a long shot.
⏱️
Test at 1.25x playback
Play back your rough cut at 1.25x speed. If it feels fast but still comprehensible, you're probably in the right zone. If it feels fine at 1.25x, you may need to cut tighter at 1x.
Have I removed all pauses, breaths, and filler ("um", "like", "so yeah") from the dialogue?
Have I used zoom in/out to keep long shots visually active?
Played back at 1.25x — does the pacing still feel intentional?
Are there any moments where I "feel" myself getting bored — even slightly? (Cut those.)
04
✍️
Text Overlays — Making It More Compelling
When, where, and how to use text to amplify — not distract
Technique
Text overlay is not decoration — it's a tool. The question is always: does this text make the viewer more likely to keep watching or take action? If yes, add it. If it just restates what's being said, it's noise.
1
Hook overlay — amplify the curiosity gap
In the first 2 seconds, text can catch a scroller who missed the audio or who needs one more reason to stay. Frame it as a curiosity gap, not a title: "She had no idea this was coming" not "My reaction".
2
Mid-video emphasis — highlight the punchline or key point
If a key moment lands in the middle of the video, text can be used to amplify it. Time the text to appear slightly before the spoken punchline so it pre-loads the brain.
3
CTA or comment-bait overlay — end with intention
Text at the end can drive a specific action: a question that invites comments, a "follow if you agree," or a stat/claim that invites debate. This is where comment-baiting lives.
✓ Effective text
  • Creates a curiosity gap or emotional amplification
  • Adds context the visuals alone can't provide
  • Times precisely to the energy of the moment
  • Uses the brand font (Advercase or approved alternative)
  • Clean, readable — positioned to not block the creator's face
✗ Weak text
  • Just restates what the creator is saying word for word
  • Stays on screen too long and becomes background noise
  • Clutters the frame — too many elements fighting
  • Random font that doesn't match the creator's brand feel
📍
POV text — place the viewer inside the scene"POV:" hooks work because they don't just describe what's happening — they make the viewer imagine themselves there. This triggers emotional immersion instantly. Use it when the footage has a strong sense of place, experience, or situation the viewer would want to be in (or relates to).

Examples from our creators:
· "POV: you just arrived in Bali for the first time"
· "POV: your first day in Asia"
· "POV: you married the crazy Asian"
· "POV: she doesn't know you're watching"

The formula: POV: [situation the viewer is placed into]. It should feel aspirational, relatable, or intriguing — never boring. Pair it with visuals that immediately match the scenario so the brain makes the connection in under a second.
🔤
Font standard (current)We're using Advercase as the primary font across most creators right now. This creates a consistent brand feel on the page. Any font change needs to be intentional and consistent across that creator's account — don't mix fonts within the same creator's content.
05
🎵
Sound Design — SFX, Music & AI Audio
Sound is 50% of the edit. How to use it to drive emotion and keep viewers in
Technique
Sound effects and music are retention tools. A well-placed whoosh, a beat drop at the right moment, a subtle notification sound — these cue the viewer's brain to pay attention. Use them intentionally, not as decoration.
🔊
SFX amplify moments
A cut that lands on a sound effect feels more intentional than a silent cut. Use whooshes on zooms, impact sounds on text appearances, and notification sounds on reveals.
🎶
Background music drives energy
Music sets the emotional tone before the viewer is consciously aware of it. Match the energy: upbeat for fun content, subtle/tense for drama, none or soft for emotional moments.
🤖
AI voice when needed
For voiceovers, use ElevenLabs to generate AI audio that matches the creator's tone. Useful for POV or narrated content where you need clean voice over b-roll.
🎵
Music can do most of the heavy lifting — don't underestimate itSometimes the right track turns an average edit into something that feels cinematic, funny, or emotional without a single extra cut. The music isn't background — it's often the reason someone watches to the end or shares it.

How we find music:

· Search Instagram by keyword — type a word related to the video's theme into Instagram's search bar (e.g. type "money" if the video is about money, "crazy" if the vibe is chaotic, "boss" for an aspirational feel). Browse the audio that comes up — you'll often find trending sounds that fit perfectly and already have algorithm momentum behind them.

· Save audio from other viral creators — when you're scrolling and a video hits, save the audio. Build a personal library of sounds that have already proven they work on viral content. When you get a new edit, browse your saved audio first before searching from scratch.

· Match mood, not just genre — a slow, emotional piano under a candid moment can make it feel cinematic. A hyperpop beat under a reaction can make it feel chaotic and fun. Don't default to whatever's trending. Ask: what emotion does this footage already have — and what music amplifies that?

· YouTube Audio Library / Pixabay for clean, royalty-free options when you need something specific and can't find it on Instagram.
Have I added at least one SFX to reinforce a key moment (cut, zoom, text reveal)?
Did I check my saved audio library or search Instagram by keyword before defaulting to a random track?
Does the music match the emotional tone of the footage — or is it fighting it?
Is the creator's voice clearly audible above the music (music ducked under speech)?
If voiceover was needed, did I use ElevenLabs or a clean source instead of low-quality mic audio?
06
📈
The Full Reel Structure: Hook → Retention → Payoff
Plus: comment baiting and rage baiting to drive algorithm engagement
Strategy
Every high-performing Reel follows the same underlying structure: something that stops the scroll, something that makes them stay, and something that makes them feel it was worth watching. Then something that makes them want to comment or re-watch. Build this into every edit.
H
Hook — Stop the scroll (0–3 seconds)
Covered in Module 02. Visually or textually arresting. Creates immediate curiosity. Starts mid-action or with a bold claim. Never with setup.
R
Retention — Give them a reason to stay (3 seconds → near end)
Tease the payoff without delivering it. Use pattern interrupts — a zoom, a cut to B-roll, a sound effect, a new text overlay — to reset attention every 3–5 seconds. This is where pacing and SFX do the heavy lifting. The viewer should feel like they can't leave yet.
P
Payoff — Deliver on what you promised (final 5–10 seconds)
The punchline. The reveal. The resolution. Whatever the hook implied — it has to land. A weak payoff teaches the algorithm that your viewers felt disappointed. Make it satisfying.
💬
Comment baiting — build it into the editThe goal is to give viewers something to respond to. This can be: a debatable claim in text overlay ("this is why most men fail at dating"), a question at the end ("would you do the same?"), a subtle mistake or blank left for people to fill in, or a prop/caption that triggers curiosity or mild controversy. Comments and shares are the highest-value engagement signals. Every edit should have at least one intentional comment bait element.
😤
Rage baiting — use carefully but powerfullyA mild controversy or unpopular take in the text or closing line can drive disagreement comments, which the algorithm reads as high engagement. Example: "most people are too lazy to do this" or "if you're not doing X you're leaving money on the table." It shouldn't be mean — just slightly provocative. The key is it has to feel genuine, not manufactured.
Does the video have a clear hook (0–3s), a sustained retention period, and a satisfying payoff?
Did I add at least one intentional comment-bait element (question, debatable claim, or blank to fill)?
Is there a pattern interrupt every ~3–5 seconds in the middle of the video to re-engage attention?
Does the payoff actually deliver on what the hook implied?
07
🎨
Brand Cohesion — The Creator's Page as a Portfolio
Why consistency across a creator's feed is a growth strategy, not just aesthetics
Standard
When someone lands on a creator's profile after watching a Reel, the page has to look like a coherent brand — not a collection of random videos. Consistent visual identity signals trust and professionalism, which increases follow-through. Your edits are building a brand, not just individual videos.
🔤
Current brand standard: AdvercaseWe are using Advercase as the primary font across most creators. Keep it consistent. If a creator needs a different font, it must be a deliberate brand decision — not a one-off choice on a single video. When in doubt, use Advercase.
🔡
Same font, always
Switching fonts between videos fragments the visual identity. Choose one and stick to it for all text overlays across all that creator's content.
🎨
Consistent colour treatment
Text colours, highlight colours, and any graphic elements should feel like they belong to the same system. If you're using white text with an orange shadow — do that consistently.
Energy should match the creator's brand
A fun, bubbly creator and a serious, aspirational one need different editing energy — even if both follow the same structural framework. Know which one you're editing for.
✓ Cohesive edits look like
  • Same font across all videos for that creator
  • Consistent text placement (e.g., always centered, always top-third)
  • SFX and music style that matches the creator's established tone
  • Color grading that's consistent — not different from video to video
✗ Fragmented edits look like
  • Different font every video
  • Text in different positions each time, no visual pattern
  • One video feels dramatic, the next feels playful — no connection
  • Color grading swings wildly between warm and cool
Am I using Advercase (or the creator's approved font) — not a random alternative?
Does this video feel visually consistent with the last 3–5 videos on this creator's page?
Does the overall energy and editing style match this creator's established brand feel?
If I made any style choices that differ from the norm — did I flag them for review before finalising?
Final Check
You're editing a new creator who hasn't worked with us before. There's no established visual style yet. What do you do?
🎬

Pre-Export Checklist — Run This on Every Video

Before you export and deliver any edit, go through this final check. This is the standard every video leaves CreatorGrow with.

Back to Hub