YouTube Thumbnail Generator
youtube-thumbnail
Generates three YouTube thumbnail options optimized for click-through rate using Canva, applying high-contrast colors, emotional expressions, and concise text overlays. Use when a user needs a thumbnail for a YouTube video, wants to A/B test thumbnail designs, or needs click-optimized visual assets at 1280x720.
- This skill, packaged and ready to upload. youtube-thumbnail.zip
- In claude.ai or Claude desktop: Customize → Skills (+) → Create skill → Upload a skill, select the zip and toggle it on. Greyed out? Enable code execution under Settings → Capabilities.
- It’s live in your chats — no code, no setup. Want every Content skill at once? Add the whole plugin from the Content page (Customize → Personal plugins → Create plugin → Upload plugin).
/plugin marketplace add Salah-XD/equipt
/plugin install equipt-content Installs the whole equipt-content plugin — this skill included.
npx @equipt/cli init
npx @equipt/cli add youtube-thumbnail Adds just this skill to your Claude Code project.
3 CTR-optimized options per video. 1280x720 JPG export. High-contrast colors, emotional faces, 3 words max. User picks one, refine, deliver.
When to Use This Skill
Use this skill when you need to:
- Create a click-optimized YouTube thumbnail for an upcoming or published video
- Generate 3 distinct thumbnail concepts to A/B test or pick from
- Apply CTR best practices (contrast, emotion, minimal text) to a thumbnail design
- Refresh an underperforming video's thumbnail with a higher-CTR alternative
DO NOT use this skill for:
- YouTube channel art or banners (different dimensions and goals)
- Video editing, intros, or end screens
- Social media graphics for non-YouTube platforms (use social-media-graphics instead)
Quick Reference: Thumbnail Specifications
| Spec | Value | Why It Matters |
|---|---|---|
| Dimensions | 1280 x 720 px | YouTube required minimum; 16:9 aspect ratio |
| Max file size | 2 MB | YouTube upload limit |
| Format | JPG | Smallest file size at high quality; YouTube standard |
| Text overlay | 3 WORDS MAXIMUM | Must be readable at 168x94 (mobile suggested video) |
| Font size | Fills 30-40% of thumbnail width | Legibility at all screen sizes |
| Safe zone | Key elements in center 80% | Edges get cropped on some devices |
| Color contrast | Minimum 4.5:1 text-to-background | Visibility on white and dark YouTube backgrounds |
| Faces | Visible, emotional, eye contact | Faces with strong emotion lift CTR 30-40% on average |
Quick Reference: CTR Best Practices
| Practice | Impact | Implementation |
|---|---|---|
| High-contrast colors | CRITICAL | Complementary pairs: yellow/black, red/white, blue/orange |
| Emotional face expression | CRITICAL | Surprise, excitement, shock, curiosity — exaggerated beats subtle |
| 3 words or fewer | CRITICAL | Every extra word reduces readability at small sizes |
| Rule of thirds | HIGH | Face on one third, text on the opposite third |
| No clutter | HIGH | One focal element + one text element maximum |
| Color pop background | MEDIUM | Solid or gradient in a bold color behind the subject |
| Outline/stroke on text | MEDIUM | White or black stroke ensures readability on any background |
| Avoid red/green together | LOW | Color-blind viewers cannot distinguish these |
THE #1 RULE: IF THE THUMBNAIL IS NOT READABLE AT THE SIZE OF YOUR THUMB, IT WILL NOT GET CLICKED.
Quick Reference: Thumbnail Style Types
| Style | Layout | Best For | Text Placement | Visual Focus |
|---|---|---|---|---|
| Reaction | Face on left third, text on right | Commentary, reactions, vlogs | 1-2 bold words beside face | Exaggerated facial expression |
| Tutorial | Step visual on right, text on left | How-to, walkthroughs, guides | Action verb + subject ("Edit FASTER") | Screenshot or tool visual |
| Listicle | Large number left, subject montage right | Top 5, rankings, comparisons | Bold number as anchor element | Collage of items being ranked |
| Story | Dramatic full-bleed image, text overlay | Challenges, experiences, storytime | Question or cliffhanger centered | Cinematic photo or dramatic moment |
| Before/After | Split screen, left vs right | Transformations, results, reviews | Labels for each half | Clear visual contrast between halves |
| Bold Text | Text dominates 60%+ of frame | Hot takes, opinions, announcements | Statement fills the canvas | Minimal imagery, maximum text impact |
Core Workflow
GENERATE 3 DISTINCT THUMBNAIL OPTIONS WITH DIFFERENT STYLE APPROACHES — NEVER 3 VARIATIONS OF THE SAME CONCEPT.
Step 1: Gather Video Details
Collect before generating anything:
- Video title — exact or working title of the video
- Topic/subject — what the video is about in one sentence
- Style preference — preferred style from the table above, or let the skill pick 3 contrasting styles
- Face/person — include a face? If yes, describe expression and framing
- Brand colors — Canva brand kit or manual hex codes
- Competing thumbnails — thumbnails they want to stand out from (optional)
If the user provides items 1-2, proceed with defaults for the rest.
Brief template for vague requests:
I'll generate 3 thumbnail options. Quick details needed:
1. Video title?
2. What's it about? (one sentence)
3. Preferred style? (Reaction / Tutorial / Listicle / Story / Bold Text — or I'll pick 3)
4. Include a face? (describe expression if yes)
5. Use your Canva brand kit? (Y/N)
Step 2: Load Brand Kit from Canva
- Call
list-brand-kitsto retrieve available brand kits - Select matching kit or ask user to choose if multiple exist
- Note brand colors, fonts, and logo references for generation prompts
IF NO BRAND KIT EXISTS:
- Ask for primary color, secondary color, and preferred font weight (bold/heavy recommended)
- DEFAULT PALETTE IF NO PREFERENCE: Yellow (#FFD700) text on dark navy (#0A0E27) background with white (#FFFFFF) stroke — high visibility on both light and dark YouTube interfaces
Step 3: Generate 3 Thumbnail Options
Generate 3 thumbnails using 3 different style approaches. Each must be a completely different creative direction.
Option selection logic (when no user preference):
- Commentary/opinion videos: Reaction + Bold Text + Story
- How-to/educational videos: Tutorial + Listicle + Bold Text
- Challenge/experience videos: Story + Reaction + Before/After
- Review/comparison videos: Listicle + Before/After + Tutorial
For each option:
- Condense the video title to 3 words or fewer for overlay text
- Build generation prompt with: overlay text, brand colors, style layout, 1280x720 landscape, high-contrast treatment, text stroke/outline
- Call
generate-designwith the prompt - Call
get-design-thumbnailto preview
Example prompt — Reaction style:
YouTube thumbnail, 1280x720, landscape orientation.
Style: Reaction thumbnail.
Layout: Exaggerated surprised face on the left third of frame.
Background: Solid bold red (#FF2D55) with subtle radial gradient.
Text overlay on right side: "NO WAY" in Impact font, white (#FFFFFF) with
thick black stroke outline. Text fills 35% of frame width.
High contrast. Bold colors. Clean composition. No clutter.
Rule of thirds composition. Safe margins on all edges.
Example prompt — Tutorial style:
YouTube thumbnail, 1280x720, landscape orientation.
Style: Tutorial thumbnail.
Layout: Software screenshot or tool visual on the right two-thirds.
Text on left third: "Edit FASTER" in Bebas Neue, yellow (#FFD700) on
dark navy (#0A0E27) block. White stroke on text.
Arrow pointing from text to the visual.
High contrast. Clean layout. One focal point. No clutter.
Example prompt — Bold Text style:
YouTube thumbnail, 1280x720, landscape orientation.
Style: Bold text-dominant thumbnail.
Layout: Text fills 60% of the frame, centered.
Text: "$10K" in massive Impact font, bright yellow (#FFD700).
Background: Dark navy (#0A0E27) solid.
Subtle money/cash imagery at 20% opacity behind text.
White stroke outline on all text. Extremely readable at small sizes.
Present all 3 options with style name, layout description, design ID, and thumbnail preview. Wait for the user to pick a favorite before proceeding.
IF THE USER DISLIKES ALL 3:
- Ask what element was closest and what feels wrong
- Generate 2 new options incorporating the feedback
- If 5 total attempts miss, stop and provide specs for manual Canva creation
Step 4: Refine the Selected Thumbnail
Once the user picks a favorite:
- Ask if refinements are needed (text wording, colors, repositioning, add/remove elements)
- If refinements requested:
- Call
start-editing-transactionon the selected design ID - Call
perform-editing-operationswith the specific changes - Call
get-design-thumbnailto preview the edit - If approved: call
commit-editing-transaction - If not approved: call
cancel-editing-transactionand repeat
- Call
- If no changes needed, proceed directly to export
REFINEMENT LIMIT: Maximum 3 rounds of edits. After 3 rounds, regenerate from scratch with all feedback baked into the prompt rather than continuing to patch.
Step 5: Export at 1280x720 as JPG
- Call
get-export-formatsto confirm JPG availability - Call
export-designwith: design ID, formatjpg, dimensions 1280x720 - Verify: file under 2 MB, dimensions exactly 1280x720
- Present the final export with file name, dimensions, format, size, and export URL
FILE NAMING: youtube-thumbnail-{topic-slug}.jpg — all lowercase, hyphens only
DEFAULT FORMAT: JPG — keeps files under YouTube's 2 MB limit. Use PNG only if user requests it or design is purely text/vector.
Example 1: "5 Tools Every Solopreneur Needs" — Tutorial Thumbnail
User brief: "Publishing a video called '5 Tools Every Solopreneur Needs' about productivity apps. Professional but eye-catching. Brand colors: teal and white."
Execution:
Details: Title: "5 Tools Every Solopreneur Needs" | Topic: productivity apps | Style: auto-select Tutorial + Listicle + Bold Text (educational) | No face | Brand: teal/white
Brand kit:
list-brand-kitsreturns "Solopreneur Studio" — teal (#00BFA6), white (#FFFFFF), dark gray (#1A1A2E). Loaded.3 options generated:
- Option A (Tutorial): "5 TOOLS" in white on teal block left, app screenshot mockup right.
generate-design+get-design-thumbnail. - Option B (Listicle): Massive "5" in white left, app icon grid on dark gray right, teal accent line.
generate-design+get-design-thumbnail. - Option C (Bold Text): "5 TOOLS" fills 60% of frame, dark gray background, teal underline.
generate-design+get-design-thumbnail.
- Option A (Tutorial): "5 TOOLS" in white on teal block left, app screenshot mockup right.
User picks Option B. Requests pure black background instead of dark gray.
Refine:
start-editing-transaction>perform-editing-operations(background #1A1A2E to #000000) >get-design-thumbnail> user approves >commit-editing-transaction.Export:
export-designas JPG, 1280x720. File: 142 KB.
Delivered:
Thumbnail ready for "5 Tools Every Solopreneur Needs":
File: youtube-thumbnail-5-tools-solopreneur.jpg
Dimensions: 1280 x 720 px
Format: JPG | Size: 142 KB
Export URL: [URL]
Design: Listicle style — bold "5" left, app icon grid right, pure black
background with teal accents. Upload to YouTube Studio as custom thumbnail.
Example 2: "I Made $10K in 30 Days" — Reaction/Story Thumbnail
User brief: "New video: 'I Made $10K in 30 Days' about first month freelancing. Dramatic, makes people click. Include my face looking shocked. Brand: gold and black."
Execution:
Details: Title: "I Made $10K in 30 Days" | Topic: freelancing income milestone | Style: auto-select Reaction + Story + Bold Text (challenge/experience) | Face: shocked expression | Brand: gold (#FFD700), black (#000000)
Brand kit:
list-brand-kitsreturns "Freelance Gold" — gold (#FFD700), black (#000000), white (#FFFFFF). Loaded.3 options generated:
- Option A (Reaction): Shocked face left third, "$10K" in massive gold right, black background, subtle money imagery at 10% opacity.
generate-design+get-design-thumbnail. - Option B (Story): Full-bleed cinematic desk scene, "$10K?" question in gold centered, dark moody tones, vignette.
generate-design+get-design-thumbnail. - Option C (Bold Text): "$10K" in gold top half, "30 DAYS" in white below, black background, gold arrow between.
generate-design+get-design-thumbnail.
- Option A (Reaction): Shocked face left third, "$10K" in massive gold right, black background, subtle money imagery at 10% opacity.
User picks Option A. Requests bigger gold text and subtle green money rain in background.
Refine:
start-editing-transaction>perform-editing-operations(scale "$10K" text up 20%, add green money particles at 15% opacity) >get-design-thumbnail> user approves >commit-editing-transaction.Export:
export-designas JPG, 1280x720. File: 198 KB.
Delivered:
Thumbnail ready for "I Made $10K in 30 Days":
File: youtube-thumbnail-10k-30-days.jpg
Dimensions: 1280 x 720 px
Format: JPG | Size: 198 KB
Export URL: [URL]
Design: Reaction style — shocked face left, massive "$10K" in gold right,
black background with subtle green money rain. Upload to YouTube Studio.
Pre-Delivery Checklist
Run before delivering. DO NOT SKIP ANY ITEM.
| # | Check | Verify |
|---|---|---|
| 1 | Text readable at thumb size | Legible at 168x94 px (YouTube mobile suggested size) |
| 2 | 3 words or fewer | Count overlay words — if >3, cut to most impactful |
| 3 | Face visible and emotional | Expression obvious at small sizes (if face included) |
| 4 | High contrast | 4.5:1+ text-to-background ratio — no medium tones |
| 5 | No clutter | Max 2 focal elements (one text, one visual) |
| 6 | Rule of thirds | Key elements on thirds grid intersections |
| 7 | Safe zone | All key content within center 80% of frame |
| 8 | 1280 x 720 px | Exact dimensions verified in export |
| 9 | Under 2 MB | JPG file size within YouTube limit |
| 10 | No title duplication | Thumbnail text complements, not duplicates, video title |
Pre-Delivery Checklist:
[x] Text readable at thumb size (168x94)
[x] 3 words or fewer on overlay
[x] Face visible and emotional (if applicable)
[x] High contrast (4.5:1+ ratio)
[x] No clutter (max 2 focal elements)
[x] Rule of thirds composition
[x] Safe zone (center 80%)
[x] Dimensions: 1280 x 720 px
[x] File size: under 2 MB
[x] Text complements (not duplicates) video title
Recovery and Troubleshooting
Brand Kit Not Found
- Inform user: "No brand kits found in your Canva account."
- Ask for primary color, secondary color, font weight preference
- No preference? Apply default: Yellow (#FFD700) text, dark navy (#0A0E27) background, white (#FFFFFF) stroke
- Suggest creating a brand kit in Canva for future consistency
All 3 Options Rejected
- Ask: "Which was closest? What did you like/dislike?"
- Ask for a reference thumbnail URL or description of what they envision
- Generate 2 new options with specific feedback
- After 5 total options, stop and provide text specs for manual Canva creation
Editing Transaction Fails
- Call
cancel-editing-transactionto clean up - Retry
start-editing-transactionon the same design - Simplify edits — one change at a time instead of batching
- If still failing, regenerate from scratch with refinements in the prompt
Export Fails
- Verify design ID with
get-design-thumbnail - Try PNG instead of JPG (inform user of format change)
- If both fail, provide design ID for manual export: Canva > Find design > Share > Download > JPG > 1280x720
Design Generation Fails
- Simplify prompt to essentials: text, colors, dimensions
- Retry once with simplified prompt
- Try Bold Text style (fewest layout demands, most reliable)
- After 3 failures, stop and provide specs for manual creation
Text Overlay Exceeds 3 Words
- Do NOT put full title on thumbnail
- Extract 1-3 most powerful words
- Present condensed options to user:
Title too long for thumbnail. Condensed options: A) "6 FIGURES" B) "$100K+" C) "BEDROOM BIZ" Which one? Or suggest your own (3 words max). - Use chosen text in all 3 options
Anti-Patterns
- DO NOT put the full video title on the thumbnail — 3 words maximum
- DO NOT use low-contrast combinations (gray/gray, light yellow/white, navy/black)
- DO NOT generate 3 variations of the same concept — each must be a different style
- DO NOT export before the user picks and approves a favorite
- DO NOT use small or thin fonts — must be readable at 168x94 px
- DO NOT crowd with multiple text elements, logos, and imagery — one text + one visual max
- DO NOT skip the refinement step — always ask if changes are needed before export
- DO NOT export as PNG by default — JPG is the YouTube thumbnail standard
- DO NOT duplicate the video title word-for-word on the thumbnail