voice-extractor

Extract and document someone's authentic writing voice from samples. Use when someone needs a "voice guide," wants to capture their writing DNA, or needs to train AI to write in their style. Also useful for ghostwriting, brand voice documentation, or onboarding writers.

214 stars

Best use case

voice-extractor is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Extract and document someone's authentic writing voice from samples. Use when someone needs a "voice guide," wants to capture their writing DNA, or needs to train AI to write in their style. Also useful for ghostwriting, brand voice documentation, or onboarding writers.

Teams using voice-extractor should expect a more consistent output, faster repeated execution, less prompt rewriting.

When to use this skill

  • You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

  • You only need a quick one-off answer and do not need a reusable workflow.
  • You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/voice-extractor/SKILL.md --create-dirs "https://raw.githubusercontent.com/BrianRWagner/ai-marketing-claude-code-skills/main/voice-extractor/SKILL.md"

Manual Installation

  1. Download SKILL.md from GitHub
  2. Place it in .claude/skills/voice-extractor/SKILL.md inside your project
  3. Restart your AI agent — it will auto-discover the skill

How voice-extractor Compares

Feature / Agentvoice-extractorStandard Approach
Platform SupportNot specifiedLimited / Varies
Context Awareness High Baseline
Installation ComplexityUnknownN/A

Frequently Asked Questions

What does this skill do?

Extract and document someone's authentic writing voice from samples. Use when someone needs a "voice guide," wants to capture their writing DNA, or needs to train AI to write in their style. Also useful for ghostwriting, brand voice documentation, or onboarding writers.

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

Related Guides

SKILL.md Source

# Voice Extractor

AI-generated content all sounds the same. The fix isn't better prompts — it's teaching the AI how you actually communicate.

This skill extracts your communication DNA from writing samples and produces a Voice Guide: documented, tested, and ready to use.

---

## Mode

Detect from context or ask: *"Quick voice snapshot, full Voice Guide, or full guide with examples?"*

| Mode | What you get | Best for |
|------|-------------|----------|
| `quick` | Top 5 voice characteristics + 3 do/don't rules | Fast style reference, single piece |
| `standard` | Full Voice Guide: tone, vocabulary, rhythm, structure | AI training, ghostwriting, brand documentation |
| `deep` | Full Voice Guide + 10 sample rewrites + writing rules checklist + AI training examples | Onboarding writers, building a brand voice system |

**Default: `standard`** — use `quick` if they just need a fast reference. Use `deep` if they're onboarding a ghostwriter or building a content team.

---

## Context Loading Gates

**Before extracting, collect:**

- [ ] **Writing samples** — minimum 3 samples OR 500 total words (see priority list below)
- [ ] **Purpose of voice guide** — AI training? Ghostwriter onboarding? Team alignment?
- [ ] **Confidence zones** — Any topics where they want to sound more/less authoritative?
- [ ] **Known anti-patterns** — Any words or phrases they already know they want to avoid?

**Sample priority (most → least authentic):**
1. Casual Slack or email (raw, unedited voice)
2. Podcast or call transcript
3. LinkedIn posts or articles
4. Website copy (often edited, less authentic)

**Minimum sample gate:** If samples total under 500 words, stop:
> "These samples are too short to extract reliable patterns. Please add 2-3 more — emails, Slack messages, or transcripts work best. The messier and more casual, the better."

Do not attempt full extraction from under 500 words. Offer quick mode instead.

---

## Phase 1: Sample Quality Assessment

Before extracting, reason through:

1. **Sample authenticity:** Are these samples from edited/polished contexts (website, press) or raw contexts (Slack, email)? More polish = less authentic voice.
2. **Sample variety:** Do the samples cover different contexts (professional, casual, educational)? Single-context samples produce single-dimension voice guides.
3. **Exclusion check:** Identify and flag patterns that are NOT the authentic voice:
   - Platform formatting tics (LinkedIn line breaks, Twitter brevity forcing)
   - Typos and autocorrect errors
   - Phrases borrowed from others (quotes, retweets)
   - Unusually formal writing (legal docs, press releases)
4. **Sample size adequacy:** Is there enough material for full mode, or should I use quick mode?

Output a sample assessment:
> "I have [X samples / Y words] to work with. Quality: [high/medium — why]. I'll use [full/quick] mode. Excluding: [any patterns and why]."

---

## Phase 2: Core Energy Extraction

Identify the fundamental communication mode:

**Role:**
- Teacher (breaks things down systematically)
- Challenger (pushes back on assumptions)
- Cheerleader (builds confidence and momentum)
- Straight-shooter (cuts through BS efficiently)

**Default energy:**
- Calm authority ("Here's what works.")
- High enthusiasm ("This is exciting — let me show you.")
- Understated confidence ("I've seen this a hundred times.")

**Recurring themes:** What topics appear unprompted across samples? These are the things they actually care about.

---

## Phase 3: Phrase Extraction (Systematic)

Scan all samples and extract:

**Transition phrases** (how they shift topics):
- Quote exact examples from samples
- Pattern: "Here's the thing...", "What I've learned...", "Let me put it differently..."

**Emphasis phrases** (how they land a point):
- Quote exact examples
- Pattern: "The reality is...", "This is the part people miss...", "Here's the actual problem..."

**Closers** (how they wrap up):
- Quote exact examples
- Pattern: "That's the move.", "Start there.", "You've got this."

---

## Phase 4: Confidence Zone Mapping

| Zone | Description | Language Markers |
|---|---|---|
| Full authority | Topics they're an expert in | No hedging, definitive statements, "here's what works" |
| Earned perspective | Topics with experience but not mastery | "In my experience...", "What I've found..." |
| Active exploration | Topics they're learning now | "I'm testing this...", "What I'm seeing..." |

Map their stated expertise areas to each zone. This calibration is what makes the voice feel real vs. one-dimensional.

---

## Phase 5: Anti-Pattern Documentation

Extract what they'd NEVER say:
- Words that would feel wrong in their voice
- Phrases that make them cringe
- Tones they naturally avoid
- Industry jargon they hate

Source these from sample evidence where possible: "You never used [word] across [X samples] — it doesn't fit your voice."

---

## Phase 6: Validation Test (REQUIRED)

After extracting the full profile, generate 2 test sentences on the same topic:

**Version A** (using the extracted voice profile):
> "[Sample sentence in their voice]"

**Version B** (wrong voice — contrasting example):
> "[Same content, different voice — shows what to avoid]"

Ask the user: "Does Version A actually sound like you when you're not overthinking it? What feels off?"

This validation catches extraction errors before the guide is put into production.

---

## Quick Mode (`--quick`)

When samples are thin (300–500 words) or time is short:

1. Read 3 samples fast
2. Pull 10 signature phrases
3. Note 3 things they'd never say
4. Write 1 sentence describing their energy

**Output:** Minimum viable voice guide.

**Difference from full mode:**
- Quick: ~10 phrases, 3 anti-patterns, 1-sentence energy descriptor
- Full: Complete profile with confidence calibration, validated test sentences, and source-cited examples

---

## Phase 7: Self-Critique Pass (REQUIRED)

After generating the Voice Guide:

- [ ] Are the extracted phrases actually from the samples, or am I inferring them?
- [ ] Does the anti-pattern list include specific words/phrases, or just vague categories?
- [ ] Do the validation test sentences demonstrate a real difference between in-voice and out-of-voice?
- [ ] Is the confidence zone mapping specific to named topics, or just generic?
- [ ] Would a ghostwriter be able to use this guide without asking follow-up questions?

Flag any issues: "The anti-pattern section only has 2 entries — not enough for a usable guide. I need more samples or direct input from the user."

---

## Output Structure

```markdown
## Voice Guide: [Name] — [Date]

### Sample Assessment
- Samples: [count, types]
- Total words: [count]
- Quality: [high/medium — reason]
- Mode: [quick/full]
- Excluded: [patterns excluded + why]

---

### Core Energy
- Role: [teacher/challenger/cheerleader/straight-shooter]
- Default energy: [description]
- Recurring themes: [list]

### Signature Phrases
**Transitions:**
- "[Phrase]" (source: [email/post])
- "[Phrase]"

**Emphasis:**
- "[Phrase]" (source: [email/post])

**Closers:**
- "[Phrase]"

### Confidence Calibration
**Full authority (no hedging):**
Topics: [list]
Sounds like: "[example sentence]"

**Earned perspective:**
Topics: [list]
Sounds like: "[example sentence]"

**Active exploration:**
Topics: [list]
Sounds like: "[example sentence]"

### Anti-Patterns (Never Use)
- [Word/phrase] — why: [evidence from samples]
- [Word/phrase] — why: [evidence]

### Validation Test
**This sounds like you:**
"[Version A]"

**This doesn't:**
"[Version B — contrast]"

### Self-Critique Notes
[Any gaps, things to validate with user]

### Usage Instructions
- For AI: Paste this guide into your system prompt
- For ghostwriter: Share on day 1 — cuts revision cycles in half
- For team: This is the benchmark for "on brand"
```

---

*Skill by Brian Wagner | AI Marketing Architect | brianrwagner.com*

Related Skills

youtube-summarizer

214
from BrianRWagner/ai-marketing-claude-code-skills

Automatically fetch YouTube video transcripts, generate structured summaries, and send full transcripts to messaging platforms. Detects YouTube URLs and provides metadata, key insights, and downloadable transcripts.

vault-cleanup-auditor

214
from BrianRWagner/ai-marketing-claude-code-skills

Audit your Obsidian vault in Claude Code — finds stale drafts, empty folders, duplicate filenames, and incomplete files. Saves a dated report.

tweet-draft-reviewer

214
from BrianRWagner/ai-marketing-claude-code-skills

Review tweet drafts in Claude Code against 8 voice rules. Scores 1-10, breaks down every rule, and rewrites anything that scores below 7.

testimonial-collector

214
from BrianRWagner/ai-marketing-claude-code-skills

Systematically gather, score, and format client testimonials. Use when someone needs social proof, wants to collect feedback, needs to turn happy clients into public advocates, or asks for help requesting or drafting a testimonial.

social-card-gen

214
from BrianRWagner/ai-marketing-claude-code-skills

Generate platform-specific social post variants (Twitter/X, LinkedIn, Reddit) from one source input. Works with or without Node.js script. Includes platform reasoning, quality review, and guardrails against cross-posting spam.

reddit-insights

214
from BrianRWagner/ai-marketing-claude-code-skills

Search and analyze Reddit content using semantic AI search via reddit-insights.com MCP server. Use when you need to: (1) Find user pain points and frustrations for product ideas, (2) Discover niche markets or underserved needs, (3) Research what people really think about products/topics, (4) Find content inspiration from real discussions, (5) Analyze sentiment and trends on Reddit, (6) Validate business ideas with real user feedback. Triggers: reddit search, find pain points, market research, user feedback, what do people think about, reddit trends, niche discovery, product validation.

newsletter-creation-curation

214
from BrianRWagner/ai-marketing-claude-code-skills

Industry-adaptive B2B newsletter creation with stage, role, and geography-aware workflows

meeting-prep-cc

214
from BrianRWagner/ai-marketing-claude-code-skills

Generate a pre-meeting prep brief in Claude Code. Researches participants, pulls vault context, builds agenda, surfaces sharp questions. Use when user says "prep for this meeting," "I have a call with," "meeting tomorrow with," or "prep brief for [name/company]."

marketing-principles

214
from BrianRWagner/ai-marketing-claude-code-skills

Apply timeless marketing and business principles to any problem. Use when someone needs strategic thinking, wants to evaluate a marketing decision, needs a framework for a tough choice, or mentions "first principles," "should I do X," "what would work here," or wants to think through a marketing problem systematically.

linkedin-profile-optimizer

214
from BrianRWagner/ai-marketing-claude-code-skills

Audit and rewrite your LinkedIn profile to attract the right people. Scores each section, rewrites headline and about copy, and includes an AI visibility checklist so you show up in ChatGPT, Perplexity, and Claude search. Use when someone says "optimize my LinkedIn," "LinkedIn profile help," "rewrite my about section," or "how do I show up in AI search."

linkedin-authority-builder

214
from BrianRWagner/ai-marketing-claude-code-skills

Build a LinkedIn content system for thought leadership. Use when someone needs to establish authority, attract inbound leads, or build a consistent content presence. Covers positioning, content pillars, formats, and posting rhythm.

last30days

214
from BrianRWagner/ai-marketing-claude-code-skills

Research any topic across Reddit, X, and web from the last 30 days. Get current trends, real community sentiment, and actionable insights in 7 minutes vs 2 hours manual research.