How I Built a Tone of Voice Skill for Claude Code

AI content sounds like AI. You can spot it in two sentences. The vocabulary is off, the structure is too clean, and it reads like a press release instead of a person talking. I built a Claude Code skill that fixes this by extracting my actual voice from real transcripts and turning it into a reusable reference layer.

This post covers the full process. How I collected the raw material, how Claude analyzed it, and how the final Claude Code skill works in practice. I also run a live before/after demo so you can see the difference.

TL;DR

Extract your real voice from 5-7 transcripts across different settings (calls, dictation, meetings)
Claude analyzes sentence structure, vocabulary, reasoning patterns, and emotional expression
One unified voice with a 5-level polish system (Casual to High Polish) instead of separate profiles
The skill is a reference layer, not a content generator. Other skills load it to control how output sounds.
Live demo included: 82-word generic AI paragraph reduced to 52 words that sound like a real person

Step 1: Collect Raw Transcripts from Different Settings

I gathered 7 transcripts that captured me speaking in different contexts:

Source	Type	Details
WisprFlow voice-to-text	Dictated notes	~84,000 words, 3 weeks, ~113 wpm
Casual video call	Informal, casual	Natural conversation tone
Website workshop	Formal, structured	Senior managers audience
Website review	Semi-formal	Salespeople audience
Agency/vendor sync	Vendor management	Accountability-driven tone
Marketing & comms weekly	Peer training	Teaching/explaining mode
Claude Code instructions via WisprFlow	Stream of consciousness	Casual, raw dictation

The key: multiple settings. I talk differently in a casual call than I do presenting to senior managers. The Claude Code skill needed to capture the full range, not one mode.

Step 2: Analyze Each Transcript for Patterns

Claude read each transcript and extracted:

Sentence structure: length, fragments, how ideas connect
Vocabulary: go-to intensifiers, qualifiers, connectors, filler words
Opening patterns: how I start a thought
Closing patterns: how I end one
Reasoning style: how I build an argument
Emotional expression: how I show enthusiasm, frustration, curiosity
Conversational habits: repetition, confirmation checks, analogies

Step 3: Find the Universal Voice vs. Situational Adjustments

The early versions tried to create separate voice profiles per register (formal, casual, vendor, etc.). We scrapped that.

The insight: one unified voice with minor situational adjustments. The core patterns (short sentences, "I think" before opinions, "super" as intensifier, practical analogies) stay constant. Only small things shift:

Setting	What Adjusts
Formal (seniors)	"I would like to" replaces "I want to". Drop casual fillers. Lead with data.
Vendors	Push accountability harder. "If not, why not?"
Teaching peers	Demo live. Use familiar analogies. Add caveats.

Situational adjustments: same voice, different calibration

Step 4: Build the Claude Code Skill as a Reference Layer

This is where most people get it wrong. They try to build a skill that writes content. That does not work. The Claude Code skill I built is a voice reference that other skills load. It does not create content. It controls how content sounds.

Structure of the skill:

Sentence structure rules (do/don't with before/after examples)
Vocabulary (signature phrases, words to avoid, replacement table)
Opening patterns (what to use, what to avoid)
Closing patterns (same)
Reasoning and argument style
Emotional expression per emotion
Thought flow: situation > opinion > rationale > compare > ask
Conversational habits

Step 5: Create a Polish Level System

Instead of one tone, the Claude Code skill has 5 levels. All use the same voice at different levels of cleanup:

Level	Use Case	What Changes
Casual	Internal notes, team chat	Raw voice. All tics preserved.
Conversational	LinkedIn, social, emails	Core voice. Clean up fragments. Keep "I think", "super".
Professional	Blog, cold email, vendor comms	Fewer verbal tics. Keep reasoning pattern.
Formal	Presentations to seniors	Data-led. Structured. No casual fillers.
High polish	Ad copy	Strip all tics. Keep directness and short sentences.

Polish level system: same voice at different levels of cleanup

Step 6: Wire It into Other Claude Code Skills

The tone of voice skill gets loaded by my content skill, personal brand skill, social post skill, hooks skill, and writing skill. Each of those handles format (LinkedIn post structure, email structure, etc.). The tone skill handles how it sounds.

So when I ask Claude Code to write a LinkedIn post, the content skill handles the format and the tone skill makes sure it sounds like me. Two layers, one output.

Step 7: Build an Audit Mode

I can paste any piece of content and run a voice audit. The skill returns:

Overall voice match (High / Medium / Low)
Number of issues found
Full corrected version with inline [VOICE FIX: old -> new] notes
Top 3 voice mismatches

This is where the before/after demo comes in. Let me show you what it looks like in practice.

Live Demo: Before and After

I asked Claude to write a general paragraph about automation. No tone of voice skill loaded. Here is what came out:

BEFORE (generic AI output)
─────────────────────────
In today's rapidly evolving digital landscape, it's worth noting
that leveraging automation tools has become increasingly pivotal
for businesses seeking to elevate their operational efficiency.
Furthermore, by harnessing comprehensive workflow platforms,
organizations can essentially revolutionize their processes,
ultimately empowering teams to focus on more strategic endeavors.
Prior to implementing these solutions, however, it would be
advisable to conduct a thorough assessment of your existing
infrastructure in order to ensure seamless integration.
The key takeaway here is that embracing automation isn't just
about saving time. It's about unlocking transformative potential.

Generic AI output

Then I ran the voice audit on it:

VOICE AUDIT
───────────
Overall match: Low
Issues found: 14

CORRECTED VERSION
─────────────────
Automation saves you time.
[VOICE FIX: "In today's rapidly evolving digital landscape,
it's worth noting that leveraging automation tools has become
increasingly pivotal" -> direct statement]

That's basically the whole point.
[VOICE FIX: "Furthermore, by harnessing comprehensive workflow
platforms, organizations can essentially revolutionize their
processes" -> simplified]

You set it up once, it runs every time after. 100%.
[VOICE FIX: removed em dash construction, added signature phrase]

Before you build anything, check what you already have.
[VOICE FIX: "Prior to implementing these solutions, however,
it would be advisable to conduct a thorough assessment of your
existing infrastructure in order to ensure seamless integration"
-> plain language]

Does your current stack support it? If not, why not?
[VOICE FIX: added signature accountability phrase]

I think most people overcomplicate this. Just automate the
repeatable stuff and move on.
[VOICE FIX: "embracing automation isn't just about saving time.
It's about unlocking transformative potential" -> opinion
framed with "I think", no fluff]

TOP VOICE MISMATCHES
────────────────────
1. "In today's rapidly evolving digital landscape"
   -> Open with the point. No preamble.
2. "leveraging", "pivotal", "elevate", "harnessing",
   "revolutionize", "empowering", "endeavors", "seamless",
   "transformative", "unlocking"
   -> 10 forbidden words in 4 sentences
3. "The key takeaway here is"
   -> No summaries. End on the point.

Voice audit with corrections

And here is the clean final version after running through the tone of voice skill:

AFTER (tone of voice applied)
─────────────────────────────
Automation saves you time. That's basically the whole point.

You set it up once, it runs every time after. 100%.

Before you build anything, check what you already have.
Does your current stack support it? If not, why not?

I think most people overcomplicate this.
Just automate the repeatable stuff and move on.

Same content. Sounds like a person, not a press release.

Metric	Before (Generic AI)	After (Tone of Voice)
Sentences	4	6
Word count	82	52
Forbidden words	10	0
Em dashes	2	0
Voice	Passive	Active
Opens with	"In today's rapidly evolving..."	"Automation saves you time."

Step 8: Add a Self-Check for Quality Control

Every Claude Code skill that loads the tone of voice runs 10 checks before returning output:

No sentence longer than 25 words
At least one "I think" or "I'm wondering" in opinion pieces
No forbidden vocabulary (leveraging, pivotal, seamlessly, etc.)
No formal transitions (Furthermore, Moreover, Additionally)
No em dashes
No inspirational sign-offs
No summaries at the end
Opens with the point
Reads like someone talking
At least one signature connector in conversational formats

A Note on How I Use This

This Claude Code skill is not meant for everything I write. It captures my base tone. How I actually talk. After that, it still gets adjusted for the specific channel.

A formal email to a senior stakeholder sounds different from a LinkedIn post. A Slack message to my team sounds different from a vendor email. That is normal. In real life, you adjust how you speak depending on who you are talking to, the channel, and the setting.

The skill gets the foundation right. It makes AI output sound like me instead of a generic language model. Channel-specific tailoring happens on top. I actually recommend doing this. Build your base voice first, then adjust per channel. Not the other way around.

The Core Idea

Your voice already exists. It is in your recordings, your transcripts, your dictated notes. The process is extraction, not invention. Feed real speech into an LLM, have it identify patterns across multiple contexts, then codify those patterns into a Claude Code skill that any content tool can load.

I spent a few hours building this. Every piece of content I produce now sounds like me instead of ChatGPT. That trade-off is worth it.

Frequently asked questions

What is a tone of voice skill for Claude Code?

A tone of voice skill is a markdown file that codifies your real speech patterns, including sentence structure, vocabulary, reasoning style, and emotional expression, into rules that Claude Code follows. It works as a reference layer that other content skills load to control how output sounds, rather than generating content directly.

How many transcripts do you need to build a tone of voice skill?

Five to seven transcripts from different settings give enough range. Include calls, meetings, dictated notes, and informal conversations. The key is variety: you talk differently in a vendor call than in a team meeting, and the skill needs to capture your full range, not just one mode.

Does a tone of voice skill replace editing?

No. The skill gets the foundation right so AI output sounds like you instead of a generic language model. You still adjust for the specific channel, audience, and context. It eliminates the biggest problem, generic AI voice, so your editing time goes toward substance rather than making the tone feel human.