AI Tools8 min

How I Built a Tone of Voice Skill for Claude Code

Step-by-step process for building a Claude Code skill that extracts your real voice from transcripts and applies it to everything AI writes for you.

By Donatas Simkus

How I Built a Tone of Voice Skill for Claude Code – AI Tools blog cover

AI content sounds like AI. You can spot it in two sentences. The vocabulary is off, the structure is too clean, and it reads like a press release instead of a person talking. I built a Claude Code skill that fixes this by extracting my actual voice from real transcripts and turning it into a reusable reference layer.

This post covers the full process. How I collected the raw material, how Claude analyzed it, and how the final Claude Code skill works in practice. I also run a live before/after demo so you can see the difference.

Key Takeaways
  • 1.Extract your real voice from 5-7 transcripts across different settings (calls, dictation, meetings)
  • 2.Claude analyzes sentence structure, vocabulary, reasoning patterns, and emotional expression
  • 3.One unified voice with a 5-level polish system (Casual to High Polish) instead of separate profiles
  • 4.The skill is a reference layer, not a content generator. Other skills load it to control how output sounds.
  • 5.Live demo included: 82-word generic AI paragraph reduced to 52 words that sound like a real person

Step 1: Collect Raw Transcripts from Different Settings

I gathered 7 transcripts that captured me speaking in different contexts:

SourceTypeDetails
WisprFlow voice-to-textDictated notes~84,000 words, 3 weeks, ~113 wpm
Video call with business partnerInformal, casualNatural conversation tone
Website workshopFormal, structuredSenior managers audience
Website reviewSemi-formalSalespeople audience
Agency/vendor syncVendor managementAccountability-driven tone
Marketing & comms weeklyPeer trainingTeaching/explaining mode
Claude Code instructions via WisprFlowStream of consciousnessCasual, raw dictation

The key: multiple settings. I talk differently to my business partner than I do to senior managers. The Claude Code skill needed to capture the full range, not one mode.

Step 2: Analyze Each Transcript for Patterns

Claude read each transcript and extracted:

  • Sentence structure: length, fragments, how ideas connect
  • Vocabulary: go-to intensifiers, qualifiers, connectors, filler words
  • Opening patterns: how I start a thought
  • Closing patterns: how I end one
  • Reasoning style: how I build an argument
  • Emotional expression: how I show enthusiasm, frustration, curiosity
  • Conversational habits: repetition, confirmation checks, analogies

Step 3: Find the Universal Voice vs. Situational Adjustments

The early versions tried to create separate voice profiles per register (formal, casual, vendor, etc.). We scrapped that.

The insight: one unified voice with minor situational adjustments. The core patterns (short sentences, "I think" before opinions, "super" as intensifier, practical analogies) stay constant. Only small things shift:

SettingWhat Adjusts
Formal (seniors)"I would like to" replaces "I want to". Drop casual fillers. Lead with data.
VendorsPush accountability harder. "If not, why not?"
Teaching peersDemo live. Use familiar analogies. Add caveats.
Situational adjustments: same voice, different calibration

Step 4: Build the Claude Code Skill as a Reference Layer

This is where most people get it wrong. They try to build a skill that writes content. That does not work. The Claude Code skill I built is a voice reference that other skills load. It does not create content. It controls how content sounds.

Structure of the skill:

  • Sentence structure rules (do/don't with before/after examples)
  • Vocabulary (signature phrases, words to avoid, replacement table)
  • Opening patterns (what to use, what to avoid)
  • Closing patterns (same)
  • Reasoning and argument style
  • Emotional expression per emotion
  • Thought flow: situation > opinion > rationale > compare > ask
  • Conversational habits

Step 5: Create a Polish Level System

Instead of one tone, the Claude Code skill has 5 levels. All use the same voice at different levels of cleanup:

LevelUse CaseWhat Changes
CasualInternal notes, partner chatRaw voice. All tics preserved.
ConversationalLinkedIn, social, emailsCore voice. Clean up fragments. Keep "I think", "super".
ProfessionalBlog, cold email, vendor commsFewer verbal tics. Keep reasoning pattern.
FormalPresentations to seniorsData-led. Structured. No casual fillers.
High polishAd copyStrip all tics. Keep directness and short sentences.
Polish level system: same voice at different levels of cleanup

Step 6: Wire It into Other Claude Code Skills

The tone of voice skill gets loaded by my content skill, personal brand skill, social post skill, hooks skill, and writing skill. Each of those handles format (LinkedIn post structure, email structure, etc.). The tone skill handles how it sounds.

So when I ask Claude Code to write a LinkedIn post, the content skill handles the format and the tone skill makes sure it sounds like me. Two layers, one output.

Step 7: Build an Audit Mode

I can paste any piece of content and run a voice audit. The skill returns:

  • Overall voice match (High / Medium / Low)
  • Number of issues found
  • Full corrected version with inline [VOICE FIX: old -> new] notes
  • Top 3 voice mismatches

This is where the before/after demo comes in. Let me show you what it looks like in practice.

Live Demo: Before and After

I asked Claude to write a general paragraph about automation. No tone of voice skill loaded. Here is what came out:

BEFORE (generic AI output)
─────────────────────────
In today's rapidly evolving digital landscape, it's worth noting
that leveraging automation tools has become increasingly crucial
for businesses seeking to enhance their operational efficiency.
Furthermore, by utilizing comprehensive workflow platforms,
organizations can essentially streamline their processes,
ultimately empowering teams to focus on more strategic endeavors.
Prior to implementing these solutions, however, it would be
advisable to conduct a thorough assessment of your existing
infrastructure in order to ensure seamless integration.
The key takeaway here is that embracing automation isn't just
about saving time. It's about unlocking transformative potential.
Generic AI output

Then I ran the voice audit on it:

VOICE AUDIT
───────────
Overall match: Low
Issues found: 14

CORRECTED VERSION
─────────────────
Automation saves you time.
[VOICE FIX: "In today's rapidly evolving digital landscape,
it's worth noting that leveraging automation tools has become
increasingly crucial" -> direct statement]

That's basically the whole point.
[VOICE FIX: "Furthermore, by utilizing comprehensive workflow
platforms, organizations can essentially streamline their
processes" -> simplified]

You set it up once, it runs every time after. 100%.
[VOICE FIX: removed em dash construction, added signature phrase]

Before you build anything, check what you already have.
[VOICE FIX: "Prior to implementing these solutions, however,
it would be advisable to conduct a thorough assessment of your
existing infrastructure in order to ensure seamless integration"
-> plain language]

Does your current stack support it? If not, why not?
[VOICE FIX: added signature accountability phrase]

I think most people overcomplicate this. Just automate the
repeatable stuff and move on.
[VOICE FIX: "embracing automation isn't just about saving time.
It's about unlocking transformative potential" -> opinion
framed with "I think", no fluff]

TOP VOICE MISMATCHES
────────────────────
1. "In today's rapidly evolving digital landscape"
   -> Open with the point. No preamble.
2. "leveraging", "crucial", "enhance", "utilizing",
   "streamline", "empowering", "endeavors", "seamless",
   "transformative", "unlocking"
   -> 10 forbidden words in 4 sentences
3. "The key takeaway here is"
   -> No summaries. End on the point.
Voice audit with corrections

And here is the clean final version after running through the tone of voice skill:

AFTER (tone of voice applied)
─────────────────────────────
Automation saves you time. That's basically the whole point.

You set it up once, it runs every time after. 100%.

Before you build anything, check what you already have.
Does your current stack support it? If not, why not?

I think most people overcomplicate this.
Just automate the repeatable stuff and move on.
Same content. Sounds like a person, not a press release.
MetricBefore (Generic AI)After (Tone of Voice)
Sentences46
Word count8252
Forbidden words100
Em dashes20
VoicePassiveActive
Opens with"In today's rapidly evolving...""Automation saves you time."

Step 8: Add a Self-Check for Quality Control

Every Claude Code skill that loads the tone of voice runs 10 checks before returning output:

  1. 1.No sentence longer than 25 words
  2. 2.At least one "I think" or "I'm wondering" in opinion pieces
  3. 3.No forbidden vocabulary (utilize, essentially, arguably, etc.)
  4. 4.No formal transitions (Furthermore, Moreover, Additionally)
  5. 5.No em dashes
  6. 6.No inspirational sign-offs
  7. 7.No summaries at the end
  8. 8.Opens with the point
  9. 9.Reads like someone talking
  10. 10.At least one signature connector in conversational formats

A Note on How I Use This

This Claude Code skill is not meant for everything I write. It captures my base tone. How I actually talk. After that, it still gets adjusted for the specific channel.

A formal email to a senior stakeholder sounds different from a LinkedIn post. A Slack message to my team sounds different from a vendor email. That is normal. In real life, you adjust how you speak depending on who you are talking to, the channel, and the setting.

The skill gets the foundation right. It makes AI output sound like me instead of a generic language model. Channel-specific tailoring happens on top. I actually recommend doing this. Build your base voice first, then adjust per channel. Not the other way around.

The Core Idea

Your voice already exists. It is in your recordings, your transcripts, your dictated notes. The process is extraction, not invention. Feed real speech into an LLM, have it identify patterns across multiple contexts, then codify those patterns into a Claude Code skill that any content tool can load.

I spent a few hours building this. Every piece of content I produce now sounds like me instead of ChatGPT. That trade-off is worth it.

Frequently Asked Questions

What is a tone of voice skill for Claude Code?

A tone of voice skill is a markdown file that codifies your real speech patterns, including sentence structure, vocabulary, reasoning style, and emotional expression, into rules that Claude Code follows. It works as a reference layer that other content skills load to control how output sounds, rather than generating content directly.

How many transcripts do you need to build a tone of voice skill?

Five to seven transcripts from different settings give enough range. Include calls, meetings, dictated notes, and informal conversations. The key is variety: you talk differently in a vendor call than in a team meeting, and the skill needs to capture your full range, not just one mode.

Does a tone of voice skill replace editing?

No. The skill gets the foundation right so AI output sounds like you instead of a generic language model. You still adjust for the specific channel, audience, and context. It eliminates the biggest problem, generic AI voice, so your editing time goes toward substance rather than making the tone feel human.

About the Author

Donatas Simkus

Growth marketing leader with 10+ years scaling B2B SaaS startups across the US, EU, and Middle East. Specializes in AI-driven automation, demand generation, and full-funnel growth strategy.

View experience & credentials