Episode 12·June 5, 2026

Stop Being the Bottleneck: Scorecard + Cadences to Replace Yourself

Spotify Apple Podcasts RSS Feed Open Companion Resource

Intro

This episode is for founders at $8K-$40K MRR who are stuck in delivery and want to reclaim 10-20 hours per week without sacrificing quality or stalling growth. You'll get a systematic approach to delegation that works across time zones with AI assistance.

In This Episode

Kira shares her 2 AM Guatemala meltdown when her project manager's power went out and she couldn't execute her own agency's workflow. Santi breaks down the Replace-Yourself Scorecard that scores SOP maturity, QA coverage, named backups, and handoff risk across every role you touch. They walk through three weekly cadences — ops review, QA wall, and sales forecast — that keep delegated work from drifting. You'll see three archetype paths (content ops, RevOps, support automations) with before-and-after hour recovery and margin lift estimates, plus how to handle the verification burden that kills most AI delegation attempts.

Key Takeaways

Use the Replace-Yourself Scorecard to identify which roles you can delegate this week versus which need 12-week buildout based on SOP maturity, QA coverage, and named backups
Implement three lean weekly cadences (ops review, QA wall, sales forecast) that prevent delegated work from drifting while keeping meeting overhead under 2 hours total
Define 'resolved' locally for AI-assisted work with specific thresholds (CSAT scores, follow-up windows) and use LLM-as-judge plus golden set replays to catch quality drift before clients do

Timestamps

Companion Resource

template

The Replace‑Yourself Toolkit

A fill‑in‑the‑blanks toolkit for nomad founders to score what to delegate, pick three weekly handoffs, and run a 12‑week plan with tight Ops/QA/Sales cadences. Designed for AI‑assisted, async teams that need results next week, not someday.

EOS Worldwide — Level 10 Meeting overview
eosworldwide.com
- - The EOS Level 10 Meeting is a structured, weekly 90‑minute leadership meeting with a standing Scorecard review and named owners for 5–15 measurables.
EOS Delegate and Elevate (official PDF)
client.eosworldwide.com
- - EOS Delegate and Elevate is a standardized tool for deciding what to hand off based on a two‑by‑two of 'like/don’t like' and 'good at/not good at.'
Asana Anatomy of Work 2023 (press release summary)
investors.asana.com
- - Knowledge workers spend 58% of their time on 'work about work' (coordination, switching, status) and estimate they could save 4.9 hours/week with better processes.
McKinsey Global Institute — Economic potential of generative AI
mckinsey.com
- - Generative AI has the potential to automate activities that occupy 60–70% of employees’ time in some occupations, and add $2.6T–$4.4T in annual economic value.
HubSpot State of Marketing 2024 (slides)
slideshare.net
- - Marketers report saving roughly 3 hours per content piece and ~2.5 hours/day overall using AI tools.
Zendesk CX Trends 2024 (newsroom)
zendesk.com
- - Zendesk’s 2024 CX Trends surveyed 2,818 consumers and 4,441 CX leaders; 70% of CX leaders are reimagining customer journeys with generative AI.
Intercom Help — Fin AI Agent reporting
intercom.com
- - Intercom Fin AI Agent reporting defines 'automation rate' as the percentage of conversations resolved by Fin out of all new conversations; deflection rate and CSAT are tracked in‑product.
VentureBeat coverage of Intercom Fin Apex 1.0
venturebeat.com
- - Intercom benchmark claim: Fin Apex 1.0 achieved a 73.1% AI‑only resolution rate in internal benchmarking vs. leading general models (press coverage).
GoalCadence 4DX summary; 12 Week Year site
goalcadence.com
- - Weekly accountability meetings (WIG sessions) are core to the 4 Disciplines of Execution; the 12 Week Year also formalizes a Weekly Accountability Meeting (WAM).
Wikipedia — CMMI (summary)
en.wikipedia.org
- - CMMI/CMM maturity models define five process maturity levels (Initial → Managed → Defined → Quantitatively Managed → Optimizing) commonly used to assess SOP/process maturity.
Dominican University of California materials (via summary)
core.ac.uk
- - Public goal‑setting research (Dominican University) found that participants who wrote goals, shared commitments, and sent weekly progress reports had ~76% success vs. ~43% without.
Gong help docs and guides
help.gong.io
- - Gong recommends weekly forecast review meetings for most sales motions, with an intensified cadence in the final week of a period.
TechRadar Pro summary of Foxit research
techradar.com
- - Some operators report minimal net time saved from AI due to verification/quality‑assurance overhead (e.g., Foxit‑commissioned research summarized by TechRadar Pro: ~16 minutes/week net).
Insight Partners — Sales KPIs for Puppies and ScaleUps
insightpartners.com
- - Insight Partners advises pipeline coverage equal to the inverse of historical win rate (e.g., 4× for a 25% win rate) as a weekly leading indicator to track.
Intercom Community case story: Pupil Progress raised Fin AI Agent resolution rate from 55% to 75% in two months (customer-to-customer forum post; vendor platform but named example).
community.intercom.com
- - Pupil Progress (education SaaS) using Intercom Fin AI Agent
- - Shows real deflection/resolution gains from AI support with specific percentages and time window; supports the Support Automations path.
Zapier Customer Story: ClickUp support team time savings
zapier.com
- - ClickUp (work management) support operations
- - Quantifies hours saved from automation in a named B2B context; helps anchor hour‑recovery assumptions.
Zapier Customer Stories index (aggregator page referencing Coursera story)
zapier.com
- - Coursera (edtech) data operations
- - Named brand; demonstrates large total hours saved claim via automation; use with caveat as vendor case study.
Console case study (vendor): Webflow internal IT support
console.com
- - Webflow (no‑code web platform) internal support automation
- - Shows very high automation/deflection on internal support — a ceiling case that sets expectations and guardrails.
Agentmelt case study (vendor)
agentmelt.com
- - DTC apparel brand support automation
- - Moderate deflection example (35%); useful to keep estimates conservative and realistic for ecommerce contexts.

Kira: It's two AM in Guatemala City. I've got a client deliverable due in six hours — a full content calendar, twenty-four assets, distribution plan — and my project manager in Lagos just went dark. Not ghosted. Her power went out. Whole neighborhood.

Santi: And you can't just… do it yourself?

Kira: I could do it myself. I'd done it myself a hundred times. That was the problem. I opened the doc, started pulling the brief, and I realized — I don't actually know how she builds these anymore. She'd changed the workflow three months ago. Better prompts, different sequencing, a scoring rubric I'd never seen. And none of it was written down anywhere I could find at two AM on my phone.

Santi: So your best operator upgraded the system and you had no idea what the system was.

Kira: Zero idea. I'm sitting in this Airbnb with one bar of signal, scrolling through Slack threads trying to reverse-engineer my own agency's process. And it hits me — I'm not the owner of this business. I'm the backup plan. And I'm a terrible backup plan.

Santi: How many hours a week were you doing at that point?

Kira: Fifty. Maybe fifty-five. At twenty-two K MRR. Which sounds fine until you realize I was doing the same hours at eight K. The revenue tripled. My week didn't change.

Santi: Because you scaled the work without scaling yourself out of it.

Kira: I scaled the work. I hired people. I added AI workflows. I did all the things you're supposed to do. But I never scored which seats I was still sitting in. I never asked — if I disappear for two weeks, what breaks?

Santi: And that night, you found out.

Kira: That night I found out everything breaks. Everything.

Santi: If you're between eight and forty K MRR and you're still the person who has to touch every deliverable, answer every edge case, and show up when someone's power goes out — you are paying yourself less per hour every time you grow. That's the tax on being an operator who never became an owner. And today we're going to kill that tax with a delegation scorecard, three weekly handoffs, and a twelve-week plan that gets you ten to twenty hours back without stalling your revenue.

Kira: And this is the important part — we're not talking about hiring your way out. We're talking about scoring where you're replaceable right now, then using AI-assisted delegation and fixed cadences to actually make the handoff stick.

Santi: So let's go back to that night in Guatemala. You're sitting there at two AM, you can't execute your own agency's workflow. What did you do the next morning?

Kira: I made a list. Every role I personally touched in a given week. Content editing, client onboarding calls, QA on AI outputs, invoicing, the weekly pipeline review. And next to each one I wrote two things — do I have an SOP for this, and is there a person who can do it without me?

Santi: And?

Kira: Seven roles. Two had SOPs. One had a backup. One.

Santi: Out of seven.

Kira: Out of seven. And that's when I realized — delegation isn't a mindset problem. It's a measurement problem. I didn't need a pep talk about letting go. I needed a scorecard that told me exactly where I was exposed.

Santi: Okay, so this is where the Replace-Yourself Scorecard comes from. And I want to be specific about what goes on it because most delegation frameworks — EOS has their Delegate and Elevate tool, which is fine — they ask you what you like doing and what you're good at. That's a starting point. But it doesn't tell you whether the seat is actually ready to hand off.

Kira: Right. Liking something and being replaceable in it are completely different questions.

Santi: Completely different. So the scorecard has five columns for every role you touch. The role itself. Your SOP maturity — and we score that on a one-to-five scale borrowed from the CMMI framework. One means it's in your head. Three means it's documented and someone else has executed it. Five means it runs without you and self-corrects.

Kira: What's the QA column?

Santi: QA coverage — what percentage of outputs from that seat pass a quality check that isn't you eyeballing it. Could be an LLM-as-judge rubric, could be a human reviewer, could be both. But it has to be scored, not vibes.

Kira: Not vibes. That's the whole thing, right? I was running vibes-based QA across my entire agency.

Santi: Everyone is. And then there's the named backup — an actual person or system that can cover the seat for two weeks. Not "I'll figure it out." A name. And finally, handoff risk — low, medium, high — based on what happens to the client or the revenue if the handoff goes wrong.

Kira: So when I scored myself after Guatemala, my content editing seat was — six hours a week, SOP maturity three because my PM had actually documented it even though I didn't know, QA coverage maybe seventy percent because we had a tone rubric but no golden set, and the backup was my PM. Handoff risk medium.

Santi: And that's a handoff you can make this week.

Kira: That's a handoff I made that week. The ones that scared me were client onboarding and pipeline review — SOP maturity one, QA coverage zero, no backup, high risk.

Santi: Those are your twelve-week projects. Not your this-week projects.

Kira: Exactly. And that distinction — what you hand off now versus what you build toward — that's what the priority score does. You multiply hours per week by two, add SOP maturity, add QA tier, subtract risk tier. The highest scores are your first three handoffs.

Santi: Okay so you've scored yourself. You've picked three handoffs. Now what? Because this is where most people stall. They delegate the task and then check in randomly — or worse, they don't check in at all and three weeks later the quality has drifted and they pull the work back.

Kira: Which is exactly what happened to me with my first VA. I handed off social scheduling, didn't set a review cadence, and two months later a client flagged that our LinkedIn posts had started sounding like a chatbot wrote them. Because a chatbot did write them — and nobody was checking.

Santi: Nobody was checking for two months?

Kira: Two months. And that's a content problem. Imagine that happening in support or sales.

Santi: So the fix is cadences. And this isn't new — EOS has the Level Ten meeting, the Four Disciplines of Execution has weekly WIG sessions, the Twelve Week Year has the Weekly Accountability Meeting. The pattern is the same everywhere. A short, fixed, weekly meeting with a scorecard and named owners.

Kira: But for async nomad teams, you can't just copy-paste a ninety-minute EOS meeting. You need three lean cadences. Thirty to sixty minutes max each.

Santi: Three meetings. First — an ops review. Scorecard, issues, decisions. You look at your five to ten measurables, flag what's red, solve the top issue, log the decision, done. Forty-five minutes.

Kira: Second — a QA wall review. This is the one most people skip and it's the one that matters most. You pull your LLM judge scores from the week, replay your golden set — that's your twenty-five to fifty reference cases that define what good looks like — and you sample five to ten percent of live outputs with a human reviewer.

Santi: And you have a stay-or-go rule. If two consecutive weeks fall below your threshold, you roll back the change and open an issue in the ops review.

Kira: Thirty minutes. That's it.

Santi: Third — a sales forecast review. Gong's guidance is weekly for most motions, intensified at period end. You inspect your top five deals by risk, check pipeline coverage against your win rate — Insight Partners says coverage should equal the inverse of your win rate, so if you close twenty-five percent, you need four X pipeline — and you log next actions.

Kira: Also thirty minutes. So total weekly cadence overhead — an hour forty-five. For three meetings that keep your handoffs from drifting.

Santi: Let's make this concrete. Three archetype paths — content ops, RevOps, support automations. Each one with before-and-after numbers.

Kira: Content ops is the one I lived. Before the scorecard, I was editing six to ten pieces a week myself. Three to four hours per piece when you count the review, the revision notes, the back-and-forth. After I handed off editing with a documented rubric and an LLM judge scoring tone and accuracy — HubSpot's data says marketers save roughly three hours per content piece with AI assistance, and that tracks with what I saw — my weekly KPIs became pieces shipped, draft-to-publish cycle time under three days, and QA pass rate above ninety percent.

Santi: And the risk?

Kira: Tone drift. Brand drift. The AI starts sounding generic. Which is why the golden set matters — you replay it weekly and catch the drift before your clients do.

Santi: What was the actual hour recovery?

Kira: Estimated six to ten hours a week. And margin lift around five to twelve percent from higher throughput — more pieces shipped without adding headcount.

Santi: RevOps is the one I'll take. I was spending — this is embarrassing — eight hours a week on CRM hygiene. Updating deal stages, enriching contacts, prepping the forecast. I built an auto-enrichment workflow in Make that handles research and field updates, set up lead routing rules, and moved the weekly forecast prep to an AI summary that I review in fifteen minutes instead of building from scratch.

Kira: What are your weekly KPIs now?

Santi: Pipeline coverage times win rate — I need that above one. Time-in-stage under seven days. And meeting-to-opportunity conversion above twenty-five percent. The Insight Partners framework gives you the math — if your win rate is twenty percent, you need five X coverage. Simple.

Kira: And the hour recovery?

Santi: Four to eight hours a week. Margin lift three to eight percent from cleaner pipeline and fewer slipped deals. But — and I want to be honest about this — the first two weeks were net negative. I spent more time setting up the automation than I saved.

Kira: You lost time?

Santi: I lost time. Week three is when it flipped. That's why the twelve-week frame matters. You're not optimizing for week one. You're optimizing for week eight.

Kira: Support automations is the big one for anyone running client-facing AI. Pupil Progress — education SaaS — took their Intercom Fin resolution rate from fifty-five to seventy-five percent in two months. ClickUp's support team reported over nine hundred hours a month saved through AI-assisted triage. Those are vendor case studies, so take the exact numbers directionally, but the pattern is real.

Santi: Okay but what does "resolved" even mean? Because Intercom defines automation rate as conversations resolved by the AI out of all new conversations. If your definition of resolved is loose — the bot answered something and the customer didn't reply — you're overcounting.

Kira: Which is exactly why you define it locally. For my agency, resolved means confirmed resolution with a CSAT score above four out of five or zero follow-up within forty-eight hours. And we track escalations separately.

Santi: That's the QA wall doing its job.

Kira: That's the QA wall. Weekly KPIs for support — deflection rate, first response time under two minutes, first contact resolution above sixty percent, CSAT above four point three. Risks are policy hallucination and edge-case misroutes. Mitigation is a golden set of fifty intents that you replay every week.

Santi: Alright, I want to raise the thing nobody wants to talk about. There's a Foxit-commissioned study — TechRadar covered it — that found the net time saved from AI was about sixteen minutes a week. Sixteen. Because the time you save generating content gets eaten by the time you spend verifying it.

Kira: That's… a depressing number.

Santi: It's a depressing number if you don't have a QA wall. If you're manually reviewing every AI output, yeah — you're just trading creation time for verification time. Net savings are minimal. But that's the whole argument for making QA coverage a first-class dimension on the scorecard. You don't verify everything manually. You build a rubric, you point an LLM judge at it, you replay your golden set weekly, and you only human-sample five to ten percent.

Kira: So the sixteen-minute number is what happens when you delegate without a system.

Santi: It's what happens when you delegate without a system. With the system — the scorecard, the cadences, the QA wall — the numbers look completely different. Not because the AI got better. Because you stopped doing the AI's job for it.

Kira: I'll push back on one thing though. The cadences themselves are overhead. You're asking someone who's already at fifty hours a week to add three meetings.

Santi: An hour forty-five total.

Kira: An hour forty-five that feels like a lot when you're drowning.

Santi: It does feel like a lot. But here's the math — if your first three handoffs recover even eight hours a week, and the cadences cost you two hours, you're net positive six hours by week four. And by week eight, when the handoffs are running clean, the cadences get shorter because you're only reviewing exceptions.

Kira: Exceptions-only agendas. That's the key. You're not reviewing everything. You're reviewing what's red.

Santi: What's red. That's it.

Kira: So here's what I want you to do this week. Not this month. This week. Score yourself. Every role you touch — SOP maturity, QA coverage, named backup, handoff risk. It takes five to ten minutes. And when you see the number, you'll know exactly which three seats to hand off first.

Santi: The Replace-Yourself Toolkit is on the Resources page — scorecard, twelve-week plan generator, cadence calendar with agendas, and a public commitment form. Fill it out, post your three handoffs somewhere your accountability partner can see them. Dominican University research found that people who wrote goals and sent weekly progress reports hit seventy-six percent completion versus forty-three percent without. So don't just plan it. Publish it.

Kira: You know what I keep thinking about? That night in Guatemala — the thing that broke me wasn't the power outage. It was realizing I'd built a business that couldn't survive me going to sleep. The scorecard fixed that. Not overnight. Over twelve weeks. But it fixed it.

Santi: And now when the power goes out in Lagos—

Kira: I sleep through it. I actually sleep through it.

Santi: Score yourself. Pick three. Run the plan. We'll see you Wednesday.

Kira: See you Wednesday.

delegationAI automationbusiness operationsdigital nomadproductivitysystemsremote workfounder toolsbusiness scalingtime management