Loading...

Private and Confidential
← Back to Guides

Control Chart & KPI Diagnostic Guide

How to separate signal from noise, catch problems early, and diagnose root causes correctly instead of guessing.

Analytics Operations

The Core Problem

ROAS drops from 50% to 44%. Is this a crisis or just noise? Without control charts, you're guessing. Teams either panic over random variance or miss real problems until they've cost thousands. Control charts tell you exactly when to act.

What is a Control Chart?

A control chart separates normal day-to-day variance (noise) from actual structural changes (signal). You calculate a baseline, set limits at Β±2 standard deviations, and watch for points that break through.

Control Chart Example: D7 ROAS
UCL (62%) Mean (50%) LCL (38%)
Normal variance
Structural break
Control limits (Β±2Οƒ)

Points inside the limits = normal. Points outside = investigate immediately. This removes gut-feel decisions and gives you an objective trigger for action.

Leading vs. Lagging Indicators

You need both types of metrics. Leading indicators catch problems fast. Lagging indicators confirm severity.

πŸš€ Leading Indicators

Signal in 24 hours

D1 Retention or D1 ROAS

If a bad update ships, these break first. Your smoke detector.

πŸ“Š Lagging Indicators

Signal in 7 days

D7 ROAS (source of truth)

Confirms profitability impact. The data point for "today" is actually the cohort from 7 days ago.

⚠️ Critical Timing Note

A product change today won't show in D7 ROAS until next week. If you only watch D7, you're reacting to problems that started 7 days ago. Always monitor D1 metrics daily.

The KPI Decomposition Tree

When ROAS drops, you need to find which input changed. Never act on a top-level metric without decomposing it first.

ROAS Decomposition
D7 ROAS
Cohort Revenue
Payers
ARPPU
Spend
Volume
CPI

ROAS = Revenue / Spend. If ROAS drops, either revenue went down, spend went up, or both. Decompose first, then drill into the branch that explains the change.

The 7-Step Diagnostic

When a control chart signals a problem, follow this exact sequence. Don't skip steps.

1

Confirm the Break

Is it real? Check for data gaps, one-time events, or day-of-week effects. If signal persists 2-3 days, proceed.

2

Timestamp It

Identify the exact date the break began. "D1 Retention dropped from 42% to 34% starting Aug 30."

3

Decompose

Calculate which component drove the change. Revenue down 24%, spend stable? Revenue explains 100% of the decline.

4

Correlate with Events

Check: Product releases near the date? Channel mix shift? Creative changes? 0-day lag = strongest signal.

5

Drill Down

Follow the decomposition to its end. Revenue down β†’ Payers down β†’ Conversion down β†’ Why?

6

Form Hypothesis

Good: "v1.8.4 physics changes reduced D7 conversion by 15%." Bad: "ROAS is down, let's try another network."

7

Remediate Root Cause

Product issue β†’ escalate to product. Creative fatigue β†’ refresh assets. Market pressure β†’ adjust bids.

Case Study: The Wrong Diagnosis

πŸ“‹ Slam Clash (Aug-Nov 2024)

Aug 29 v1.8.4 deploys with physics changes. D1 Retention drops from 42% to 34%.
Sep 5 D7 ROAS confirms: 52% vs 71% baseline. Channel mix unchanged (98% primary network).
Sep-Nov Team assumes channel is the problem. Shifts budget to secondary network. Same poor ROAS.

The smoking gun: Zero-day lag between release and break. Channel didn't change. Product broke.

❌ What went wrong

Without decomposing or checking version correlation, the team blamed the channel. The secondary network showed the same bad ROASβ€”because the product was broken, not the channel. Misdiagnosis delayed the fix by months.

Quick Reference

βœ“ DO

  • Monitor D1 metrics daily
  • Decompose before acting
  • Timestamp breaks precisely
  • Check product releases first
  • Use 7-day rolling averages
  • Wait 2-3 days to confirm signal

βœ— DON'T

  • React to single-day variance
  • Change channels to fix product
  • Skip the decomposition step
  • Wait for D7 to act on D1 signals
  • Assume channel is the problem
  • Ignore version release dates

Getting Started

  1. Set up two control charts β€” D1 Retention (leading) and D7 ROAS (lagging)
  2. Establish your baseline β€” Use 4-8 weeks of stable data to calculate mean and Β±2Οƒ limits
  3. Create a version log β€” Track all product releases with dates
  4. Check daily β€” Review D1 metrics every morning. If outside limits, start the diagnostic.
  5. Document findings β€” Build institutional knowledge of what breaks what

The time you invest in this system pays off massively. You'll catch problems in 24 hours instead of weeks, and fix the right thing instead of chasing ghosts.

✍️ Select text to comment
βž• Add Comment

πŸ’¬ Comments

πŸ’¬

No comments yet.

Select text to add the first comment.

Add Comment