From b6d878730ae47c8b298c2377db34a3c8e203d01e Mon Sep 17 00:00:00 2001
From: Sam Gutentag <1404219+samgutentag@users.noreply.github.com>
Date: Fri, 12 Jun 2026 10:26:21 -0700
Subject: [PATCH 1/8] docs(flaky-tests): add "Alert When a Test Escalates"
 webhook recipe

Documents the test-escalation user story from the connectors Slack
thread: how to get Slack alerts when a test gets worse, not just on
first detection.

- New recipe page covering the v2.test_case.status_changed (overall
  health transitions) vs test_case.monitor_status_changed (per-monitor
  activations) distinction, with transform snippets for the
  classify-as-broken and apply-a-label forks.
- Cross-link section in the Slack integration guide.
- Card on the webhooks index + nav entry in docs.json.

All examples use the v2 event schema; the legacy v1 event is not documented.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
---
 docs.json                                     |   1 +
 .../webhooks/alert-on-test-escalation.mdx     | 117 ++++++++++++++++++
 flaky-tests/webhooks/index.mdx                |   5 +
 flaky-tests/webhooks/slack-integration.mdx    |   8 ++
 4 files changed, 131 insertions(+)
 create mode 100644 flaky-tests/webhooks/alert-on-test-escalation.mdx
diff --git a/docs.json b/docs.json
index e16adc7d..5b365bd1 100644
--- a/docs.json
+++ b/docs.json
@@ -295,6 +295,7 @@
                 "group": "Webhooks",
                 "root": "flaky-tests/webhooks/index",
                 "pages": [
+                  "flaky-tests/webhooks/alert-on-test-escalation",
                   "flaky-tests/webhooks/slack-integration",
                   "flaky-tests/webhooks/microsoft-teams-integration",
                   "flaky-tests/webhooks/github-issues-integration",
diff --git a/flaky-tests/webhooks/alert-on-test-escalation.mdx b/flaky-tests/webhooks/alert-on-test-escalation.mdx
new file mode 100644
index 00000000..52a15ab5
--- /dev/null
+++ b/flaky-tests/webhooks/alert-on-test-escalation.mdx
@@ -0,0 +1,117 @@
+---
+title: "Alert When a Test Escalates"
+description: "Send Slack alerts when a test gets worse, not just the first time it's flagged"
+og:title: "Alerting on flaky test escalation with Trunk webhooks"
+---
+A single "this test is now flaky" alert tells you a test crossed a threshold once. It doesn't tell you when that same test keeps getting worse — failing on more branches, tripping additional monitors, or degrading from flaky to a consistently broken regression. For tests that matter, you want to hear about the escalation, not just the first detection.
+
+This page shows how to wire that up with Trunk webhooks and a Slack transformation. It builds on the [Slack integration guide](./slack-integration) — set that connection up first, then come back here to filter it down to escalations.
+
+## Pick the right event
+
+The key decision is which event you subscribe to, because two different events fire at two different granularities.
+
+| Event | Fires when | Use it to |
+|---|---|---|
+| [`v2.test_case.status_changed`](./index) | The test's **overall health status** transitions between `HEALTHY`, `FLAKY`, and `BROKEN` | Alert on health escalations like `FLAKY` → `BROKEN` |
+| [`test_case.monitor_status_changed`](./index) | **Any individual monitor** activates or resolves for the test | Alert every time a monitor flags the test, even if its overall status doesn't move |
+
+The distinction matters. `v2.test_case.status_changed` only fires when the test's combined status changes. If a test is already `FLAKY` and a second monitor starts flagging it, the overall status stays `FLAKY`, so no `v2.test_case.status_changed` event is sent. To catch a test getting flagged by more monitors over time — the "more than just the first detection" case — subscribe to `test_case.monitor_status_changed` instead.
+
+<Note>
+Test status priority is **Broken > Flaky > Healthy**. A test flagged by both a broken-type and a flaky-type monitor shows as `BROKEN` until the broken monitor resolves. See [Flake Detection](../detection/) for how the combined status is calculated.
+</Note>
+
+## Alert when a test becomes broken
+
+Use this when you want a louder, separate signal for tests that have degraded into consistent failures, distinct from routine flakiness.
+
+**1. Configure a broken-type monitor.** A test only reaches `BROKEN` status when a [failure rate](../detection/failure-rate-monitor) or [failure count](../detection/failure-count-monitor) monitor with its **Detection type** set to **Broken** is active for it. Set one up if you haven't already. A common pattern is to pair a broken-type monitor (catching consistently failing tests) with a flaky-type monitor (catching intermittent ones).
+
+**2. Filter the transformation to escalations.** In your Slack endpoint's transformation, cancel the webhook unless the status got worse. This example ranks the three statuses and only sends a message when `new_status` is more severe than `previous_status`, so recoveries and resolutions stay quiet:
+
+```javascript
+const SEVERITY = { HEALTHY: 0, FLAKY: 1, BROKEN: 2 };
+
+function handler(webhook) {
+  const { previous_status = "HEALTHY", new_status = "HEALTHY" } = webhook.payload;
+
+  // Only alert when the test got worse, not when it recovered.
+  if (SEVERITY[new_status] <= SEVERITY[previous_status]) {
+    webhook.cancel = true;
+    return webhook;
+  }
+
+  webhook.payload = summarizeTestCase(webhook.payload);
+  return webhook;
+}
+```
+
+To alert *only* when a test reaches the broken state — and stay silent on first-time flaky detections — gate on the new status directly instead:
+
+```javascript
+function handler(webhook) {
+  if (webhook.payload.new_status !== "BROKEN") {
+    webhook.cancel = true;
+    return webhook;
+  }
+
+  webhook.payload = summarizeTestCase(webhook.payload);
+  return webhook;
+}
+```
+
+Reuse the `summarizeTestCase` helper from the [Slack integration guide](./slack-integration#id-2.-customize-your-transformation) to format the message body. The `previous_status → new_status` line in that template makes the escalation obvious in the channel.
+
+## Alert every time a monitor flags a test
+
+Use this when you want to know about every detection event on a test, including the ones that don't change its overall status — a second monitor piling on, or a labeling monitor surfacing a new pattern.
+
+**1. Subscribe to `test_case.monitor_status_changed`.** On your Slack endpoint, enable this event in addition to (or instead of) `v2.test_case.status_changed`.
+
+**2. Filter to monitor activations.** The event fires on both activation and resolution, so cancel the webhook unless a monitor is becoming active:
+
+```javascript
+function handler(webhook) {
+  const { monitor } = webhook.payload;
+
+  // Only alert when a monitor starts flagging the test.
+  if (!monitor || monitor.status !== "active") {
+    webhook.cancel = true;
+    return webhook;
+  }
+
+  webhook.payload = {
+    blocks: [
+      {
+        type: "header",
+        text: { type: "plain_text", text: `Monitor active: ${webhook.payload.test_case.name}` },
+      },
+      {
+        type: "section",
+        text: {
+          type: "mrkdwn",
+          text: [
+            `Monitor type: \`${monitor.type}\``,
+            `Test Details: ${webhook.payload.test_case.html_url}`,
+          ].join("\n"),
+        },
+      },
+    ],
+  };
+  return webhook;
+}
+```
+
+Because `test_case.monitor_status_changed` fires for every monitor independently, this catches a test that keeps tripping new monitors over time, even while its headline status stays `FLAKY`. The `monitor.type` field tells you which monitor fired, so you can branch on it — for example, route [labeling monitors](../management/test-labels#automatic-labeling-from-monitors) to a triage channel and health classification monitors to your on-call channel.
+
+<Tip>
+Prefer labels over a separate broken classification when you want to triage a pattern without changing a test's health status. Configure a monitor's action as **Apply labels**, then filter `test_case.monitor_status_changed` on `monitor.type` to route those activations wherever they belong. See [Test Labels](../management/test-labels) for the full setup.
+</Tip>
+
+## Related
+
+- [Integration for Slack](./slack-integration) — set up the Slack connection these transformations build on
+- [Webhooks](./index) — the full event catalog and field reference
+- [Flake Detection](../detection/) — how monitors classify tests as flaky or broken
+- [Test Labels](../management/test-labels) — apply and route labels with monitors
diff --git a/flaky-tests/webhooks/index.mdx b/flaky-tests/webhooks/index.mdx
index 0a64f74e..efe6f76b 100644
--- a/flaky-tests/webhooks/index.mdx
+++ b/flaky-tests/webhooks/index.mdx
@@ -88,6 +88,11 @@ Emitted when an AI-powered flaky test analysis finishes for a test case.
 You can also find guides for specific examples here:
 
 <Columns cols={2}>
+  <Card
+    title="Alert When a Test Escalates"
+    href="./alert-on-test-escalation"
+  />
+
   <Card
     title="Send a Slack Message"
     href="./slack-integration"
diff --git a/flaky-tests/webhooks/slack-integration.mdx b/flaky-tests/webhooks/slack-integration.mdx
index 2153a212..f3021555 100644
--- a/flaky-tests/webhooks/slack-integration.mdx
+++ b/flaky-tests/webhooks/slack-integration.mdx
@@ -147,6 +147,14 @@ You can see a list of past delivery attempts in the **Message Attempts** modal.
 ![](/assets/flaky-tests/webhooks/example-webhook-logs.png)
 </Frame>
 
+## Alert only when a test gets worse
+
+By default this connection alerts on every status change. If you'd rather hear about a test only when it **escalates** — degrading to broken, or tripping more monitors over time — filter the transformation on the status transition instead of sending every event.
+
+<Card title="Alert When a Test Escalates" href="./alert-on-test-escalation" horizontal>
+  Send Slack alerts when a test gets worse, not just the first time it's flagged.
+</Card>
+
 ## Congratulations!
 
 You should now receive notifications in your Slack workspace when a test's status changes. You can further modify your transformation script to customize your messages.

From 666cc815f80b8dbb57260a5ad774a826a880e001 Mon Sep 17 00:00:00 2001
From: Sam Gutentag <1404219+samgutentag@users.noreply.github.com>
Date: Fri, 12 Jun 2026 10:41:21 -0700
Subject: [PATCH 2/8] docs(flaky-tests): use Info callout for status-priority
 background

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
---
 flaky-tests/webhooks/alert-on-test-escalation.mdx | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/flaky-tests/webhooks/alert-on-test-escalation.mdx b/flaky-tests/webhooks/alert-on-test-escalation.mdx
index 52a15ab5..ba62d241 100644
--- a/flaky-tests/webhooks/alert-on-test-escalation.mdx
+++ b/flaky-tests/webhooks/alert-on-test-escalation.mdx
@@ -18,9 +18,9 @@ The key decision is which event you subscribe to, because two different events f
 
 The distinction matters. `v2.test_case.status_changed` only fires when the test's combined status changes. If a test is already `FLAKY` and a second monitor starts flagging it, the overall status stays `FLAKY`, so no `v2.test_case.status_changed` event is sent. To catch a test getting flagged by more monitors over time — the "more than just the first detection" case — subscribe to `test_case.monitor_status_changed` instead.
 
-<Note>
+<Info>
 Test status priority is **Broken > Flaky > Healthy**. A test flagged by both a broken-type and a flaky-type monitor shows as `BROKEN` until the broken monitor resolves. See [Flake Detection](../detection/) for how the combined status is calculated.
-</Note>
+</Info>
 
 ## Alert when a test becomes broken
 

From dab45d3c7350e7f3464efbad45106061183607de Mon Sep 17 00:00:00 2001
From: Sam Gutentag <1404219+samgutentag@users.noreply.github.com>
Date: Fri, 12 Jun 2026 10:53:46 -0700
Subject: [PATCH 3/8] docs(flaky-tests): move escalation recipe into new
 Recipes group

Seed a Flaky Tests > Recipes nav group with the escalation-alert page as
its first entry. It's a process/pattern doc, not a connector reference,
so it reads better as a recipe than under Webhooks. Webhooks keeps a
cross-link card. Quarantine-recipes (#59) and monitor-tuning (#53) are
the planned next entries.

- git mv flaky-tests/webhooks/alert-on-test-escalation.mdx -> flaky-tests/recipes/
- new Recipes group in docs.json after Webhooks
- fixed relative links (./slack-integration, ./index -> ../webhooks/...)

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
---
 docs.json                                            |  7 ++++++-
 .../alert-on-test-escalation.mdx                     | 12 ++++++------
 flaky-tests/webhooks/index.mdx                       |  2 +-
 flaky-tests/webhooks/slack-integration.mdx           |  2 +-
 4 files changed, 14 insertions(+), 9 deletions(-)
 rename flaky-tests/{webhooks => recipes}/alert-on-test-escalation.mdx (85%)

diff --git a/docs.json b/docs.json
index 5b365bd1..6fe27a28 100644
--- a/docs.json
+++ b/docs.json
@@ -295,7 +295,6 @@
                 "group": "Webhooks",
                 "root": "flaky-tests/webhooks/index",
                 "pages": [
-                  "flaky-tests/webhooks/alert-on-test-escalation",
                   "flaky-tests/webhooks/slack-integration",
                   "flaky-tests/webhooks/microsoft-teams-integration",
                   "flaky-tests/webhooks/github-issues-integration",
@@ -303,6 +302,12 @@
                   "flaky-tests/webhooks/jira-integration"
                 ]
               },
+              {
+                "group": "Recipes",
+                "pages": [
+                  "flaky-tests/recipes/alert-on-test-escalation"
+                ]
+              },
               {
                 "group": "Agents",
                 "root": "flaky-tests/agents/index",
diff --git a/flaky-tests/webhooks/alert-on-test-escalation.mdx b/flaky-tests/recipes/alert-on-test-escalation.mdx
similarity index 85%
rename from flaky-tests/webhooks/alert-on-test-escalation.mdx
rename to flaky-tests/recipes/alert-on-test-escalation.mdx
index ba62d241..812d8b1a 100644
--- a/flaky-tests/webhooks/alert-on-test-escalation.mdx
+++ b/flaky-tests/recipes/alert-on-test-escalation.mdx
@@ -5,7 +5,7 @@ og:title: "Alerting on flaky test escalation with Trunk webhooks"
 ---
 A single "this test is now flaky" alert tells you a test crossed a threshold once. It doesn't tell you when that same test keeps getting worse — failing on more branches, tripping additional monitors, or degrading from flaky to a consistently broken regression. For tests that matter, you want to hear about the escalation, not just the first detection.
 
-This page shows how to wire that up with Trunk webhooks and a Slack transformation. It builds on the [Slack integration guide](./slack-integration) — set that connection up first, then come back here to filter it down to escalations.
+This page shows how to wire that up with Trunk webhooks and a Slack transformation. It builds on the [Slack integration guide](../webhooks/slack-integration) — set that connection up first, then come back here to filter it down to escalations.
 
 ## Pick the right event
 
@@ -13,8 +13,8 @@ The key decision is which event you subscribe to, because two different events f
 
 | Event | Fires when | Use it to |
 |---|---|---|
-| [`v2.test_case.status_changed`](./index) | The test's **overall health status** transitions between `HEALTHY`, `FLAKY`, and `BROKEN` | Alert on health escalations like `FLAKY` → `BROKEN` |
-| [`test_case.monitor_status_changed`](./index) | **Any individual monitor** activates or resolves for the test | Alert every time a monitor flags the test, even if its overall status doesn't move |
+| [`v2.test_case.status_changed`](../webhooks/index) | The test's **overall health status** transitions between `HEALTHY`, `FLAKY`, and `BROKEN` | Alert on health escalations like `FLAKY` → `BROKEN` |
+| [`test_case.monitor_status_changed`](../webhooks/index) | **Any individual monitor** activates or resolves for the test | Alert every time a monitor flags the test, even if its overall status doesn't move |
 
 The distinction matters. `v2.test_case.status_changed` only fires when the test's combined status changes. If a test is already `FLAKY` and a second monitor starts flagging it, the overall status stays `FLAKY`, so no `v2.test_case.status_changed` event is sent. To catch a test getting flagged by more monitors over time — the "more than just the first detection" case — subscribe to `test_case.monitor_status_changed` instead.
 
@@ -61,7 +61,7 @@ function handler(webhook) {
 }
 ```
 
-Reuse the `summarizeTestCase` helper from the [Slack integration guide](./slack-integration#id-2.-customize-your-transformation) to format the message body. The `previous_status → new_status` line in that template makes the escalation obvious in the channel.
+Reuse the `summarizeTestCase` helper from the [Slack integration guide](../webhooks/slack-integration#id-2.-customize-your-transformation) to format the message body. The `previous_status → new_status` line in that template makes the escalation obvious in the channel.
 
 ## Alert every time a monitor flags a test
 
@@ -111,7 +111,7 @@ Prefer labels over a separate broken classification when you want to triage a pa
 
 ## Related
 
-- [Integration for Slack](./slack-integration) — set up the Slack connection these transformations build on
-- [Webhooks](./index) — the full event catalog and field reference
+- [Integration for Slack](../webhooks/slack-integration) — set up the Slack connection these transformations build on
+- [Webhooks](../webhooks/index) — the full event catalog and field reference
 - [Flake Detection](../detection/) — how monitors classify tests as flaky or broken
 - [Test Labels](../management/test-labels) — apply and route labels with monitors
diff --git a/flaky-tests/webhooks/index.mdx b/flaky-tests/webhooks/index.mdx
index efe6f76b..902e71fb 100644
--- a/flaky-tests/webhooks/index.mdx
+++ b/flaky-tests/webhooks/index.mdx
@@ -90,7 +90,7 @@ You can also find guides for specific examples here:
 <Columns cols={2}>
   <Card
     title="Alert When a Test Escalates"
-    href="./alert-on-test-escalation"
+    href="../recipes/alert-on-test-escalation"
   />
 
   <Card
diff --git a/flaky-tests/webhooks/slack-integration.mdx b/flaky-tests/webhooks/slack-integration.mdx
index f3021555..59e5b165 100644
--- a/flaky-tests/webhooks/slack-integration.mdx
+++ b/flaky-tests/webhooks/slack-integration.mdx
@@ -151,7 +151,7 @@ You can see a list of past delivery attempts in the **Message Attempts** modal.
 
 By default this connection alerts on every status change. If you'd rather hear about a test only when it **escalates** — degrading to broken, or tripping more monitors over time — filter the transformation on the status transition instead of sending every event.
 
-<Card title="Alert When a Test Escalates" href="./alert-on-test-escalation" horizontal>
+<Card title="Alert When a Test Escalates" href="../recipes/alert-on-test-escalation" horizontal>
   Send Slack alerts when a test gets worse, not just the first time it's flagged.
 </Card>
 

From 27dcdfb89eedceb311c2feedec618ea46dfd967a Mon Sep 17 00:00:00 2001
From: Sam Gutentag <1404219+samgutentag@users.noreply.github.com>
Date: Mon, 15 Jun 2026 09:17:05 -0700
Subject: [PATCH 4/8] docs(flaky-tests): warn that broken classification
 un-quarantines flaky tests

Addresses Tyler's PR #249 review:
- Clarify the transform snippets are drop-in replacements for the Slack
  guide's handler and depend on its summarizeTestCase helper staying in
  the transformation.
- Add a Warning that classifying a test as broken changes its health
  status, dropping a flaky+auto-quarantined test out of auto-quarantine
  (broken tests aren't quarantine candidates) so it blocks CI again.
  Labeling monitors avoid this; manually quarantined tests are unaffected.
- Tie the label Tip to the quarantine tradeoff.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
---
 flaky-tests/recipes/alert-on-test-escalation.mdx | 10 ++++++++--
 1 file changed, 8 insertions(+), 2 deletions(-)

diff --git a/flaky-tests/recipes/alert-on-test-escalation.mdx b/flaky-tests/recipes/alert-on-test-escalation.mdx
index 812d8b1a..055dec56 100644
--- a/flaky-tests/recipes/alert-on-test-escalation.mdx
+++ b/flaky-tests/recipes/alert-on-test-escalation.mdx
@@ -61,7 +61,13 @@ function handler(webhook) {
 }
 ```
 
-Reuse the `summarizeTestCase` helper from the [Slack integration guide](../webhooks/slack-integration#id-2.-customize-your-transformation) to format the message body. The `previous_status → new_status` line in that template makes the escalation obvious in the channel.
+Both snippets replace the `handler` function from the [Slack integration guide](../webhooks/slack-integration#id-2.-customize-your-transformation); keep that guide's `summarizeTestCase` helper in the same transformation so the message body still renders. Its `previous_status → new_status` line makes the escalation obvious in the channel.
+
+<Warning>
+Classifying a test as **broken** changes its health status, and that can change quarantine behavior. Auto-quarantine applies only to tests with a **Flaky** status, so when a broken-type monitor flags a test that was auto-quarantined as flaky, the test becomes `BROKEN`, drops out of the auto-quarantine set, and its failures start blocking CI again. This is by design — a broken test is a real regression, not a flake to skip — but it means a broken classification is not a side-effect-free way to get an escalation alert.
+
+If you want the escalation signal *without* touching quarantine, use a **labeling** monitor instead (see [Alert every time a monitor flags a test](#alert-every-time-a-monitor-flags-a-test) below). Labels don't change health status, so an auto-quarantined test stays quarantined. Manually quarantined tests are unaffected either way. See [Quarantining](../quarantining/) and [Flake Detection](../detection/) for the full composite-status behavior.
+</Warning>
 
 ## Alert every time a monitor flags a test
 
@@ -106,7 +112,7 @@ function handler(webhook) {
 Because `test_case.monitor_status_changed` fires for every monitor independently, this catches a test that keeps tripping new monitors over time, even while its headline status stays `FLAKY`. The `monitor.type` field tells you which monitor fired, so you can branch on it — for example, route [labeling monitors](../management/test-labels#automatic-labeling-from-monitors) to a triage channel and health classification monitors to your on-call channel.
 
 <Tip>
-Prefer labels over a separate broken classification when you want to triage a pattern without changing a test's health status. Configure a monitor's action as **Apply labels**, then filter `test_case.monitor_status_changed` on `monitor.type` to route those activations wherever they belong. See [Test Labels](../management/test-labels) for the full setup.
+Prefer labels over a broken classification when you want to triage a pattern without changing a test's health status — and, as noted above, without disturbing auto-quarantine. Configure a monitor's action as **Apply labels**, then filter `test_case.monitor_status_changed` on `monitor.type` to route those activations wherever they belong. See [Test Labels](../management/test-labels) for the full setup.
 </Tip>
 
 ## Related

From 3c445d7d53d697aa18bdc1c85cb7e4f95c53b0ee Mon Sep 17 00:00:00 2001
From: Sam Gutentag <1404219+samgutentag@users.noreply.github.com>
Date: Mon, 15 Jun 2026 09:47:10 -0700
Subject: [PATCH 5/8] docs(flaky-tests): note summarizeTestCase source +
 uppercase statuses

Resolves the remaining part of Tyler's PR #249 review (transform validity):
- Inline comment in both status snippets noting summarizeTestCase() lives
  in the Slack integration guide, so copy-pasting a single block doesn't
  fail with an unexplained ReferenceError.
- Comment on the SEVERITY map noting status values are uppercase.

Validated with a local Node harness against the real v2 + monitor payloads
(16/16): handlers send/cancel correctly, and the casing experiment confirms
lowercasing the comparisons silently breaks gating.
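
For reviewers, a minimal sketch of what one casing check in such a
harness could look like. It is not the actual harness: the
summarizeTestCase stub and the sends() helper are hypothetical, and the
payloads are hand-written rather than real v2 payloads. It shows one way
the failure can surface: with lowercase statuses the SEVERITY lookups
return undefined, the comparison evaluates to false, and nothing is
cancelled.

```javascript
// Status values are uppercase (HEALTHY, FLAKY, BROKEN), matching the payload.
const SEVERITY = { HEALTHY: 0, FLAKY: 1, BROKEN: 2 };

// Hypothetical stand-in for the Slack guide's helper, stubbed so this runs alone.
function summarizeTestCase(payload) {
  return { text: `${payload.previous_status} -> ${payload.new_status}` };
}

// Same gating logic as the recipe's escalation handler.
function handler(webhook) {
  const { previous_status = "HEALTHY", new_status = "HEALTHY" } = webhook.payload;

  if (SEVERITY[new_status] <= SEVERITY[previous_status]) {
    webhook.cancel = true;
    return webhook;
  }

  webhook.payload = summarizeTestCase(webhook.payload);
  return webhook;
}

// Hypothetical helper: true when the handler would deliver the webhook.
function sends(previous_status, new_status) {
  const result = handler({ payload: { previous_status, new_status } });
  return result.cancel !== true;
}

console.log(sends("FLAKY", "BROKEN"));   // escalation with uppercase statuses
console.log(sends("BROKEN", "HEALTHY")); // recovery with uppercase statuses
console.log(sends("broken", "healthy")); // same recovery, lowercase statuses
```

In this sketch the lowercase recovery is delivered instead of cancelled,
and no error is raised, which is why the snippets carry the casing
comment.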

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
---
 flaky-tests/recipes/alert-on-test-escalation.mdx | 3 +++
 1 file changed, 3 insertions(+)

diff --git a/flaky-tests/recipes/alert-on-test-escalation.mdx b/flaky-tests/recipes/alert-on-test-escalation.mdx
index 055dec56..354fecd0 100644
--- a/flaky-tests/recipes/alert-on-test-escalation.mdx
+++ b/flaky-tests/recipes/alert-on-test-escalation.mdx
@@ -31,6 +31,7 @@ Use this when you want a louder, separate signal for tests that have degraded in
 **2. Filter the transformation to escalations.** In your Slack endpoint's transformation, cancel the webhook unless the status got worse. This example ranks the three statuses and only sends a message when `new_status` is more severe than `previous_status`, so recoveries and resolutions stay quiet:
 
 ```javascript
+// Status values are uppercase (HEALTHY, FLAKY, BROKEN), matching the payload.
 const SEVERITY = { HEALTHY: 0, FLAKY: 1, BROKEN: 2 };
 
 function handler(webhook) {
@@ -42,6 +43,7 @@ function handler(webhook) {
     return webhook;
   }
 
+  // summarizeTestCase() is defined in the Slack integration guide.
   webhook.payload = summarizeTestCase(webhook.payload);
   return webhook;
 }
@@ -56,6 +58,7 @@ function handler(webhook) {
     return webhook;
   }
 
+  // summarizeTestCase() is defined in the Slack integration guide.
   webhook.payload = summarizeTestCase(webhook.payload);
   return webhook;
 }

From b6b4b5532c9670a0eec782fb9d9f81f73c4cc589 Mon Sep 17 00:00:00 2001
From: Sam Gutentag <1404219+samgutentag@users.noreply.github.com>
Date: Mon, 15 Jun 2026 10:21:35 -0700
Subject: [PATCH 6/8] docs(flaky-tests): sam-style pass + embed two animated
 diagrams

- Voice/clarity pass on the escalation recipe, remove all em dashes.
- Add two standalone animated SVGs (CSS keyframes, reduced-motion safe):
  - event-granularity-gap: HEALTHY->FLAKY->FLAKY across three columns,
    showing status_changed stays silent on the second monitor while
    monitor_status_changed fires on both.
  - broken-classification-quarantine: a broken classification drops a
    flaky auto-quarantined test out of quarantine and re-blocks CI.
- Embed both via <Frame> in the recipe.

Transforms validated end to end: trunk2 source, a Node harness (16/16),
and Svix Run Test on a play.svix.com test endpoint.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
---
 .../broken-classification-quarantine.svg      | 67 ++++++++++++++
 .../recipes/event-granularity-gap.svg         | 91 +++++++++++++++++++
 .../recipes/alert-on-test-escalation.mdx      | 36 +++++---
 3 files changed, 180 insertions(+), 14 deletions(-)
 create mode 100644 assets/flaky-tests/recipes/broken-classification-quarantine.svg
 create mode 100644 assets/flaky-tests/recipes/event-granularity-gap.svg

diff --git a/assets/flaky-tests/recipes/broken-classification-quarantine.svg b/assets/flaky-tests/recipes/broken-classification-quarantine.svg
new file mode 100644
index 00000000..7fa0ffd9
--- /dev/null
+++ b/assets/flaky-tests/recipes/broken-classification-quarantine.svg
@@ -0,0 +1,67 @@
+<svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 660 228" width="660" height="228" role="img" aria-label="A flaky test that is auto-quarantined, with CI passing because its failure is ignored. A broken-type monitor fires and reclassifies the test as BROKEN. Because broken tests are not quarantine candidates, the test drops out of auto-quarantine and its failures block CI again. The animation runs the broken-type monitor, then highlights the after state on the right.">
+  <!--
+    Concept: classifying a test as broken can un-quarantine it (the quarantine side effect).
+    Before (left): FLAKY, auto-quarantined, CI passes. A broken-type monitor fires (middle scan
+    bar). After (right): BROKEN, not a quarantine candidate, CI blocked.
+
+    TIMING (6.5s loop, one scan = 0.5s = 7.7%):
+      monitor scan   10% to 17.7%   then "fired" (red)
+      after-state glow pulse on the right box at completion
+      finished frame HOLDS 18% to 100% (about 5.3s linger)
+    (Note: write "to" for ranges; a double hyphen is illegal inside an XML comment.)
+  -->
+  <style>
+    .bar { transform-box: fill-box; transform-origin: left center; }
+    .bar2 { animation: brkBar 6.5s linear infinite; }
+    .glow2 { animation: brkGlow 6.5s linear infinite; }
+    .flow { animation: brkFlow 6.5s linear infinite; }
+    @keyframes brkBar { 0%,10% { transform: scaleX(0); fill: #e8932a; } 17.7% { transform: scaleX(1); fill: #e8932a; } 18.7%,100% { transform: scaleX(1); fill: #d64545; } }
+    @keyframes brkGlow { 0%,17% { opacity: 0; } 19.5% { opacity: 0.42; } 30%,100% { opacity: 0; } }
+    @keyframes brkFlow { 0% { stroke-dashoffset: 16; } 18%,100% { stroke-dashoffset: 0; } }
+    @media (prefers-reduced-motion: reduce) {
+      .bar2, .glow2, .flow { animation: none; }
+      .bar { transform: scaleX(1); fill: #d64545; }
+      .glow2 { opacity: 0; }
+      .flow { stroke-dashoffset: 0; }
+    }
+  </style>
+
+  <rect x="0" y="0" width="660" height="228" rx="10" fill="#ffffff"/>
+  <text x="16" y="26" font-family="ui-sans-serif, system-ui, sans-serif" font-size="12.5" font-weight="700" fill="#3a4350">Classifying a test as broken can un-quarantine it</text>
+
+  <g font-family="ui-sans-serif, system-ui, sans-serif">
+    <!-- after-state glow (transient highlight) -->
+    <rect class="glow2" x="436" y="54" width="204" height="122" rx="10" fill="#d64545" opacity="0"/>
+
+    <!-- BEFORE box -->
+    <text x="24" y="50" font-size="11" font-weight="700" fill="#5b6573">Flaky + auto-quarantined</text>
+    <rect x="24" y="58" width="196" height="118" rx="8" fill="#f7f9fc" stroke="#9aa3b0" stroke-width="1"/>
+    <rect x="40" y="72" width="164" height="28" rx="6" fill="#e3edfc" stroke="#346DDB" stroke-width="1.5"/>
+    <text x="122" y="90" text-anchor="middle" font-size="11.5" font-weight="700" fill="#1f5fc0">Status: FLAKY</text>
+    <rect x="40" y="106" width="164" height="28" rx="6" fill="#eee7fb" stroke="#7c5cdb" stroke-width="1.5"/>
+    <text x="122" y="124" text-anchor="middle" font-size="11.5" fill="#5a3fb0">Auto-quarantined</text>
+    <rect x="40" y="140" width="164" height="28" rx="6" fill="#e4f5ea" stroke="#1d9b54" stroke-width="1.5"/>
+    <text x="122" y="158" text-anchor="middle" font-size="10.5" fill="#1d7a4d">CI passes (failure ignored)</text>
+
+    <!-- MIDDLE trigger -->
+    <text x="330" y="92" text-anchor="middle" font-size="11" fill="#5b6573">broken-type monitor fires</text>
+    <line class="flow" x1="248" y1="117" x2="424" y2="117" stroke="#9aa3b0" stroke-width="2" stroke-dasharray="8 4"/>
+    <polygon points="424,112 434,117 424,122" fill="#9aa3b0"/>
+    <rect x="252" y="128" width="160" height="6" rx="3" fill="#d9dee6"/>
+    <rect class="bar bar2" x="252" y="128" width="160" height="6" rx="3" fill="#e8932a"/>
+
+    <!-- AFTER box -->
+    <text x="440" y="50" font-size="11" font-weight="700" fill="#5b6573">Reclassified as broken</text>
+    <rect x="440" y="58" width="196" height="118" rx="8" fill="#f7f9fc" stroke="#9aa3b0" stroke-width="1"/>
+    <rect x="456" y="72" width="164" height="28" rx="6" fill="#fde7e7" stroke="#d64545" stroke-width="1.5"/>
+    <text x="538" y="90" text-anchor="middle" font-size="11.5" font-weight="700" fill="#b22222">Status: BROKEN</text>
+    <rect x="456" y="106" width="164" height="28" rx="6" fill="#eef1f5" stroke="#9aa3b0" stroke-width="1.5"/>
+    <text x="538" y="124" text-anchor="middle" font-size="10" fill="#5b6573">Not a quarantine candidate</text>
+    <rect x="456" y="140" width="164" height="28" rx="6" fill="#fde7e7" stroke="#d64545" stroke-width="1.5"/>
+    <text x="538" y="158" text-anchor="middle" font-size="10.5" fill="#b22222">CI blocked (failure counts)</text>
+
+    <!-- caption -->
+    <text x="16" y="198" font-size="11.5" fill="#5b6573">Broken tests are not quarantine candidates, so the test drops out of auto-quarantine</text>
+    <text x="16" y="214" font-size="11.5" fill="#5b6573">and its failures block CI again. Manually quarantined tests are unaffected.</text>
+  </g>
+</svg>
diff --git a/assets/flaky-tests/recipes/event-granularity-gap.svg b/assets/flaky-tests/recipes/event-granularity-gap.svg
new file mode 100644
index 00000000..03d77f37
--- /dev/null
+++ b/assets/flaky-tests/recipes/event-granularity-gap.svg
@@ -0,0 +1,91 @@
+<svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 660 250" width="660" height="250" role="img" aria-label="A test starts HEALTHY. Monitor A fires and the test goes HEALTHY to FLAKY, so v2.test_case.status_changed sends an event and test_case.monitor_status_changed sends one too. Then Monitor B fires while the test is already FLAKY, so the overall status stays FLAKY: v2.test_case.status_changed sends nothing, but test_case.monitor_status_changed still sends an event. The animation steps through Monitor A then Monitor B, highlighting that status_changed misses Monitor B's activation while monitor_status_changed catches it.">
+  <!--
+    Concept: the two events fire at different granularities (event-granularity gap).
+    Three time columns: Healthy (baseline), Monitor A fires (HEALTHY to FLAKY),
+    Monitor B fires (already FLAKY). Three lanes: overall Test status,
+    v2.test_case.status_changed, test_case.monitor_status_changed.
+    Teaching: status_changed is silent on Monitor B's activation; monitor_status_changed is not.
+
+    TIMING (7s loop, one scan = 0.5s = 7.14%):
+      column 1 (Healthy) static baseline, present from the start
+      Monitor A scan   0% to 7.14%   then "fired" (blue), glow pulse on column 2
+      pause
+      Monitor B scan  14% to 21.14%  then "fired" (blue), glow pulse on column 3
+      finished frame HOLDS 21% to 100% (about 5.5s linger)
+    (Note: write "to" rather than a double hyphen, which is illegal inside an XML comment.)
+  -->
+  <style>
+    .bar { transform-box: fill-box; transform-origin: left center; }
+    .bar2 { animation: gapBar2 7s linear infinite; }
+    .bar3 { animation: gapBar3 7s linear infinite; }
+    .glow2 { animation: gapGlow2 7s linear infinite; }
+    .glow3 { animation: gapGlow3 7s linear infinite; }
+    @keyframes gapBar2 { 0% { transform: scaleX(0); fill: #e8932a; } 7.14% { transform: scaleX(1); fill: #e8932a; } 8.14%,100% { transform: scaleX(1); fill: #346DDB; } }
+    @keyframes gapBar3 { 0%,14% { transform: scaleX(0); fill: #e8932a; } 21.14% { transform: scaleX(1); fill: #e8932a; } 22.14%,100% { transform: scaleX(1); fill: #346DDB; } }
+    @keyframes gapGlow2 { 0%,6% { opacity: 0; } 8.5% { opacity: 0.4; } 15%,100% { opacity: 0; } }
+    @keyframes gapGlow3 { 0%,20% { opacity: 0; } 22.5% { opacity: 0.4; } 30%,100% { opacity: 0; } }
+    @media (prefers-reduced-motion: reduce) {
+      .bar2, .bar3, .glow2, .glow3 { animation: none; }
+      .bar { transform: scaleX(1); fill: #346DDB; }
+      .glow2, .glow3 { opacity: 0; }
+    }
+  </style>
+
+  <rect x="0" y="0" width="660" height="250" rx="10" fill="#ffffff"/>
+  <text x="16" y="26" font-family="ui-sans-serif, system-ui, sans-serif" font-size="12.5" font-weight="700" fill="#3a4350">Two events, two granularities</text>
+
+  <g font-family="ui-sans-serif, system-ui, sans-serif">
+    <!-- column glows (transient highlight when each monitor fires) -->
+    <rect class="glow2" x="337" y="82" width="136" height="114" rx="9" fill="#e8932a" opacity="0"/>
+    <rect class="glow3" x="497" y="82" width="136" height="114" rx="9" fill="#e8932a" opacity="0"/>
+
+    <!-- column headers -->
+    <text x="245" y="48" text-anchor="middle" font-size="11.5" font-weight="700" fill="#3a4350">Healthy</text>
+    <text x="245" y="62" text-anchor="middle" font-size="10.5" fill="#5b6573">starting state</text>
+
+    <text x="405" y="48" text-anchor="middle" font-size="11.5" font-weight="700" fill="#3a4350">Monitor A fires</text>
+    <text x="405" y="62" text-anchor="middle" font-size="10.5" fill="#1f5fc0">HEALTHY &#8594; FLAKY</text>
+    <rect x="345" y="72" width="120" height="5" rx="2.5" fill="#d9dee6"/>
+    <rect class="bar bar2" x="345" y="72" width="120" height="5" rx="2.5" fill="#e8932a"/>
+
+    <text x="565" y="48" text-anchor="middle" font-size="11.5" font-weight="700" fill="#3a4350">Monitor B fires</text>
+    <text x="565" y="62" text-anchor="middle" font-size="10.5" fill="#5b6573">already FLAKY</text>
+    <rect x="505" y="72" width="120" height="5" rx="2.5" fill="#d9dee6"/>
+    <rect class="bar bar3" x="505" y="72" width="120" height="5" rx="2.5" fill="#e8932a"/>
+
+    <!-- lane labels -->
+    <text x="16" y="104" font-size="11" fill="#5b6573">Test status</text>
+    <text x="16" y="136" font-size="10" fill="#5b6573">v2.test_case.</text>
+    <text x="16" y="148" font-size="10" fill="#5b6573">status_changed</text>
+    <text x="16" y="176" font-size="10" fill="#5b6573">test_case.</text>
+    <text x="16" y="188" font-size="10" fill="#5b6573">monitor_status_changed</text>
+
+    <!-- Row 1: Test status (HEALTHY then FLAKY then FLAKY) -->
+    <rect x="185" y="88" width="120" height="26" rx="6" fill="#e4f5ea" stroke="#1d9b54" stroke-width="1.5"/>
+    <text x="245" y="105" text-anchor="middle" font-size="12" font-weight="700" fill="#1d7a4d">HEALTHY</text>
+    <text x="325" y="106" text-anchor="middle" font-size="13" fill="#9aa3b0">&#8594;</text>
+    <rect x="345" y="88" width="120" height="26" rx="6" fill="#e3edfc" stroke="#346DDB" stroke-width="1.5"/>
+    <text x="405" y="105" text-anchor="middle" font-size="12" font-weight="700" fill="#1f5fc0">FLAKY</text>
+    <text x="485" y="106" text-anchor="middle" font-size="13" fill="#9aa3b0">&#8594;</text>
+    <rect x="505" y="88" width="120" height="26" rx="6" fill="#e3edfc" stroke="#346DDB" stroke-width="1.5"/>
+    <text x="565" y="105" text-anchor="middle" font-size="12" font-weight="700" fill="#1f5fc0">FLAKY</text>
+
+    <!-- Row 2: v2.test_case.status_changed (none, sent, NO event) -->
+    <text x="245" y="145" text-anchor="middle" font-size="11" fill="#c2c8d0">&#183;</text>
+    <rect x="345" y="128" width="120" height="26" rx="6" fill="#e3edfc" stroke="#346DDB" stroke-width="1.5"/>
+    <text x="405" y="145" text-anchor="middle" font-size="11" fill="#1f5fc0">event sent</text>
+    <rect x="505" y="128" width="120" height="26" rx="6" fill="#eef1f5" stroke="#9aa3b0" stroke-width="1.5" stroke-dasharray="4 3"/>
+    <text x="565" y="145" text-anchor="middle" font-size="11" fill="#5b6573">no event</text>
+
+    <!-- Row 3: test_case.monitor_status_changed (none, sent, sent) -->
+    <text x="245" y="185" text-anchor="middle" font-size="11" fill="#c2c8d0">&#183;</text>
+    <rect x="345" y="168" width="120" height="26" rx="6" fill="#e2f4f1" stroke="#18a394" stroke-width="1.5"/>
+    <text x="405" y="185" text-anchor="middle" font-size="11" fill="#0e7d70">event sent</text>
+    <rect x="505" y="168" width="120" height="26" rx="6" fill="#e2f4f1" stroke="#18a394" stroke-width="1.5"/>
+    <text x="565" y="185" text-anchor="middle" font-size="11" fill="#0e7d70">event sent</text>
+
+    <!-- caption -->
+    <text x="16" y="218" font-size="11.5" fill="#5b6573">status_changed fires only when the overall status changes, so Monitor B sends nothing.</text>
+    <text x="16" y="234" font-size="11.5" fill="#5b6573">monitor_status_changed fires on every activation, so it catches both escalations.</text>
+  </g>
+</svg>
diff --git a/flaky-tests/recipes/alert-on-test-escalation.mdx b/flaky-tests/recipes/alert-on-test-escalation.mdx
index 354fecd0..82441200 100644
--- a/flaky-tests/recipes/alert-on-test-escalation.mdx
+++ b/flaky-tests/recipes/alert-on-test-escalation.mdx
@@ -3,20 +3,24 @@ title: "Alert When a Test Escalates"
 description: "Send Slack alerts when a test gets worse, not just the first time it's flagged"
 og:title: "Alerting on flaky test escalation with Trunk webhooks"
 ---
-A single "this test is now flaky" alert tells you a test crossed a threshold once. It doesn't tell you when that same test keeps getting worse — failing on more branches, tripping additional monitors, or degrading from flaky to a consistently broken regression. For tests that matter, you want to hear about the escalation, not just the first detection.
+A single "this test is now flaky" alert tells you a test crossed a threshold once. It says nothing about what happens next: the same test failing on more branches, tripping more monitors, or sliding from flaky into a consistently broken regression. For the tests that matter, you want to hear about the escalation, not just the first detection.
 
-This page shows how to wire that up with Trunk webhooks and a Slack transformation. It builds on the [Slack integration guide](../webhooks/slack-integration) — set that connection up first, then come back here to filter it down to escalations.
+This page wires that up with Trunk webhooks and a Slack transformation. It builds on the [Slack integration guide](../webhooks/slack-integration), so set that connection up first, then come back here to filter it down to escalations.
 
 ## Pick the right event
 
-The key decision is which event you subscribe to, because two different events fire at two different granularities.
+The one decision that matters is which event you subscribe to. Two events fire here, at two different granularities.
 
 | Event | Fires when | Use it to |
 |---|---|---|
 | [`v2.test_case.status_changed`](../webhooks/index) | The test's **overall health status** transitions between `HEALTHY`, `FLAKY`, and `BROKEN` | Alert on health escalations like `FLAKY` → `BROKEN` |
 | [`test_case.monitor_status_changed`](../webhooks/index) | **Any individual monitor** activates or resolves for the test | Alert every time a monitor flags the test, even if its overall status doesn't move |
 
-The distinction matters. `v2.test_case.status_changed` only fires when the test's combined status changes. If a test is already `FLAKY` and a second monitor starts flagging it, the overall status stays `FLAKY`, so no `v2.test_case.status_changed` event is sent. To catch a test getting flagged by more monitors over time — the "more than just the first detection" case — subscribe to `test_case.monitor_status_changed` instead.
+That distinction matters. `v2.test_case.status_changed` only fires when the test's combined status changes. If a test is already `FLAKY` and a second monitor starts flagging it, the overall status stays `FLAKY`, so nothing is sent. To catch a test that keeps getting flagged by more monitors over time (the "more than just the first detection" case), subscribe to `test_case.monitor_status_changed` instead.
+
+<Frame>
+  <img src="/assets/flaky-tests/recipes/event-granularity-gap.svg" alt="A test goes HEALTHY to FLAKY when Monitor A fires, so both events send. When Monitor B fires while the test is already FLAKY, v2.test_case.status_changed sends nothing while test_case.monitor_status_changed still fires." />
+</Frame>
 
 <Info>
 Test status priority is **Broken > Flaky > Healthy**. A test flagged by both a broken-type and a flaky-type monitor shows as `BROKEN` until the broken monitor resolves. See [Flake Detection](../detection/) for how the combined status is calculated.
@@ -24,7 +28,7 @@ Test status priority is **Broken > Flaky > Healthy**. A test flagged by both a b
 
 ## Alert when a test becomes broken
 
-Use this when you want a louder, separate signal for tests that have degraded into consistent failures, distinct from routine flakiness.
+Use this when consistently failing tests deserve a louder, separate signal than routine flakiness.
 
 **1. Configure a broken-type monitor.** A test only reaches `BROKEN` status when a [failure rate](../detection/failure-rate-monitor) or [failure count](../detection/failure-count-monitor) monitor with its **Detection type** set to **Broken** is active for it. Set one up if you haven't already. A common pattern is to pair a broken-type monitor (catching consistently failing tests) with a flaky-type monitor (catching intermittent ones).
 
@@ -49,7 +53,7 @@ function handler(webhook) {
 }
 ```
 
-To alert *only* when a test reaches the broken state — and stay silent on first-time flaky detections — gate on the new status directly instead:
+To alert *only* when a test reaches the broken state, and stay quiet on first-time flaky detections, gate on the new status directly instead:
 
 ```javascript
 function handler(webhook) {
@@ -67,14 +71,18 @@ function handler(webhook) {
 Both snippets replace the `handler` function from the [Slack integration guide](../webhooks/slack-integration#id-2.-customize-your-transformation); keep that guide's `summarizeTestCase` helper in the same transformation so the message body still renders. Its `previous_status → new_status` line makes the escalation obvious in the channel.
 
 <Warning>
-Classifying a test as **broken** changes its health status, and that can change quarantine behavior. Auto-quarantine applies only to tests with a **Flaky** status, so when a broken-type monitor flags a test that was auto-quarantined as flaky, the test becomes `BROKEN`, drops out of the auto-quarantine set, and its failures start blocking CI again. This is by design — a broken test is a real regression, not a flake to skip — but it means a broken classification is not a side-effect-free way to get an escalation alert.
+Classifying a test as **broken** changes its health status, and that can change quarantine behavior. Auto-quarantine applies only to tests with a **Flaky** status. So when a broken-type monitor flags a test that was auto-quarantined as flaky, the test becomes `BROKEN`, drops out of the auto-quarantine set, and its failures start blocking CI again. That is by design (a broken test is a real regression, not a flake to skip), but it means a broken classification is not a side-effect-free way to get an escalation alert.
 
 If you want the escalation signal *without* touching quarantine, use a **labeling** monitor instead (see [Alert every time a monitor flags a test](#alert-every-time-a-monitor-flags-a-test) below). Labels don't change health status, so an auto-quarantined test stays quarantined. Manually quarantined tests are unaffected either way. See [Quarantining](../quarantining/) and [Flake Detection](../detection/) for the full composite-status behavior.
 </Warning>
 
+<Frame>
+  <img src="/assets/flaky-tests/recipes/broken-classification-quarantine.svg" alt="A flaky, auto-quarantined test with CI passing. A broken-type monitor fires and reclassifies it as BROKEN. Because broken tests are not quarantine candidates, it drops out of auto-quarantine and its failures block CI again." />
+</Frame>
+
 ## Alert every time a monitor flags a test
 
-Use this when you want to know about every detection event on a test, including the ones that don't change its overall status — a second monitor piling on, or a labeling monitor surfacing a new pattern.
+Use this when you want to know about every detection event on a test, including the ones that don't change its overall status (a second monitor piling on, or a labeling monitor surfacing a new pattern).
 
 **1. Subscribe to `test_case.monitor_status_changed`.** On your Slack endpoint, enable this event in addition to (or instead of) `v2.test_case.status_changed`.
 
@@ -112,15 +120,15 @@ function handler(webhook) {
 }
 ```
 
-Because `test_case.monitor_status_changed` fires for every monitor independently, this catches a test that keeps tripping new monitors over time, even while its headline status stays `FLAKY`. The `monitor.type` field tells you which monitor fired, so you can branch on it — for example, route [labeling monitors](../management/test-labels#automatic-labeling-from-monitors) to a triage channel and health classification monitors to your on-call channel.
+Because `test_case.monitor_status_changed` fires for every monitor independently, this catches a test that keeps tripping new monitors over time, even while its headline status stays `FLAKY`. The `monitor.type` field tells you which monitor fired, so you can branch on it: route [labeling monitors](../management/test-labels#automatic-labeling-from-monitors) to a triage channel and health classification monitors to your on-call channel.
 
 <Tip>
-Prefer labels over a broken classification when you want to triage a pattern without changing a test's health status — and, as noted above, without disturbing auto-quarantine. Configure a monitor's action as **Apply labels**, then filter `test_case.monitor_status_changed` on `monitor.type` to route those activations wherever they belong. See [Test Labels](../management/test-labels) for the full setup.
+Prefer labels over a broken classification when you want to triage a pattern without changing a test's health status (and, as noted above, without disturbing auto-quarantine). Configure a monitor's action as **Apply labels**, then filter `test_case.monitor_status_changed` on `monitor.type` to route those activations wherever they belong. See [Test Labels](../management/test-labels) for the full setup.
 </Tip>
 
 ## Related
 
-- [Integration for Slack](../webhooks/slack-integration) — set up the Slack connection these transformations build on
-- [Webhooks](../webhooks/index) — the full event catalog and field reference
-- [Flake Detection](../detection/) — how monitors classify tests as flaky or broken
-- [Test Labels](../management/test-labels) — apply and route labels with monitors
+- [Integration for Slack](../webhooks/slack-integration). The Slack connection these transformations build on.
+- [Webhooks](../webhooks/index). The full event catalog and field reference.
+- [Flake Detection](../detection/). How monitors classify tests as flaky or broken.
+- [Test Labels](../management/test-labels). Apply and route labels with monitors.

From a824d191cd306276f9d6cb5a7acaac42f8a02422 Mon Sep 17 00:00:00 2001
From: Sam Gutentag <1404219+samgutentag@users.noreply.github.com>
Date: Mon, 15 Jun 2026 10:55:20 -0700
Subject: [PATCH 7/8] docs(flaky-tests): unwrap quarantine Warning into its own
 section

Apply CONTRIBUTING admonition rules to the escalation recipe:
- The quarantine side effect was a two-paragraph <Warning> wrapping what
  is really core content. The guide forbids wrapping a section in a
  callout, and a reversible, by-design behavior is not a Warning-grade
  hazard. Promote it to a '## The quarantine trade-off' section (prose),
  and move the broken-classification animation into it.
- Trim the label <Tip> so it no longer duplicates that section; it now
  covers only the optional label-routing mechanics.

Page now has two callouts (Info for background, Tip for an optional path),
none stacked or section-wrapping.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
---
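
Note for reviewers: the quarantine trade-off described in the new section can be sketched as a small predicate. This is a minimal model of the documented behavior, not Trunk's implementation; the function name `failureBlocksCi` and the fields `autoQuarantineEnabled`, `manuallyQuarantined`, and `labels` are hypothetical and chosen only for illustration.

```javascript
// Hypothetical model of the quarantine side effect; field names are
// illustrative assumptions, not Trunk's schema.
function failureBlocksCi(test) {
  // Manual quarantine holds regardless of health status.
  if (test.manuallyQuarantined) return false;
  // Auto-quarantine only covers tests whose overall status is FLAKY.
  const autoQuarantined = test.autoQuarantineEnabled && test.status === "FLAKY";
  return !autoQuarantined;
}

// A flaky, auto-quarantined test: failures are ignored by CI.
const flaky = { status: "FLAKY", autoQuarantineEnabled: true, manuallyQuarantined: false };
// A broken-type monitor fires: the status changes, so quarantine no longer applies.
const reclassified = { ...flaky, status: "BROKEN" };
// A labeling monitor fires: the status is untouched, so quarantine still applies.
const labeled = { ...flaky, labels: ["example-label"] };
```

Under this model only `reclassified` blocks CI, which is the asymmetry the section asks readers to weigh before choosing a broken-type monitor over labels.
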
 flaky-tests/recipes/alert-on-test-escalation.mdx | 10 +++++-----
 1 file changed, 5 insertions(+), 5 deletions(-)

diff --git a/flaky-tests/recipes/alert-on-test-escalation.mdx b/flaky-tests/recipes/alert-on-test-escalation.mdx
index 82441200..f7e9f9b6 100644
--- a/flaky-tests/recipes/alert-on-test-escalation.mdx
+++ b/flaky-tests/recipes/alert-on-test-escalation.mdx
@@ -70,11 +70,11 @@ function handler(webhook) {
 
 Both snippets replace the `handler` function from the [Slack integration guide](../webhooks/slack-integration#id-2.-customize-your-transformation); keep that guide's `summarizeTestCase` helper in the same transformation so the message body still renders. Its `previous_status → new_status` line makes the escalation obvious in the channel.
 
-<Warning>
-Classifying a test as **broken** changes its health status, and that can change quarantine behavior. Auto-quarantine applies only to tests with a **Flaky** status. So when a broken-type monitor flags a test that was auto-quarantined as flaky, the test becomes `BROKEN`, drops out of the auto-quarantine set, and its failures start blocking CI again. That is by design (a broken test is a real regression, not a flake to skip), but it means a broken classification is not a side-effect-free way to get an escalation alert.
+## The quarantine trade-off
 
-If you want the escalation signal *without* touching quarantine, use a **labeling** monitor instead (see [Alert every time a monitor flags a test](#alert-every-time-a-monitor-flags-a-test) below). Labels don't change health status, so an auto-quarantined test stays quarantined. Manually quarantined tests are unaffected either way. See [Quarantining](../quarantining/) and [Flake Detection](../detection/) for the full composite-status behavior.
-</Warning>
+Before you reach for a broken-type monitor, know what it does to quarantine. Classifying a test as broken changes its health status, and auto-quarantine applies only to tests with a **Flaky** status. So when a broken-type monitor flags a test that was auto-quarantined as flaky, the test becomes `BROKEN`, drops out of the auto-quarantine set, and its failures start blocking CI again. That is by design, since a broken test is a real regression, not a flake to skip. It also means a broken classification is not a side-effect-free way to get an escalation alert.
+
+Labels avoid this. A labeling monitor doesn't change health status, so an auto-quarantined test stays quarantined while you still get the activation signal (see [Alert every time a monitor flags a test](#alert-every-time-a-monitor-flags-a-test) below). Manually quarantined tests are unaffected either way. See [Quarantining](../quarantining/) and [Flake Detection](../detection/) for the full composite-status behavior.
 
 <Frame>
   <img src="/assets/flaky-tests/recipes/broken-classification-quarantine.svg" alt="A flaky, auto-quarantined test with CI passing. A broken-type monitor fires and reclassifies it as BROKEN. Because broken tests are not quarantine candidates, it drops out of auto-quarantine and its failures block CI again." />
@@ -123,7 +123,7 @@ function handler(webhook) {
 Because `test_case.monitor_status_changed` fires for every monitor independently, this catches a test that keeps tripping new monitors over time, even while its headline status stays `FLAKY`. The `monitor.type` field tells you which monitor fired, so you can branch on it: route [labeling monitors](../management/test-labels#automatic-labeling-from-monitors) to a triage channel and health classification monitors to your on-call channel.
 
 <Tip>
-Prefer labels over a broken classification when you want to triage a pattern without changing a test's health status (and, as noted above, without disturbing auto-quarantine). Configure a monitor's action as **Apply labels**, then filter `test_case.monitor_status_changed` on `monitor.type` to route those activations wherever they belong. See [Test Labels](../management/test-labels) for the full setup.
+To route by pattern without changing a test's health status, set a monitor's action to **Apply labels**, then branch on `monitor.type` in your transform to send those activations wherever they belong. See [Test Labels](../management/test-labels) for the full setup.
 </Tip>
 
 ## Related

From 487f8707024524fd28ebd6894ba87d1a5672eddd Mon Sep 17 00:00:00 2001
From: Sam Gutentag <1404219+samgutentag@users.noreply.github.com>
Date: Mon, 15 Jun 2026 11:02:25 -0700
Subject: [PATCH 8/8] docs(flaky-tests): clarify monitor_status_changed fires
 per-monitor in gap diagram

Accuracy pass on the event-granularity diagram. The column-3 event is
correct (monitor_status_changed fires on Monitor B's own activation,
independent of overall status), but the framing invited a 'why an event
if FLAKY to FLAKY?' misread. Sharpen it:
- column 3 sublabel 'already FLAKY' -> '2nd monitor, still FLAKY'
- caption: 'catches both escalations' -> 'fires on every monitor
  activation, so it catches both' (Monitor A's first detection is not an
  escalation)

broken-classification diagram audited, accurate, unchanged.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
---
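
Note for reviewers: the behavior the diagram depicts can be checked with a small replay. This is a sketch under the assumptions stated in the recipe (overall status is the highest-priority classification among active monitors, Broken > Flaky > Healthy); the `replay` and `overallStatus` helpers and the step shape are hypothetical, and the emitted objects are simplified stand-ins, not the real webhook payloads.

```javascript
// Hypothetical model of the two event granularities; not Trunk's implementation.
const PRIORITY = { HEALTHY: 0, FLAKY: 1, BROKEN: 2 };

// Overall status is the highest-priority classification among active monitors.
function overallStatus(activeMonitors) {
  let status = "HEALTHY";
  for (const classification of activeMonitors.values()) {
    if (PRIORITY[classification] > PRIORITY[status]) status = classification;
  }
  return status;
}

// Replays monitor activations and resolutions, listing the events each step would emit.
function replay(steps) {
  const active = new Map(); // monitor name -> "FLAKY" | "BROKEN"
  const events = [];
  for (const step of steps) {
    const before = overallStatus(active);
    if (step.active) active.set(step.monitor, step.classification);
    else active.delete(step.monitor);
    const after = overallStatus(active);
    // Every activation or resolution emits a per-monitor event.
    events.push({ event: "test_case.monitor_status_changed", monitor: step.monitor });
    // The overall event is emitted only when the combined status actually moves.
    if (before !== after) {
      events.push({ event: "v2.test_case.status_changed", previous_status: before, new_status: after });
    }
  }
  return events;
}

// The diagram's scenario: Monitor A fires, then Monitor B fires while still FLAKY.
const events = replay([
  { monitor: "A", classification: "FLAKY", active: true },
  { monitor: "B", classification: "FLAKY", active: true },
]);
```

In this model Monitor B contributes a `test_case.monitor_status_changed` entry but no `v2.test_case.status_changed` entry, which matches column 3 of the diagram.
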
 assets/flaky-tests/recipes/event-granularity-gap.svg | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/assets/flaky-tests/recipes/event-granularity-gap.svg b/assets/flaky-tests/recipes/event-granularity-gap.svg
index 03d77f37..06ec87c3 100644
--- a/assets/flaky-tests/recipes/event-granularity-gap.svg
+++ b/assets/flaky-tests/recipes/event-granularity-gap.svg
@@ -49,7 +49,7 @@
     <rect class="bar bar2" x="345" y="72" width="120" height="5" rx="2.5" fill="#e8932a"/>
 
     <text x="565" y="48" text-anchor="middle" font-size="11.5" font-weight="700" fill="#3a4350">Monitor B fires</text>
-    <text x="565" y="62" text-anchor="middle" font-size="10.5" fill="#5b6573">already FLAKY</text>
+    <text x="565" y="62" text-anchor="middle" font-size="10.5" fill="#5b6573">2nd monitor, still FLAKY</text>
     <rect x="505" y="72" width="120" height="5" rx="2.5" fill="#d9dee6"/>
     <rect class="bar bar3" x="505" y="72" width="120" height="5" rx="2.5" fill="#e8932a"/>
 
@@ -86,6 +86,6 @@
 
     <!-- caption -->
     <text x="16" y="218" font-size="11.5" fill="#5b6573">status_changed fires only when the overall status changes, so Monitor B sends nothing.</text>
-    <text x="16" y="234" font-size="11.5" fill="#5b6573">monitor_status_changed fires on every activation, so it catches both escalations.</text>
+    <text x="16" y="234" font-size="11.5" fill="#5b6573">monitor_status_changed fires on every monitor activation, so it catches both.</text>
   </g>
 </svg>