fix(worktree): skip rescue for noise-only worktrees by dimakis · Pull Request #397 · dimakis/mitzo

dimakis · 2026-06-25T17:23:26Z

Summary

Worktree rescue was creating draft PRs for sessions that had no meaningful work — just the .mitzo-session marker file. This caused 15+ empty "Rescued" PRs to pile up in mgmt.
Added a noise-file check after staging: if all staged files are in RESCUE_NOISE_FILES (currently just .mitzo-session), the rescue bails out early — no commit, no push, no PR.
The set is extensible if other noise markers appear in the future.

Test plan

New test: skips rescue when only .mitzo-session is staged
New test: skips rescue when no files are staged
New test: proceeds with rescue when meaningful files exist alongside noise
All 15 existing + new rescue-worktree tests pass

🤖 Generated with Claude Code

dimakis · 2026-06-25T17:28:16Z

Centaur Review

Found 4 issue(s) (1 warning).

`packages/client/src/sse-connection.ts`

Solid multi-pronged fix for replay storms and message loss. The worktree noise-file filtering and shouldSync/replay-cap changes are clean and well-tested. The SSE POST retry mechanism works but should guard against permanent HTTP failures to prevent infinite re-queue cycles.

🟡 unsafe_assumptions (L336): Re-queues on ALL non-ok responses (!res.ok), including permanent failures like 400 Bad Request or 403 Forbidden. These will never succeed but will cycle through pendingSends on every reconnect indefinitely. Consider limiting re-queue to transient statuses (404, 502, 503) or adding a max-retry counter to avoid infinite retry loops for permanently rejected messages. [fixable]
🔵 unsafe_assumptions (L352): flushPendingSends calls doPost without await. If a flushed POST fails and a new reconnect completes before the failed doPost's catch handler runs, the re-queued message won't be included in the next flush (it pushes to pendingSends after the new flush already cleared and iterated). This is a narrow timing window but could cause a message to sit in pendingSends until a third reconnect. [fixable]

`packages/protocol/src/event-store.ts`

Solid multi-pronged fix for replay storms and message loss. The worktree noise-file filtering and shouldSync/replay-cap changes are clean and well-tested. The SSE POST retry mechanism works but should guard against permanent HTTP failures to prevent infinite re-queue cycles.

🔵 missing_tests (L617): New getHeadSeq method has no unit test in the protocol package's event-store test suite. It is only mocked in the connection-registry tests. A dedicated test covering the three cases (session with events, empty session, non-existent session) would be valuable given this is a SQL query backing a replay-storm prevention mechanism. [fixable]

`server/index.ts`

Solid multi-pronged fix for replay storms and message loss. The worktree noise-file filtering and shouldSync/replay-cap changes are clean and well-tested. The SSE POST retry mechanism works but should guard against permanent HTTP failures to prevent infinite re-queue cycles.

🔵 style (L375): Comment says SSE is the 'Default transport' and WS is 'opt-in via localStorage', but this is a comment-only change in a PR titled 'fix(worktree): skip rescue for noise-only worktrees'. Consider splitting transport documentation updates into their own commit/PR to keep the changeset focused.

dimakis

Centaur Review

Found 5 issue(s) (2 warning).

`server/worktree.ts`

Solid defensive PR addressing replay storms and noise worktrees. Two substantive issues: basename() in noise-file check is overly broad (should be strict equality), and silently capping replay depth without a client-side gap signal may cause subtle data loss on reconnect.

🟡 bugs (L470): basename(f) matches .mitzo-session at any directory depth (e.g. subdir/.mitzo-session). Since .mitzo-session is only created at the worktree root, a simple equality check f === '.mitzo-session' would be more precise and avoid false positives if a file with the same name ever appears in a subdirectory. [fixable]

`packages/harness/src/connection-registry.ts`

Solid defensive PR addressing replay storms and noise worktrees. Two substantive issues: basename() in noise-file check is overly broad (should be strict equality), and silently capping replay depth without a client-side gap signal may cause subtle data loss on reconnect.

🟡 unsafe_assumptions (L248): When the replay gap is capped, events between the old and new cursor are silently dropped with no client-side notification. The comment says clients can lazy-load via REST API, but there is no gap signal (e.g. events_skipped message) sent to the client, so the client has no way to know it missed events unless it independently tracks seq continuity. Consider sending a gap notification so the client can trigger a full restore. [fixable]

`server/chat.ts`

Solid defensive PR addressing replay storms and noise worktrees. Two substantive issues: basename() in noise-file check is overly broad (should be strict equality), and silently capping replay depth without a client-side gap signal may cause subtle data loss on reconnect.

🔵 missing_tests (L1269): The new try/catch around queryInstance.interrupt() has no test coverage. A test where interrupt rejects (e.g. vi.fn().mockRejectedValue(new Error('not ready'))) should verify that the function still returns true and the prompt is still pushed to inputQueue. [fixable]

`packages/harness/tests/connection-registry.test.ts`

Solid defensive PR addressing replay storms and noise worktrees. Two substantive issues: basename() in noise-file check is overly broad (should be strict equality), and silently capping replay depth without a client-side gap signal may cause subtle data loss on reconnect.

🔵 missing_tests (L507): The 'caps replay depth' test doesn't verify the log.warn call or that the cursor is actually persisted (subsequent sync rounds should start from the new cursor, not re-cap). A follow-up assertion after a second timer advance would confirm the cursor update sticks. [fixable]

`server/index.ts`

Solid defensive PR addressing replay storms and noise worktrees. Two substantive issues: basename() in noise-file check is overly broad (should be strict equality), and silently capping replay depth without a client-side gap signal may cause subtle data loss on reconnect.

🔵 style (L372): The comment update ('SSE Chat Transport (primary)' and 'Default transport') is documentation about the transport migration status — useful context but may go stale. Minor nit, no action needed.

dimakis · 2026-06-25T21:59:09Z

+      timeout: WORKTREE_GIT_TIMEOUT_MS,
+    }).trim();
+    const files = staged ? staged.split('\n') : [];
+    if (files.length === 0 || files.every((f) => RESCUE_NOISE_FILES.has(basename(f)))) {


🟡 bugs: basename(f) matches .mitzo-session at any directory depth (e.g. subdir/.mitzo-session). Since .mitzo-session is only created at the worktree root, a simple equality check f === '.mitzo-session' would be more precise and avoid false positives if a file with the same name ever appears in a subdirectory. [fixable]

dimakis · 2026-06-25T21:59:09Z

+                connectionId,
+                sessionId,
+                oldCursor: cursor,
+                newCursor,


🟡 unsafe_assumptions: When the replay gap is capped, events between the old and new cursor are silently dropped with no client-side notification. The comment says clients can lazy-load via REST API, but there is no gap signal (e.g. events_skipped message) sent to the client, so the client has no way to know it missed events unless it independently tracks seq continuity. Consider sending a gap notification so the client can trigger a full restore. [fixable]

dimakis · 2026-06-25T21:59:09Z

      await Promise.allSettled(stops);
    }
-    await session.queryInstance.interrupt();
+    try {


🔵 missing_tests: The new try/catch around queryInstance.interrupt() has no test coverage. A test where interrupt rejects (e.g. vi.fn().mockRejectedValue(new Error('not ready'))) should verify that the function still returns true and the prompt is still pushed to inputQueue. [fixable]

dimakis · 2026-06-25T21:59:09Z

      const sessionIds = calls.map((c: unknown[]) => c[0]);
      expect(sessionIds).toContain('sess-a');
      expect(sessionIds).toContain('sess-b');



🔵 missing_tests: The 'caps replay depth' test doesn't verify the log.warn call or that the cursor is actually persisted (subsequent sync rounds should start from the new cursor, not re-cap). A follow-up assertion after a second timer advance would confirm the cursor update sticks. [fixable]

dimakis · 2026-06-25T21:59:09Z

 };

-// ─── SSE Chat Transport ──────────────────────────────────────────────────────
+// ─── SSE Chat Transport (primary) ────────────────────────────────────────────


🔵 style: The comment update ('SSE Chat Transport (primary)' and 'Default transport') is documentation about the transport migration status — useful context but may go stale. Minor nit, no action needed.

dimakis