perf: single-pass drift metric computation (#447) by tcconnally · Pull Request #464 · Perseus-Computing-LLC/perseus

tcconnally · 2026-06-26T04:33:47Z

Addresses #447 item 2b (drift). _compute_drift built two lists (recent/baseline) and then iterated each three times (rate(), tokens(), avg_len()); the tokens pass also re-ran the _extract_recommendation_tokens regex per entry.

Collapse to one pass: classify each entry into its window bucket and accumulate count, positive count, response-length sum, and the recommendation-token set inline, so each entry is visited once and its tokens are extracted once. Metrics and the result dict are byte-identical (the existing drift tests pin them).
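A minimal sketch of the single-pass shape described above, not the actual `_compute_drift` code. The entry fields (`timestamp`, `accepted`, `response`, `recommendation`), the window sizes, the token regex, and the result keys are all hypothetical stand-ins, since the diff itself is not shown here.

```python
import re
from dataclasses import dataclass, field
from datetime import datetime, timedelta

# Hypothetical token pattern; the real _extract_recommendation_tokens regex is not shown in this PR.
_TOKEN_RE = re.compile(r"[a-z0-9_]+")


@dataclass
class WindowStats:
    """Running totals for one window (recent or baseline)."""
    count: int = 0
    positive: int = 0
    length_sum: int = 0
    tokens: set = field(default_factory=set)

    def add(self, entry):
        # Everything is accumulated in one visit, so tokens are extracted once per entry.
        self.count += 1
        self.positive += 1 if entry.get("accepted") else 0
        self.length_sum += len(entry.get("response", ""))
        self.tokens.update(_TOKEN_RE.findall(entry.get("recommendation", "").lower()))

    def rate(self):
        return self.positive / self.count if self.count else 0.0

    def avg_len(self):
        return self.length_sum / self.count if self.count else 0.0


def jaccard(a, b):
    # Convention chosen for this sketch: two empty sets give 0.0.
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def compute_drift(entries, now, recent_days=7, baseline_days=30):
    """Classify each entry into its window bucket in a single pass."""
    recent_start = now - timedelta(days=recent_days)
    baseline_start = now - timedelta(days=baseline_days)
    recent, baseline = WindowStats(), WindowStats()
    for entry in entries:
        ts = entry["timestamp"]
        if ts >= recent_start:
            recent.add(entry)
        elif ts >= baseline_start:
            baseline.add(entry)
        # Entries older than the baseline window are ignored.
    return {
        "recent_count": recent.count,
        "baseline_count": baseline.count,
        "acceptance_rate_delta": recent.rate() - baseline.rate(),
        "token_jaccard": jaccard(recent.tokens, baseline.tokens),
        "avg_length_delta": recent.avg_len() - baseline.avg_len(),
    }
```

Because the accumulators hold only sums, counts, and a set, the derived metrics come out the same as in a build-lists-then-iterate version; only the number of visits per entry changes.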

Deferred (same item): the time-windowed tail-read (reading only the last N days instead of the whole capped log). It needs a chronological-ordering assumption on the JSONL log for a marginal CLI-only gain, so it is left out; the full read is already bounded by pythia.max_entries.
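To illustrate why the tail-read depends on ordering, here is a small sketch under assumed names (`full_scan`, `tail_read`, and the `timestamp` field are hypothetical, and an in-memory list stands in for the JSONL log). A tail-read that walks backwards and stops at the first too-old entry matches a full scan only when the log is chronological.

```python
from datetime import datetime, timedelta


def full_scan(entries, cutoff):
    """Check every entry; correct regardless of log order."""
    return [e for e in entries if e["timestamp"] >= cutoff]


def tail_read(entries, cutoff):
    """Walk from the end and stop at the first entry older than the cutoff.

    This is only equivalent to full_scan when entries are in chronological order.
    """
    kept = []
    for e in reversed(entries):
        if e["timestamp"] < cutoff:
            break
        kept.append(e)
    kept.reverse()
    return kept


now = datetime(2026, 6, 26)
cutoff = now - timedelta(days=7)

# Chronological log: both strategies agree.
ordered = [{"timestamp": now - timedelta(days=d)} for d in (20, 5, 1)]
# Out-of-order log: the tail-read stops early and misses the 3-day-old entry.
unordered = [{"timestamp": now - timedelta(days=d)} for d in (3, 20, 1)]

ordered_agree = full_scan(ordered, cutoff) == tail_read(ordered, cutoff)
unordered_agree = full_scan(unordered, cutoff) == tail_read(unordered, cutoff)
```

With the read already capped, the full scan costs at most one pass over the capped log, which is consistent with the PR's judgement that the tail-read gain is marginal.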

Tests: full oracle suite green (56), incl. the 8 drift tests pinning the acceptance-rate / jaccard / avg-length / count outputs.

🤖 Generated with Claude Code

Addresses #447 item 2b (drift). _compute_drift built two lists (recent/baseline) then iterated each THREE times (rate(), tokens(), avg_len()), and the tokens pass re-ran the _extract_recommendation_tokens regex per entry.

Collapse to one pass: classify each entry into its window bucket and accumulate count, positive count, response-length sum, and the recommendation-token set inline, so each entry is visited once and its tokens extracted once. Metrics and the result dict are byte-identical (the existing drift tests pin them).

Deferred (same item): the time-windowed tail-read (read only the last N days instead of the whole capped log). It needs a chronological-ordering assumption on the JSONL log for a marginal CLI-only gain, so it's left out; the full read is already bounded by pythia.max_entries.

Tests: full oracle suite green (56), incl. the 8 drift tests that pin the acceptance-rate / jaccard / avg-length / count outputs.

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>

tcconnally merged commit ef3e502 into main Jun 26, 2026
5 checks passed

tcconnally merged 1 commit into main from perf/447-drift-single-pass
