Skip to content

fix: Route Sentry telemetry and simplify evals#617

Merged
dcramer merged 1 commit into
mainfrom
fix/evals-native-session-gh-615
Jun 18, 2026
Merged

fix: Route Sentry telemetry and simplify evals#617
dcramer merged 1 commit into
mainfrom
fix/evals-native-session-gh-615

Conversation

@dcramer

@dcramer dcramer commented Jun 18, 2026

Copy link
Copy Markdown
Member

Sentry product telemetry and feature-usage requests now steer through the Sentry skill instead of falling through to Hex as the visible skill. Hex remains the internal data primitive, but its skill description now explicitly excludes direct use for Sentry product telemetry when the Sentry skill is available.

The eval cleanup supports that fix by moving Junior evals onto native vitest-evals session/tool primitives instead of repo-local output buckets. The Slack eval harness returns a standard HarnessRun, rubric criteria are reduced to pass/fail, and deterministic checks use helpers such as toolCalls(result.session) and assistantMessages(result.session).

The docs and local agent instructions now point eval authors at the package eval scripts, direct file/-t arguments, and the native Pi harness boundary for ordered full-turn transcript behavior. This avoids adding a second event-log or sequencing layer in Junior while keeping Slack evals focused on Slack-visible behavior.

Refs GH-615

@vercel

vercel Bot commented Jun 18, 2026

Copy link
Copy Markdown

The latest updates on your projects. Learn more about Vercel for GitHub.

Project Deployment Actions Updated (UTC)
junior-docs Ready Ready Preview, Comment Jun 18, 2026 8:10pm

Request Review

Comment thread packages/junior-evals/package.json
@dcramer dcramer changed the title test(evals): Use native session assertions fix: Route Sentry telemetry and simplify evals Jun 18, 2026
@dcramer dcramer force-pushed the fix/evals-native-session-gh-615 branch from b6fccd4 to 8bd0ff3 Compare June 18, 2026 19:43

@cursor cursor Bot left a comment

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Cursor Bugbot has reviewed your changes and found 1 potential issue.

Fix All in Cursor

❌ Bugbot Autofix is OFF. To automatically fix reported issues with cloud agents, enable autofix in the Cursor dashboard.

Reviewed by Cursor Bugbot for commit 8bd0ff3. Configure here.

Comment thread packages/junior-evals/evals/core/scheduler.eval.ts
Route Sentry product telemetry and feature-usage requests through the Sentry skill while keeping Hex as the internal data primitive. Clarify the skill descriptions so explicit Sentry telemetry requests do not steer directly to Hex.

Move Junior evals onto the normalized vitest-evals session surface for judge scoring and deterministic assertions. Simplify rubrics to pass/fail and document the package eval commands so agents do not guess at direct Vitest invocation.

Keep ordered full-turn transcript behavior owned by the native vitest-evals Pi harness instead of adding repo-local event log layers.

Refs GH-615

Co-Authored-By: GPT-5 Codex <noreply@openai.com>
@dcramer dcramer force-pushed the fix/evals-native-session-gh-615 branch from 8bd0ff3 to 717581e Compare June 18, 2026 20:09
@dcramer dcramer marked this pull request as ready for review June 18, 2026 20:23
@dcramer dcramer merged commit 66dcb5a into main Jun 18, 2026
16 checks passed
@dcramer dcramer deleted the fix/evals-native-session-gh-615 branch June 18, 2026 20:24
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant