Skip to content

feat: add flag to strictly evaluate deterministic metrics#24

Open
ev1lm0nk3y wants to merge 2 commits into
WillAbides:mainfrom
ev1lm0nk3y:strict-deterministic-metrics
Open

feat: add flag to strictly evaluate deterministic metrics#24
ev1lm0nk3y wants to merge 2 commits into
WillAbides:mainfrom
ev1lm0nk3y:strict-deterministic-metrics

Conversation

@ev1lm0nk3y
Copy link
Copy Markdown

Purpose

Running benchmarks within github actions or host that may have multiple users running various workloads can skew any time-based benchmark metrics leading to potential noisy CI failures. Also, this will give benchdiff another tool in its kit.

What This Does

This flag allows benchdiff to bypass benchstat's statistical significance checks for allocs/op and B/op, ensuring that any regression in these metrics exceeding the tolerance threshold triggers a failure.

By default, any regression in these metrics will signal failure but a tolerance can be configured.

…tic metrics

This flag allows benchdiff to bypass benchstat's statistical significance
checks for allocs/op and B/op, ensuring that any regression in these
metrics exceeding the tolerance threshold triggers a failure.
This allows users to define a separate threshold for regressions in
allocs/op and B/op. It defaults to 0.0 as requested.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant