Releases: EntityProcess/agentv
v4.25.4-next.1
What's Changed
Full Changelog: v4.25.3-next.1...v4.25.4-next.1
v4.25.3-next.1
What's Changed
- fix(core): derive score from assertions when score absent in code-grader by @christso in #1212
- fix(cli): detect global npm self-update installs by @christso in #1213
Full Changelog: v4.25.2-next.1...v4.25.3-next.1
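Note on #1212: the entry says the score is derived from assertions when a code-grader omits it, but does not give the formula. The sketch below shows one plausible rule (fraction of passing assertions). The function name `derive_score` and the `score`, `assertions`, and `passed` fields are hypothetical, chosen for illustration, and are not taken from the agentv source.

```python
from typing import Any, Optional


def derive_score(result: dict[str, Any]) -> Optional[float]:
    """Return the grader's score, falling back to the assertion pass rate.

    Assumed (hypothetical) result shape:
        {"score": 0.5, "assertions": [{"passed": True}, ...]}
    """
    score = result.get("score")
    if score is not None:
        # An explicit score always wins over a derived one.
        return float(score)

    assertions = result.get("assertions") or []
    if not assertions:
        # Nothing to derive from; leave the decision to the caller.
        return None

    passed = sum(1 for item in assertions if item.get("passed"))
    return passed / len(assertions)
```

Returning `None` when there is neither a score nor any assertion keeps "no information" distinct from a genuine score of 0.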
v4.25.2-next.1
What's Changed
- refactor(core): migrate LLM provider layer from Vercel AI SDK to pi-ai by @christso in #1206
- feat: code-grader plain-text fallback + workspace env preflight by @christso in #1209
Full Changelog: v4.25.1...v4.25.2-next.1
v4.25.1
v4.25.1-next.1
What's Changed
Full Changelog: v4.25.0...v4.25.1-next.1
v4.25.0
v4.25.0-next.1
What's Changed
- fix: show live run names before completion by @christso in #1199
- feat(results): direct-push auto_push to base branch by @christso in #1201
Full Changelog: v4.24.1...v4.25.0-next.1
v4.24.1
What's Changed
- fix: remove before_session hook startup color by @christso in #1193
- feat: add check-grader-scores post-processor by @christso in #1191
- fix: resolve self-update package manager command by @christso in #1195
- fix: use Studio threshold as Run Eval default by @christso in #1197
- fix(core): normalize rubric grader name to rubrics by @christso in #1196
Full Changelog: v4.24.0...v4.24.1
v4.24.1-next.1
What's Changed
- fix: remove before_session hook startup color by @christso in #1193
- feat: add check-grader-scores post-processor by @christso in #1191
- fix: resolve self-update package manager command by @christso in #1195
- fix: use Studio threshold as Run Eval default by @christso in #1197
- fix(core): normalize rubric grader name to rubrics by @christso in #1196
Full Changelog: v4.24.0...v4.24.1-next.1
v4.24.0
What's Changed
- fix(agentv-bench): fix grader subagent pipeline bugs by @jozsurf in #1148
- fix(pipeline): align subagent-mode suite fallback with CLI mode by @christso in #1151
- feat(core): optional governance metadata on EvalMetadata and EvalTest by @christso in #1165
- docs(examples): AI system register convention (.ai-register.yaml) + aggregator Action template by @christso in #1167
- feat(examples): scenario-based red-team suites for coding and customer-facing agent archetypes by @christso in #1168
- feat(examples): OWASP LLM Top 10 / MITRE ATLAS-aligned red-team eval pack by @christso in #1166
- fix(test): raise pipeline-e2e timeout to 30s (#1169) by @christso in #1170
- fix(docs): correct contains* case-sensitivity in grader.md by @christso in #1171
- fix(core): unwrap YAML merge keys (<<:) in eval loader by @christso in #1174
- fix(examples): use tool-trajectory not contains for destructive-git negative assertions by @christso in #1175
- fix(test): raise input.test.ts pipeline timeouts to 30s by @christso in #1176
- fix(test): bump e2e/integration test timeouts to 30s (#1173) by @christso in #1177
- feat(skill): add agentv-compliance skill for governance block authoring and lint (Phase 1 of #1172) by @christso in #1178
- refactor(core): remove typed governance schema, generalize metadata merge (Phase 2 of #1172) by @christso in #1179
- feat(examples): add compliance-lint GitHub Action reference (Phase 3 of #1172) by @christso in #1180
- chore: rename agentv-compliance skill to agentv-governance by @christso in #1181
- docs: fix risk_tier vocabulary mismatch and add eval-level governance docs by @christso in #1183
- docs(examples): align ai-register manifest with EU AI Act, NIST RMF, ISO 42001 by @christso in #1184
- feat(studio): add targets tab drilldown by @christso in #1189
- feat(red-team): screenshot PII upload eval for coding agents by @christso in #1185
- feat(cli): show eval progress durations in ms by @christso in #1187
New Contributors
Full Changelog: v4.22.0...v4.24.0
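Note on #1174: a YAML merge key (`<<:`) pulls the keys of one or more referenced mappings into the current mapping. The sketch below illustrates the general idea of unwrapping such keys on an already-parsed document. It is not the agentv eval loader; the function name `unwrap_merge_keys` and the sample field names are hypothetical. It follows the YAML merge key convention: explicit keys override merged ones, and earlier mappings in a merge list override later ones.

```python
from typing import Any

MERGE_KEY = "<<"


def unwrap_merge_keys(node: Any) -> Any:
    """Recursively fold "<<" entries into their parent mapping."""
    if isinstance(node, list):
        return [unwrap_merge_keys(item) for item in node]
    if not isinstance(node, dict):
        return node

    sources = node.get(MERGE_KEY, [])
    if isinstance(sources, dict):
        # A single merged mapping is treated as a one-element list.
        sources = [sources]

    merged: dict[str, Any] = {}
    # Apply later sources first so that earlier sources overwrite them.
    for source in reversed(sources):
        merged.update(unwrap_merge_keys(source))

    # Keys written directly on the mapping take precedence over merged keys.
    for key, value in node.items():
        if key != MERGE_KEY:
            merged[key] = unwrap_merge_keys(value)
    return merged


# Hypothetical eval fragment, shown as it might look after parsing.
defaults = {"timeout": 30, "grader": "rubrics"}
doc = {"tests": [{"<<": defaults, "id": "t1", "timeout": 60}]}
unwrapped = unwrap_merge_keys(doc)
```

Here `unwrapped["tests"][0]` keeps `grader` from `defaults` while its own `timeout` of 60 replaces the inherited 30.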