feat: add default metrics for core evals #1602

Merged
zakiali merged 1 commit into main from zaki/bench-add-default-metrics
Mar 15, 2025

@zakiali (Collaborator) commented Mar 10, 2025

This PR adds default metrics to the core eval suite.

@zakiali force-pushed the zaki/bench-add-default-metrics branch from 55643b7 to ba31836 on March 10, 2025 21:54
@zakiali force-pushed the zaki/bench-add-default-metrics branch from ba31836 to 56752d7 on March 10, 2025 22:07
@zakiali merged commit a9fefd0 into main on Mar 15, 2025
6 checks passed
@zakiali deleted the zaki/bench-add-default-metrics branch on March 15, 2025 01:11
laanak08 added a commit that referenced this pull request Mar 16, 2025
* main: (31 commits)
  feat: add default metrics for core evals (#1602)
  feat(google_drive): use oauth2 crate for PKCE support, make token storage generic over Serializable (#1645)
  ui: reorganize extensions settings (#1702)
  feat: google_drive write tools and read comment tool (#1650)
  fix: developer builtin name (#1699)
  chore: update extensions section to work with new endpoints (#1696)
  chore: move things around (#1662)
  ui: extensions state updates (#1674)
  docs: goose ollama blog, updated (#1691)
  ui: load builtins (#1679)
  chore(release): release version 1.0.14 (#1676)
  Revert "feat: handling larger more complex PDF docs (and fix) (#1663)" (#1675)
  fix: uvshim default to existing uv configuration (#1670)
  fix: handle interruptions during tool responses (#1651)
  feat: Copy error message button in toast (#1658)
  feat: handling larger more complex PDF docs (and fix) (#1663)
  Add Filesystem Tutorial (#1666)
  docs: figma blog post (#1647)
  docs: updating goose modes doc (#1665)
  docs: Add running tasks guide (#1626)
  ...
cbruyndoncx pushed a commit to cbruyndoncx/goose that referenced this pull request Jul 20, 2025