Submission for Amazon Q Developer Agent v20240430-dev #4

timesler · 2024-05-10T04:53:00Z

Thank you for your great work putting together this benchmark and the leaderboard!

This PR submits benchmark results for the Amazon Q Developer Agent for feature development (v20240430-dev), a coding assistant tool recently launched by AWS.

Results achieved after running the SWE-bench evaluation harness are below.

| | SWE-bench | SWE-bench lite |
|---|---|---|
| % Resolved | 13.82% | 20.33% |

This PR provides predictions, results, and logs for both the full test set (2294 instances) and the lite subset (300 instances).

john-b-yang · 2024-05-11T16:49:54Z

@timesler thanks so much for the submission, it looks great!

I'm a bit busy traveling right now, so I haven't had a chance to look at it yet, but rest assured I'll get to it soon.

I'll check a few things out and have it merged by Tuesday this coming week.

john-b-yang · 2024-05-14T17:55:52Z

@timesler Just pulled the branch and verified the results - congratulations on setting SOTA on the full and lite splits of SWE-bench! 🎉

I have merged the results into the repository. I will make some minor naming tweaks to the log files + add the results/ folder containing statistics about the submission.

I will make a follow-up comment once the results have been propagated to https://www.swebench.com/.

timesler · 2024-05-14T18:12:41Z

Fantastic, thank you!

Submission for Amazon Q Developer Agent v20240430-dev

7aa9541

john-b-yang merged commit 75dc98d into SWE-bench:main May 14, 2024

john-b-yang mentioned this pull request May 18, 2024

How can one participate in the SWE-bench leaderboard? SWE-bench/SWE-bench#121

Closed

john-b-yang added a commit that referenced this pull request Oct 15, 2024

Merge pull request #4 from timesler/amazon-q-developer-agent-20240430…

816640e

…-dev Submission for Amazon Q Developer Agent v20240430-dev

john-b-yang added a commit that referenced this pull request Oct 15, 2024

Merge pull request #4 from timesler/amazon-q-developer-agent-20240430…

1dee0bc

…-dev Submission for Amazon Q Developer Agent v20240430-dev
