-
Notifications
You must be signed in to change notification settings - Fork 2.8k
Enable browser-use tool for all image-capable models #8121
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I reviewed my own code and found it's only half-baked. The other half is still in the oven.
daniel-lxs
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There's still a lot of references to supportsComputerUse in the code however it can be removed as a follow up PR.
daniel-lxs
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
✅ No issues found.
🔄 Code Review UpdateRe-reviewed PR after new commit (3ba204d). The latest commit completes the translation cleanup by removing both Status of All IssuesAll previously flagged issues have been resolved:
Latest Changes (Commit 3ba204d)✅ Test cleanup - Removed outdated comments from ✅ Translation cleanup - Completed removal of unused Summary✅ All issues resolved - The core functionality change (using ✅ Ready for merge - No outstanding issues or concerns. |
29fb04f to
3ba204d
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
No issues found.
Summary
This PR addresses Issue #8116 by enabling the browser-use tool for all models that support images, rather than restricting it to models with explicit computer-use capability.
Changes
supportsComputerUsetosupportsImagesin browser tool availability checksRationale
As discussed in the issue, any model that can process images inherently has the capability to understand screenshots, which is the primary requirement for browser interaction. The previous restriction to only models with explicit computer-use support was unnecessarily limiting.
Files Modified
src/core/webview/generateSystemPrompt.tssrc/core/task/Task.tssrc/core/prompts/sections/capabilities.tssrc/core/prompts/sections/rules.tssrc/core/prompts/system.tssrc/core/prompts/tools/index.tssrc/core/prompts/tools/types.tssrc/core/prompts/tools/browser-action.tsTesting
Fixes #8116
Important
Enable browser-use tool for all image-capable models by changing checks from
supportsComputerUsetosupportsImages.supportsComputerUsetosupportsImagesin browser tool availability checks.generateSystemPrompt.ts,Task.ts, andindex.ts.supportsComputerUsefrom model definitions inmodel.ts,anthropic.ts, andbedrock.ts.generateSystemPrompt.browser-capability.spec.tsto verify browser capability logic.supportsComputerUserelated UI elements and translations inModelInfoView.tsxandsettings.jsonfiles.This description was created by
for 3ba204d. You can customize this summary. It will automatically update as commits are pushed.