Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Force done output #835

Merged
merged 20 commits into from
Feb 23, 2025
Merged

Force done output #835

merged 20 commits into from
Feb 23, 2025

Conversation

MagMueller
Copy link
Collaborator

This pull request includes various changes to improve the functionality and readability of the browser_use/agent service. The most important changes include refactoring method signatures to use the | operator for type hints, restructuring the initialization process, and enhancing the handling of the last step in the agent's execution.

Refactoring method signatures:

  • Changed the method signatures in browser_use/agent/message_manager/service.py and browser_use/agent/message_manager/views.py to use the | operator for type hints instead of Optional. [1] [2] [3]

Restructuring initialization process:

  • Reorganized the __init__ method in browser_use/agent/service.py to initialize the state earlier and set up action models and message context more clearly. [1] [2] [3]

Enhancing last step handling:

  • Added logic to handle the last step in the agent's execution, including logging a message and setting the appropriate AgentOutput. [1] [2] [3]

Additional improvements:

  • Introduced the log_completion method to log the task completion status.
  • Updated the system prompt documentation to include new rules and clarify existing ones. [1] [2] [3] [4]

These changes collectively improve the maintainability and functionality of the agent service.

- Introduce warning message for the last step in the agent
- Add `success` parameter to `ActionResult` and `DoneAction`
- Modify `AgentStepInfo` to check for last step
- Update `AgentHistoryList` methods to handle optional values and success status
- Update `_add_message_with_tokens` and `add_message` methods to support `None` as default position
- Modify last step warning message in agent service
- Add logging for last step completion
- Improve type hints for message position parameters
- Capture token count before generating model output
- Store tokens in a local variable to ensure accurate input token tracking
- Modify metadata logging to use the pre-computed token count
- Update telemetry event to include more comprehensive agent performance metrics
- Modify `AgentEndTelemetryEvent` to capture total input tokens and duration
- Refactor `is_successful()` method to return `None` if agent is not done
- Adjust `ActionResult` default success value to `None`
- Enhance browser-use version detection to prioritize git repository check
- Add support for creating a done-only action model
- Modify logging to distinguish between successful and unfinished tasks
- Update agent to switch to done action model when max steps are reached
- Improve system prompt to clarify done action usage in max steps scenario
- Extend registry action model creation to support filtering actions
- Extract common task completion logging into a new `log_completion()` method
- Simplify duplicate logging code in `run()` and `run_with_max_steps()` methods
- Maintain existing logging behavior for task completion status
- Add support for 'raw' and 'json_mode' tool calling methods
- Introduce new methods for setting message context and tool calling method
- Modify input message conversion for specific model names
- Update tool calling method detection logic
- Extend ToolCallingMethod type to include new methods
- Update service to conditionally limit actions based on max_actions_per_step
- Modify views to use json_schema_extra for action validation
- Improve action list handling to prevent unnecessary slicing
- Add 'asyncio' marker to pytest configuration
- Simplify LLM fixture by removing FakeListChatModel
- Update logging and code style in conftest.py
- Update conftest.py to use SecretStr for OpenAI API key
- Modify test_models.py to improve model testing and error reporting
- Reorder model test parameters and IDs for better readability
- Add done and successful status tracking in model search test
- Improve error handling and logging in test model search function
- Modify system prompt to clarify done action usage when max steps are reached
- Update service to provide more descriptive action description for task completion
- Add explicit handling for task success status in done action
- Enhance documentation for done action to indicate task completion state
- Add type hints and type ignore comments in service.py for image message handling
- Update system prompt to provide clearer guidance on done action usage
- Refine action description for done action in controller service
- Enhance documentation for task completion scenarios
- Update last step warning message in service.py to provide clearer, more explicit instructions for the "done" action
- Modify test_models.py to comment out DeepSeek model and adjust model testing configuration
- Enhance guidance on task completion, success status, and action requirements
- Refactor last step warning message to be more concise and direct
- Clarify instructions for using the "done" function
- Improve guidance on task completion and success status reporting
- Update last step instructions to emphasize single action requirement
- Clarify guidance for using the "done" action
- Improve wording to highlight task completion and success status reporting
- Add example for setting success to false when task is not fully completed
- Provide more explicit guidance on reporting task completion status
- Improve readability of last step instructions
@MagMueller MagMueller merged commit 4518d25 into main Feb 23, 2025
3 checks passed
@MagMueller MagMueller deleted the force-done-output branch February 23, 2025 17:58
AryamanParida pushed a commit to AryamanParida/browser-use that referenced this pull request Mar 7, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant