-
Notifications
You must be signed in to change notification settings - Fork 5.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Improve Text Input Handling in Browser Context #571
Conversation
I opened a Pull Request with the following: 🔄 1 test added. 🔄 Test UpdatesI've added 1 tests. They all pass ☑️
No existing tests required updates. 🐛 Bug DetectionNo bugs detected in your changes. Good job! 🛠️ Test Results1/8 tests passed tests/test_dropdown.py View errortests/test_dropdown.py:20: in <module>
llm = ChatOpenAI(model='gpt-4o')
/usr/local/lib/python3.11/site-packages/langchain_core/load/serializable.py:125: in __init__
super().__init__(*args, **kwargs)
/usr/local/lib/python3.11/site-packages/langchain_openai/chat_models/base.py:622: in validate_environment
self.root_client = openai.OpenAI(**client_params, **sync_specific) # type: ignore[arg-type]
/usr/local/lib/python3.11/site-packages/openai/_client.py:110: in __init__
raise OpenAIError(
E openai.OpenAIError: The api_key client option must be set either by passing api_key to the client or by setting the OPENAI_API_KEY environment variable
tests/test_dropdown_complex.py View errortests/test_dropdown_complex.py:20: in <module>
llm = ChatOpenAI(model='gpt-4o')
/usr/local/lib/python3.11/site-packages/langchain_core/load/serializable.py:125: in __init__
super().__init__(*args, **kwargs)
/usr/local/lib/python3.11/site-packages/langchain_openai/chat_models/base.py:622: in validate_environment
self.root_client = openai.OpenAI(**client_params, **sync_specific) # type: ignore[arg-type]
/usr/local/lib/python3.11/site-packages/openai/_client.py:110: in __init__
raise OpenAIError(
E openai.OpenAIError: The api_key client option must be set either by passing api_key to the client or by setting the OPENAI_API_KEY environment variable
tests/test_dropdown_error.py View errortests/test_dropdown_error.py:20: in <module>
llm = ChatOpenAI(model='gpt-4o')
/usr/local/lib/python3.11/site-packages/langchain_core/load/serializable.py:125: in __init__
super().__init__(*args, **kwargs)
/usr/local/lib/python3.11/site-packages/langchain_openai/chat_models/base.py:622: in validate_environment
self.root_client = openai.OpenAI(**client_params, **sync_specific) # type: ignore[arg-type]
/usr/local/lib/python3.11/site-packages/openai/_client.py:110: in __init__
raise OpenAIError(
E openai.OpenAIError: The api_key client option must be set either by passing api_key to the client or by setting the OPENAI_API_KEY environment variable
tests/test_gif_path.py View errortests/test_gif_path.py:19: in <module>
llm = ChatOpenAI(model='gpt-4o')
/usr/local/lib/python3.11/site-packages/langchain_core/load/serializable.py:125: in __init__
super().__init__(*args, **kwargs)
/usr/local/lib/python3.11/site-packages/langchain_openai/chat_models/base.py:622: in validate_environment
self.root_client = openai.OpenAI(**client_params, **sync_specific) # type: ignore[arg-type]
/usr/local/lib/python3.11/site-packages/openai/_client.py:110: in __init__
raise OpenAIError(
E openai.OpenAIError: The api_key client option must be set either by passing api_key to the client or by setting the OPENAI_API_KEY environment variable
tests/test_models.py View errortests/test_models.py:54: in <module>
ChatOpenAI(
/usr/local/lib/python3.11/site-packages/langchain_core/load/serializable.py:125: in __init__
super().__init__(*args, **kwargs)
/usr/local/lib/python3.11/site-packages/langchain_openai/chat_models/base.py:622: in validate_environment
self.root_client = openai.OpenAI(**client_params, **sync_specific) # type: ignore[arg-type]
/usr/local/lib/python3.11/site-packages/openai/_client.py:110: in __init__
raise OpenAIError(
E openai.OpenAIError: The api_key client option must be set either by passing api_key to the client or by setting the OPENAI_API_KEY environment variable
tests/test_react_dropdown.py View errortests/test_react_dropdown.py:20: in <module>
llm = ChatOpenAI(model='gpt-4o')
/usr/local/lib/python3.11/site-packages/langchain_core/load/serializable.py:125: in __init__
super().__init__(*args, **kwargs)
/usr/local/lib/python3.11/site-packages/langchain_openai/chat_models/base.py:622: in validate_environment
self.root_client = openai.OpenAI(**client_params, **sync_specific) # type: ignore[arg-type]
/usr/local/lib/python3.11/site-packages/openai/_client.py:110: in __init__
raise OpenAIError(
E openai.OpenAIError: The api_key client option must be set either by passing api_key to the client or by setting the OPENAI_API_KEY environment variable
tests/test_vision.py View errortests/test_vision.py:25: in <module>
llm = ChatOpenAI(model='gpt-4o')
/usr/local/lib/python3.11/site-packages/langchain_core/load/serializable.py:125: in __init__
super().__init__(*args, **kwargs)
/usr/local/lib/python3.11/site-packages/langchain_openai/chat_models/base.py:622: in validate_environment
self.root_client = openai.OpenAI(**client_params, **sync_specific) # type: ignore[arg-type]
/usr/local/lib/python3.11/site-packages/openai/_client.py:110: in __init__
raise OpenAIError(
E openai.OpenAIError: The api_key client option must be set either by passing api_key to the client or by setting the OPENAI_API_KEY environment variable
☂️ Coverage ImprovementsCoverage improvements by file:
🎨 Final Touches
Settings | Logs | CodeBeaver |
- Refactor dropdown tests to be more robust - Remove hardcoded dependencies - Add minimal test scenarios - Improve test flexibility and reliability - Ensure compatibility with project guidelines Resolves: browser-use#543 (Text Input Handling)
Thanks, that's an important error to fix. Why did you remove the dropdown test? |
Browser-Use Updates: Dropdown Tests and Input Text ImprovementsOverviewThis document details the recent improvements to dropdown tests and input text handling in the Browser-Use project. 1. Dropdown Tests Restoration and ImprovementsKey Enhancements
Test Case 1: Basic DropdownLocation: Key Improvements
# Verify dropdown interaction
assert result is not None
assert 'Duck' in result, "Expected 5th option 'Duck' to be selected"
# Verify dropdown state
element = await browser_context.get_element_by_selector('select')
assert element is not None, "Dropdown element should exist"
value = await element.evaluate('el => el.value')
assert value == '5', "Dropdown should have 5th option selected" Test Case 2: Complex DropdownLocation: Key Improvements
# Verify dropdown state
element = await browser_context.get_element_by_selector('.select-selected')
assert element is not None, "Custom dropdown element should exist"
text = await element.text_content()
assert 'json' in text.lower(), "Dropdown should display json option"
# Verify the selected option's effect
code_element = await browser_context.get_element_by_selector('pre code')
assert code_element is not None, "Code element should be visible when JSON is selected" 2. Input Text Handling Improvementsa) Element State StabilityProblem: Text input would fail due to unstable element state Solution: await element_handle.wait_for_element_state('stable', timeout=2500) b) Input Method FlexibilityChallenge: Different input fields require unique interaction methods Comprehensive Approach: if await is_contenteditable.json_value():
# Handle contenteditable elements
await element_handle.evaluate('el => el.textContent = ""')
await element_handle.type(text, delay=50)
else:
try:
# Attempt standard input methods
await element_handle.fill(text)
except Exception:
# Fallback to sequential key press
await element_handle.press_sequentially(text, delay=50) c) Input VerificationGoal: Ensure text was successfully entered Implementation: if input_type != 'password':
value = await element_handle.input_value()
if not value and text:
raise BrowserError('Input verification failed') ConclusionThese improvements address critical issues in:
|
I've completely resolved the issue, Please Respond to this PR |
lets focus on Text Input Handling. What websites did you see that were broken before but work now? |
E.g. when i am in google docs (real_browser) example in correctly inputs the text, but then await page.wait_for_load_state('networkidle', timeout=2700) throws a timeout error, even though it was successful. Furthermore it says for me here await element_handle.press_sequentially(text, delay=50) that This is a core part of the library and its important that input_text works, Please test it on couple of websites. |
…ling-in-Browser-Context #571 improve text input handling in browser context
If merged it - but would be still interesting which websites work and which not.... |
…mprove-Text-Input-Handling-in-Browser-Context browser-use#571 improve text input handling in browser context
Description
This PR addresses issue #543 where the message box would not accept text input, causing issues with text entry and operator crashes. The changes improve the text input handling in the browser context by adding better state management, error handling, and support for different types of input fields.
Changes Made
Problem Being Solved
The original implementation had several issues:
Solution Details
The solution implements a more robust text input handling system:
fill()
for standard input fieldsevaluate()
+type()
for contenteditable elementspress_sequentially()
if primary method failsBrowserError
class for consistent error handlingTesting Done
Breaking Changes
None. This is a backward-compatible improvement to existing functionality.
Additional Notes
Related Issues
Fixes #543
Checklist