Improve DOM Processing #616

maticzav · 2025-02-07T17:53:03Z

This PR improves the DOM processing functionality. Instead of saving DOM in a tree structure, it saves elements to a hash map and keeps the pointers for rewiring in Python.

It's a drop in replacement that changes no interfaces.

Additionally, it improves the calculation of the DOM elements in the JS eval script by ignoring the added indicator elements.

CLAassistant · 2025-02-07T17:53:12Z

All committers have signed the CLA.

codebeaver-ai · 2025-02-07T18:21:46Z

I opened a Pull Request with the following:

🔄 1 test added.
🐛 Found 1 bug
🛠️ 1/8 tests passed

🔄 Test Updates

I've added 1 tests. They all pass ☑️
New Tests:

tests/test_views.py

No existing tests required updates.

🐛 Bug Detection

Potential issues found in the following files:

browser_use/dom/service.py

The error is raised by the code in Registry.execute_action when it calls the action function. Although the test calls execute_action with a “browser” keyword argument (and so the merged parameters should include browser), the function ends up not actually receiving that argument. In other words, the extra browser parameter is either being dropped or not passed correctly when calling action.function(**validated_params.model_dump(), **extra_args). This causes the underlying async function (test_action_with_browser) to be invoked without its required “browser” parameter, triggering a TypeError. Thus, the registry’s implementation of execute_action is buggy in how it handles and passes extra keyword arguments to the action function.

🛠️ Test Results

1/8 tests passed ⚠️

tests/test_dropdown.py

View error

tests/test_dropdown.py:20: in <module>
    llm = ChatOpenAI(model='gpt-4o')
/usr/local/lib/python3.11/site-packages/langchain_core/load/serializable.py:125: in __init__
    super().__init__(*args, **kwargs)
/usr/local/lib/python3.11/site-packages/langchain_openai/chat_models/base.py:622: in validate_environment
    self.root_client = openai.OpenAI(**client_params, **sync_specific)  # type: ignore[arg-type]
/usr/local/lib/python3.11/site-packages/openai/_client.py:110: in __init__
    raise OpenAIError(
E   openai.OpenAIError: The api_key client option must be set either by passing api_key to the client or by setting the OPENAI_API_KEY environment variable

_{tests/test_dropdown.py}

tests/test_dropdown_complex.py

View error

tests/test_dropdown_complex.py:20: in <module>
    llm = ChatOpenAI(model='gpt-4o')
/usr/local/lib/python3.11/site-packages/langchain_core/load/serializable.py:125: in __init__
    super().__init__(*args, **kwargs)
/usr/local/lib/python3.11/site-packages/langchain_openai/chat_models/base.py:622: in validate_environment
    self.root_client = openai.OpenAI(**client_params, **sync_specific)  # type: ignore[arg-type]
/usr/local/lib/python3.11/site-packages/openai/_client.py:110: in __init__
    raise OpenAIError(
E   openai.OpenAIError: The api_key client option must be set either by passing api_key to the client or by setting the OPENAI_API_KEY environment variable

_{tests/test_dropdown_complex.py}

tests/test_dropdown_error.py

View error

tests/test_dropdown_error.py:20: in <module>
    llm = ChatOpenAI(model='gpt-4o')
/usr/local/lib/python3.11/site-packages/langchain_core/load/serializable.py:125: in __init__
    super().__init__(*args, **kwargs)
/usr/local/lib/python3.11/site-packages/langchain_openai/chat_models/base.py:622: in validate_environment
    self.root_client = openai.OpenAI(**client_params, **sync_specific)  # type: ignore[arg-type]
/usr/local/lib/python3.11/site-packages/openai/_client.py:110: in __init__
    raise OpenAIError(
E   openai.OpenAIError: The api_key client option must be set either by passing api_key to the client or by setting the OPENAI_API_KEY environment variable

_{tests/test_dropdown_error.py}

tests/test_gif_path.py

View error

tests/test_gif_path.py:19: in <module>
    llm = ChatOpenAI(model='gpt-4o')
/usr/local/lib/python3.11/site-packages/langchain_core/load/serializable.py:125: in __init__
    super().__init__(*args, **kwargs)
/usr/local/lib/python3.11/site-packages/langchain_openai/chat_models/base.py:622: in validate_environment
    self.root_client = openai.OpenAI(**client_params, **sync_specific)  # type: ignore[arg-type]
/usr/local/lib/python3.11/site-packages/openai/_client.py:110: in __init__
    raise OpenAIError(
E   openai.OpenAIError: The api_key client option must be set either by passing api_key to the client or by setting the OPENAI_API_KEY environment variable

_{tests/test_gif_path.py}

tests/test_models.py

View error

tests/test_models.py:54: in <module>
    ChatOpenAI(
/usr/local/lib/python3.11/site-packages/langchain_core/load/serializable.py:125: in __init__
    super().__init__(*args, **kwargs)
/usr/local/lib/python3.11/site-packages/langchain_openai/chat_models/base.py:622: in validate_environment
    self.root_client = openai.OpenAI(**client_params, **sync_specific)  # type: ignore[arg-type]
/usr/local/lib/python3.11/site-packages/openai/_client.py:110: in __init__
    raise OpenAIError(
E   openai.OpenAIError: The api_key client option must be set either by passing api_key to the client or by setting the OPENAI_API_KEY environment variable

_{tests/test_models.py}

tests/test_react_dropdown.py

View error

tests/test_react_dropdown.py:20: in <module>
    llm = ChatOpenAI(model='gpt-4o')
/usr/local/lib/python3.11/site-packages/langchain_core/load/serializable.py:125: in __init__
    super().__init__(*args, **kwargs)
/usr/local/lib/python3.11/site-packages/langchain_openai/chat_models/base.py:622: in validate_environment
    self.root_client = openai.OpenAI(**client_params, **sync_specific)  # type: ignore[arg-type]
/usr/local/lib/python3.11/site-packages/openai/_client.py:110: in __init__
    raise OpenAIError(
E   openai.OpenAIError: The api_key client option must be set either by passing api_key to the client or by setting the OPENAI_API_KEY environment variable

_{tests/test_react_dropdown.py}

tests/test_vision.py

View error

tests/test_vision.py:25: in <module>
    llm = ChatOpenAI(model='gpt-4o')
/usr/local/lib/python3.11/site-packages/langchain_core/load/serializable.py:125: in __init__
    super().__init__(*args, **kwargs)
/usr/local/lib/python3.11/site-packages/langchain_openai/chat_models/base.py:622: in validate_environment
    self.root_client = openai.OpenAI(**client_params, **sync_specific)  # type: ignore[arg-type]
/usr/local/lib/python3.11/site-packages/openai/_client.py:110: in __init__
    raise OpenAIError(
E   openai.OpenAIError: The api_key client option must be set either by passing api_key to the client or by setting the OPENAI_API_KEY environment variable

_{tests/test_vision.py}

☂️ Coverage Improvements

Coverage improvements by file:

tests/test_views.py

New coverage: 99.07%
Improvement: +99.07%

🎨 Final Touches

I ran the hooks included in the pre-commit config.

_{Settings | Logs | CodeBeaver}

Improve DOM Processing

maticzav added 14 commits February 6, 2025 20:05

stash

ca90d0f

stash

0acce1f

stash

568c3c7

cleanup

7a50c50

Delete performance_test.py

0a416b6

Update service.py

d0142da

cleanup

1200f76

Update context.py

9e02514

fixes

2d59091

mem

76f7516

cleanup

a433c47

Update simple.py

70ed35f

Update views.py

45d5b1d

ignore highlight elements

76614f0

maticzav added 2 commits February 7, 2025 18:56

Update buildDomTree.js

81056a2

Update buildDomTree.js

4d1040d

codebeaver-ai bot mentioned this pull request Feb 7, 2025

Improve DOM Processing - Unit Tests #617

Closed

gregpr07 requested a review from MagMueller February 11, 2025 02:58

MagMueller merged commit 87f4d74 into main Feb 11, 2025
3 checks passed

MagMueller deleted the fix/memory branch February 11, 2025 03:11

AryamanParida pushed a commit to AryamanParida/browser-use that referenced this pull request Mar 7, 2025

Merge pull request browser-use#616 from browser-use/fix/memory

7347b16

Improve DOM Processing

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve DOM Processing #616

Improve DOM Processing #616

maticzav commented Feb 7, 2025

CLAassistant commented Feb 7, 2025 •

edited

Loading

codebeaver-ai bot commented Feb 7, 2025

Improve DOM Processing #616

Improve DOM Processing #616

Conversation

maticzav commented Feb 7, 2025

CLAassistant commented Feb 7, 2025 • edited Loading

codebeaver-ai bot commented Feb 7, 2025

🔄 Test Updates

🐛 Bug Detection

🛠️ Test Results

☂️ Coverage Improvements

🎨 Final Touches

CLAassistant commented Feb 7, 2025 •

edited

Loading