Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Screen ratio fluctuating while capturing multiple screenshots #663

Closed
Kirtan2607 opened this issue Feb 11, 2025 · 2 comments
Closed

Screen ratio fluctuating while capturing multiple screenshots #663

Kirtan2607 opened this issue Feb 11, 2025 · 2 comments
Labels
bug Something isn't working

Comments

@Kirtan2607
Copy link

Bug Description

My.Movie.mp4

Using the openai o3-mini model.

When running the given code to take a screenshot, the agent does not work properly. The screen ratio continuously changes, making it look like it's lagging. This behavior affects usability and causes instability in the UI.

Reproduction Steps

  1. Install browser-use and necessary dependencies.
  2. Run the following task to capture a screenshot.
  3. Observe the screen flickering and ratio changing unexpectedly.

Code Sample

import asyncio
import base64

class BrowserSingleton:
    _instance = None
    _lock = asyncio.Lock()

    @classmethod
    async def get_instance(cls):
        async with cls._lock:
            if cls._instance is None:
                cls._instance = Browser()  
            return cls._instance

    @classmethod
    async def close(cls):
        async with cls._lock:
            if cls._instance:
                await cls._instance.close()
                cls._instance = None

class AgentManager:
    def __init__(self, agent_id):
        self.id = agent_id
        self.browser = None
        self.context = None
        self.agent = None

    async def run_agent(self):
        self.browser = await BrowserSingleton.get_instance()
        self.context = await self.browser.new_context(
            BrowserContextConfig(
                highlight_elements=False,
                save_recording_path=f"videos/{self.id}",
            )
        )

        self.agent = Agent(
                task="Go to google.com",
                llm=self.llm, #o3-mini
                browser=self.browser,
                browser_context=self.context,
                use_vision=False,
                generate_gif=False,
            )

    async def get_screenshot(self):
        if not self.context or not self.context.session:
            return None

        try:
            page = await self.context.get_current_page()
            screenshot = await page.screenshot(full_page=False, animations="disabled")
            return base64.b64encode(screenshot).decode("utf-8")
        except Exception as e:
            print(f"Error taking screenshot: {e}")
            return None

async def main():
    manager = AgentManager(agent_id="agent_1")
    task_future = asyncio.create_task(manager.run_agent())

    while not task_future.done():
        screenshot = await manager.get_screenshot()
        if screenshot:
            print("Screenshot captured:", screenshot[:30] + "...")  
        await asyncio.sleep(5)

    await task_future 

if __name__ == "__main__":
    asyncio.run(main())

Version

0.1.36

LLM Model

Other (specify in description)

Operating System

macOS 13

Relevant Log Output

@Kirtan2607 Kirtan2607 added the bug Something isn't working label Feb 11, 2025
@PaperBoardOfficial
Copy link
Contributor

I raised a PR, check if it resolves your issue.

@pirate
Copy link
Member

pirate commented Mar 25, 2025

Fixed by @PaperBoardOfficial, thanks!

@pirate pirate closed this as completed Mar 25, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

3 participants