-
-
Notifications
You must be signed in to change notification settings - Fork 1.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Autoscroll before before archiving and take full-height screenshots #80
Comments
The code provided in this playwright issue solves the full-page screenshot problem for me Here is the code I use to take a full page screenshot with playwright
|
Is this feature natively available now or only via hacking in user scripts? |
Not available natively yet, it's blocked on #51 |
Ah fair enough, thanks! Seems like #51 encapsulates a whole ton of effort to make this happen, so thanks and good luck! |
will be great to have the ability to take full height screenshots! any update on this after 4 years? |
My conclusion after a lot of work on this issue is that full-page screenshots up to ~8000px maximum height are ok, but many many pages are longer than that, and most common image formats actually don't support images that big. Even the formats that do (png) cause most image viewers to crash when you try to open them. You need to mess with Chrome's GPU memory settings to even get it to take more than 16,000px in one image, let alone the 90,000px+ that some long comment thread pages have. Multiple screenshots are the better solution. My solution so far is one 4:3 screenshot at the top of the page, and then numbered 16:10 screenshots for like ~15 full-height scrolls down the page. Also works great for feeding it to vision and OCR models for analysis. I built this ^ more advanced puppeteer based screenshot approach for a paying client last year, and it's still in active development. It's all in TS and ArchiveBox is all Python, so it takes time to bridge that gap, refactor, open source it, document it, package it, ship it, etc. for the public. |
I've sumbitted a Chromium bug tracker feature request for adding a
--full-page
flag: https://bugs.chromium.org/p/chromium/issues/detail?id=854013Hopefully it's merged, allowing us to screenshot the full height of pages, instead of limiting them to the config settings defined by
DIMENSIONS
.The text was updated successfully, but these errors were encountered: