Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

URL Metric compression during pagehide event results in no storage request #1928

Closed
westonruter opened this issue Mar 16, 2025 · 5 comments · Fixed by #1929
Closed

URL Metric compression during pagehide event results in no storage request #1928

westonruter opened this issue Mar 16, 2025 · 5 comments · Fixed by #1929
Labels
[Plugin] Optimization Detective Issues for the Optimization Detective plugin [Type] Bug An existing feature is broken

Comments

@westonruter
Copy link
Member

While carefully and extensively testing #1924 on my site, I was noticing that the URL Metric data was not consistently being submitted when navigating between pages. It would be sent if I changed tabs (when the visibilitychange event fires), but it would not be sent at the pagehide event.

I tested in Chrome, Firefox, and Safari. (For Safari, I had to remove the onLCP part due to #1925.) My debugging approach was to add sessionStorage.setItem( 'lastLineExecuted', '{lineNumber}' ) calls at various points in the detect() function after the pagehide, pageswap, and visibilitychange events occur:

detect.js.diff
diff --git a/plugins/optimization-detective/detect.js b/plugins/optimization-detective/detect.js
index cd42fd583..65b9e7f6f 100644
--- a/plugins/optimization-detective/detect.js
+++ b/plugins/optimization-detective/detect.js
@@ -654,27 +654,6 @@ export default async function detect( {
 	/** @type {(LCPMetric|LCPMetricWithAttribution)[]} */
 	const lcpMetricCandidates = [];
 
-	// Obtain at least one LCP candidate. More may be reported before the page finishes loading.
-	await new Promise( ( resolve ) => {
-		onLCP(
-			/**
-			 * Handles an LCP metric being reported.
-			 *
-			 * @param {LCPMetric|LCPMetricWithAttribution} metric
-			 */
-			( metric ) => {
-				lcpMetricCandidates.push( metric );
-				resolve();
-			},
-			{
-				// This avoids needing to click to finalize LCP candidate. While this is helpful for testing, it also
-				// ensures that we always get an LCP candidate reported. Otherwise, the callback may never fire if the
-				// user never does a click or keydown, per <https://github.com/GoogleChrome/web-vitals/blob/07f6f96/src/onLCP.ts#L99-L107>.
-				reportAllChanges: true,
-			}
-		);
-	} );
-
 	// Stop observing.
 	disconnectIntersectionObserver();
 	log( 'Detection is stopping.' );
@@ -748,6 +727,8 @@ export default async function detect( {
 		return;
 	}
 
+	sessionStorage.setItem( 'lastLineExecuted', '751' );
+
 	// Finalize extensions.
 	if ( extensions.size > 0 ) {
 		/** @type {Promise[]} */
@@ -809,6 +790,8 @@ export default async function detect( {
 		}
 	}
 
+	sessionStorage.setItem( 'lastLineExecuted', '814' );
+
 	/*
 	 * Now prepare the URL Metric to be sent as JSON request body.
 	 */
@@ -825,12 +808,16 @@ export default async function detect( {
 		return;
 	}
 
-	const payloadBlob = gzdecodeAvailable
+	sessionStorage.setItem( 'lastLineExecuted', '832' );
+
+	const payloadBlob = gzdecodeAvailable && ! sessionStorage.getItem( 'gzdecodeDisabled' )
 		? await compress( jsonBody )
 		: new Blob( [ jsonBody ], { type: 'application/json' } );
 	const percentOfBudget =
 		( payloadBlob.size / ( maxBodyLengthKiB * 1000 ) ) * 100;
 
+	sessionStorage.setItem( 'lastLineExecuted', '840' );
+
 	/*
 	 * According to the fetch() spec:
 	 * "If the sum of contentLength and inflightKeepaliveBytes is greater than 64 kibibytes, then return a network error."
@@ -859,6 +846,8 @@ export default async function detect( {
 		);
 	}
 
+	sessionStorage.setItem( 'lastLineExecuted', '870' );
+
 	const message = `Sending URL Metric (${ payloadBlob.size.toLocaleString() } bytes, ${ Math.round(
 		percentOfBudget
 	) }% of ${ maxBodyLengthKiB } KiB limit):`;
@@ -887,4 +876,6 @@ export default async function detect( {
 
 	// Clean up.
 	breadcrumbedElementsMap.clear();
+
+	sessionStorage.setItem( 'lastLineExecuted', '901' );
 }

(Note that I didn't use console.trace() because Chrome doesn't seem to put these in the console when the page is being navigated away from, even when console persistence is enabled.)

In all three browsers, when gzip compression is enabled, the lastLineExecuted is usually 832 which is right before compression occurs:

const payloadBlob = gzdecodeAvailable
? await compress( jsonBody )
: new Blob( [ jsonBody ], { type: 'application/json' } );

(I say usually because Chrome sometimes will go ahead and finish executing to the end of the function.)

Safari also produces an actual error in the console:

Image

This error in Safari with the Blob doesn't occur when switching tabs to cause the URL Metric to be sent at the visibilitychange event.

However, when I try disabling gzip compression via:

sessionStorage.setItem('gzdecodeDisabled', 'true')

Then the lastLineExecuted is 901 for all three browsers consistently, which is the very end of the function in Chrome, Safari, and Firefox. And more importantly, when compression is disabled, URL Metrics are consistently submitted when navigating away from a page in each of the three browsers.

This seems to indicate that it is not feasible to compress the URL Metric for submission (proposed in #1893 and implemented in #1905). Nevertheless, there is one way I tried which would allow compression to be used reliably (in Firefox and Chrome): using the beforeunload event instead of the pagehide event. There is a big downside to using the beforeunload event, however: It disables bfcache. Nevertheless, it would only be disabling bfcache when the current client is gathering URL Metrics, which should be a small percentage of the actual visitors (depending on the amount of traffic to the site, of course), for example three visitors to a given page using a mobile device would have bfcache disabled while URL Metrics are collected for their visits. I think this is an acceptable tradeoff. All this being said, Safari throws a different error related to the compressed Blob when using the beforeunload event:

Image

Therefore, it seems using gzip compression is not going to be viable at least for the near future. This will need to be reverted for the next release.

@b1ink0
Copy link
Contributor

b1ink0 commented Mar 16, 2025

@westonruter
We could compress the URL metrics before the pagehide promise then it does not cause the issue that you have highlighted here. We could use this solution if we are not modifying the the URL metrics after the pagehide promise is resolved. Here's a patch that could resolve this issue.

diff --git a/plugins/optimization-detective/detect.js b/plugins/optimization-detective/detect.js
index cd42fd58..63ecb0c4 100644
--- a/plugins/optimization-detective/detect.js
+++ b/plugins/optimization-detective/detect.js
@@ -725,6 +725,11 @@ export default async function detect( {
 
 	log( 'Current URL Metric:', urlMetric );
 
+	let compressedPayloadBlob;
+	if ( gzdecodeAvailable ) {
+		compressedPayloadBlob = await compress( JSON.stringify( urlMetric ) );
+	}
+
 	// Wait for the page to be hidden.
 	await new Promise( ( resolve ) => {
 		win.addEventListener( 'pagehide', resolve, { once: true } );
@@ -826,7 +831,7 @@ export default async function detect( {
 	}
 
 	const payloadBlob = gzdecodeAvailable
-		? await compress( jsonBody )
+		? compressedPayloadBlob
 		: new Blob( [ jsonBody ], { type: 'application/json' } );
 	const percentOfBudget =
 		( payloadBlob.size / ( maxBodyLengthKiB * 1000 ) ) * 100;

Alternatively we could also disable URL metrics compression for the pagehide event if we are modifying the the URL metrics after the pagehide promise is resolved.

diff --git a/plugins/optimization-detective/detect.js b/plugins/optimization-detective/detect.js
index cd42fd58..4bce19ea 100644
--- a/plugins/optimization-detective/detect.js
+++ b/plugins/optimization-detective/detect.js
@@ -727,7 +727,14 @@ export default async function detect( {
 
 	// Wait for the page to be hidden.
 	await new Promise( ( resolve ) => {
-		win.addEventListener( 'pagehide', resolve, { once: true } );
+		win.addEventListener(
+			'pagehide',
+			function () {
+				gzdecodeAvailable = false; // Prevent sending compressed data.
+				resolve();
+			},
+			{ once: true }
+		);
 		win.addEventListener( 'pageswap', resolve, { once: true } );
 		doc.addEventListener(
 			'visibilitychange',

@westonruter
Copy link
Member Author

@b1ink0 Due to the client-side extension script modules, the URL Metric data may be modified at any point up until pagehide. Specifically here:

// Finalize extensions.
if ( extensions.size > 0 ) {
/** @type {Promise[]} */
const extensionFinalizePromises = [];
/** @type {string[]} */
const finalizingExtensionModuleUrls = [];
for ( const [
extensionModuleUrl,
extension,
] of extensions.entries() ) {
if ( extension.finalize instanceof Function ) {
const extensionLogger = createLogger(
isDebug,
`[Optimization Detective: ${
extension.name || 'Unnamed Extension'
}]`
);
try {
const finalizePromise = extension.finalize( {
isDebug,
...extensionLogger,
getRootData,
getElementData,
extendElementData,
extendRootData,
} );
if ( finalizePromise instanceof Promise ) {
extensionFinalizePromises.push( finalizePromise );
finalizingExtensionModuleUrls.push(
extensionModuleUrl
);
}
} catch ( err ) {
error(
`Unable to start finalizing extension '${ extensionModuleUrl }':`,
err
);
}
}
}
// Wait for all extensions to finish finalizing.
const settledFinalizePromises = await Promise.allSettled(
extensionFinalizePromises
);
for ( const [
i,
settledFinalizePromise,
] of settledFinalizePromises.entries() ) {
if ( settledFinalizePromise.status === 'rejected' ) {
error(
`Failed to finalize extension '${ finalizingExtensionModuleUrls[ i ] }':`,
settledFinalizePromise.reason
);
}
}
}

This used by Image Prioritizer to populate the lcpElementExternalBackgroundImage property:

export async function finalize( { extendRootData, log: _log } ) {
// eslint-disable-next-line no-console
const log = _log || console.log; // TODO: Remove once Optimization Detective likely updated, or when strict version requirement added in od_init action.
if ( externalBackgroundImages.length === 0 ) {
return;
}
// Get the last detected external background image which is going to be for the LCP element (or very likely will be).
const lcpElementExternalBackgroundImage = externalBackgroundImages.pop();
log(
'Sending external background image for LCP element:',
lcpElementExternalBackgroundImage
);
extendRootData( { lcpElementExternalBackgroundImage } );
}

(Granted, this might be able to be refactored so that it happens at initialize(), so that in the handleLCPMetric() function it calls the extendRootData() function every time. This would depend on passing extendRootData() and extendElementData() to initialize() and we'd need to define urlMetric earlier. This would allow Image Prioritizer to get rid of its externalBackgroundImages variable since it would be directly putting the LCP data into the urlMetric whenever a new one is detected.)

However, this also used by Embed Optimizer to populate the resizedBoundingClientRect property of an element:

export async function finalize( {
log: _log,
error: _error,
getElementData,
extendElementData,
} ) {
/* eslint-disable no-console */
// TODO: Remove once Optimization Detective likely updated, or when strict version requirement added in od_init action.
const log = _log || console.log;
const error = _error || console.error;
/* eslint-enable no-console */
for ( const [ xpath, domRect ] of loadedElementContentRects.entries() ) {
try {
extendElementData( xpath, {
resizedBoundingClientRect: domRect,
} );
const elementData = getElementData( xpath );
log(
`boundingClientRect for ${ xpath } resized:`,
elementData.boundingClientRect,
'=>',
domRect
);
} catch ( err ) {
error(
`Failed to extend element data for ${ xpath } with resizedBoundingClientRect:`,
domRect,
err
);
}
}
}

This latter example is more important here because the resize observer will fire only when the embeds get resized, which due to them being lazy loaded, will only happen after the user scrolls down in the page and thus at an indeterminate time after the page has loaded. While the LCP metric happens around when the page loads, normally, the CLS metric is cumulative across the life of the page, so we need to be able to keep capturing the data.

The reason for sending the URL Metric at pagehide is also so that we can gather INP data which also gets reported throughout the life of the page.

Alternatively we could also disable URL metrics compression for the pagehide event if we are modifying the the URL metrics after the pagehide promise is resolved.

But this would entail sending the URL Metric data twice, once when the page loads and then again to try doing when leaving the page, right?

@b1ink0
Copy link
Contributor

b1ink0 commented Mar 16, 2025

Alternatively we could also disable URL metrics compression for the pagehide event if we are modifying the the URL metrics after the pagehide promise is resolved.

But this would entail sending the URL Metric data twice, once when the page loads and then again to try doing when leaving the page, right?

No, this would only disable the URL metrics compression logic, ensuring that detect.js behaves the same as it did before the compression logic was introduced. However, this means that URL metrics compression will only occur on the visibilitychange event. I'm not sure if it's worth adding all this logic solely for the visibilitychange event, especially since most URL metrics will be collected by the pagehide event.

@westonruter
Copy link
Member Author

@b1ink0 OK, it seems that could work since I understand the pagehide event fires before the visibilitychange event when leaving a page. But yeah, this means compressed URL Metrics would only be sent at visibilitychange which is actually not the event we wanted to use in the first place as noted by this TODO:

doc.addEventListener(
'visibilitychange',
() => {
if ( document.visibilityState === 'hidden' ) {
// TODO: This will fire even when switching tabs.
resolve();
}
},
{ once: true }
);

I can't remember why I added the visibilitychange event, other than it being an alternative to using beforeunload, the same as pagehide is. I vaguely remember pagehide not working in some browsers at the time and that this is why I included it.

@westonruter
Copy link
Member Author

westonruter commented Mar 17, 2025

@b1ink0 Here's another idea, which is somewhat a variant of your idea: #1893 (comment)

The only downside is it means compression would happen potentially multiple times in the life of the page to re-compress the URL Metric data after each modification (with debouncing).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
[Plugin] Optimization Detective Issues for the Optimization Detective plugin [Type] Bug An existing feature is broken
Projects
Status: Done 😃
Development

Successfully merging a pull request may close this issue.

2 participants