Normalize filesystem operations when syncing from memory fs to OPFS by ashfame · Pull Request #2252 · WordPress/wordpress-playground

ashfame · 2025-06-11T07:35:41Z

This PR supersedes https://github.com/Automattic/wordpress-playground-private/pull/126

Motivation for the change, related issues

When using OPFS, the memory fs sync to OPFS doesn't normalize filesystem operations, even though there is a normalizing file system operations function already available. In a previous attempt, a new journal compression function was added that followed the following rules but partitioned around RENAME operations:

Multiple WRITE operations of the same file (at end of request, writing once is enough)
CREATE operation before WRITE operation (no need to CREATE separately)
DELETE operation after CREATE/WRITE operation (no need to handle CREATE/WRITE in these cases)

While reconciling the logic of both functions, I discovered the already existing function handles pretty much all rules that were implemented (and some more), so in this PR, I have extended the tests to ensure the expected behavior out of normalizeFilesystemOperations()

A test case was found which was failing and the fix for that in the logic was implemented.
And now OPFSRewriter normalizes fs operations before executing them to keep OPFS in sync.

Improvements in journal size that's processed (screenshot from earlier PR):

438687809-b4c6c627-7742-41c1-8e82-d9f2d06c9248

…ps to a single WRITE

…e WRITE (file)

…single WRITE (file)

Copilot

Pull Request Overview

This PR normalizes filesystem operations when syncing from the memory fs to OPFS by leveraging an existing normalization function and aligning behavior through updated tests and logic.

The flushJournal function now applies normalizeFilesystemOperations to optimize processing.
Comprehensive tests have been added to cover various scenarios involving CREATE, WRITE, RENAME, and DELETE operations.
The normalization logic in fs-journal.ts has been refined to handle DELETE redundancies more explicitly.

Reviewed Changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 1 comment.

File	Description
packages/php-wasm/web/src/lib/directory-handle-mount.ts	Updates flushJournal to use normalized journal entries for OPFS syncing and includes concurrency considerations.
packages/php-wasm/fs-journal/src/test/fs-journal.spec.ts	Adds tests ensuring normalizeFilesystemOperations covers multiple file operation scenarios.
packages/php-wasm/fs-journal/src/lib/fs-journal.ts	Adjusts normalization logic and comments to clarify DELETE redundancy handling following a CREATE/WRITE operation.

Comments suppressed due to low confidence (1)

packages/php-wasm/fs-journal/src/lib/fs-journal.ts:457

Consider clarifying in the comment that the DELETE operation is considered redundant only for nodes created in the current journal, ensuring the intent is clear.

if (former.operation === 'CREATE') {

packages/php-wasm/web/src/lib/directory-handle-mount.ts

Co-authored-by: Copilot <[email protected]>

adamziel · 2025-06-13T15:24:19Z

packages/php-wasm/web/src/lib/directory-handle-mount.ts

+		// Concurrency safety note
+		// As I understand it, journal is specific to a PHP instance,
+		// so it's not possible to have concurrency push of entries to journal
+		// But this can change in future so it doesn't hurt to read from journal
+		// in a concurrent safe way, which is what we are doing here.
+
+		// We first copy it to a new array
+		const journalEntries = [...journal];
+		// and then only delete however many entries we were able to grab
+		// since with concurrent writes there could have been more insertions
+		journal.splice(0, journalEntries.length);


Good call! Let's just talk about asynchronous writers, not concurrent writers. I got confused for a second and wrote a big reply about how JS is single-threaded, and only then I realized how this change works 🙈

@adamziel I indeed meant concurrent writes in this context since web workers offer true concurrency, so I believe my comment reflects the intent accurately. I will wait for you to respond before merging this PR :)

Aha! journal is a regular variable and it can only be accessed and modified from the same thread. We should never see conflicting unshift() calls.

As for concurrent filesystem writes - I don't think the journaling approach will generalize to concurrent rw access.

We'd need a live two-way sync between the source of truth (e.g. opfs) and memfs. It's possible, but the complexity would be really bad. We'd effectively turn Playground into a distributed system. I'd rather integrate OPFS directly via e dedicated Emscripten filesystem backen, in which case there would be no journal, or implement a SharedArrayBuffer MEMFS with a single OPFS writer.

Gotcha! I will leave the comment as is then & merge it.

ashfame added 5 commits June 11, 2025 11:28

additional test - Normalises CREATE and WRITE + multiple WRITE file o…

134f49d

…ps to a single WRITE

additional test - Normalizes CREATE and RENAME with WRITEs to a singl…

86c4d86

…e WRITE (file)

additional test case - Normalizes CREATE and RENAME with WRITEs to a …

8d970cb

…single WRITE (file)

addition test case - Normalizes CREATE/WRITE and DELETE to an empty list

4b99e97

normalize fs ops inside journalFSEventsToOpfs()

0c9aada

ashfame self-assigned this Jun 11, 2025

github-project-automation bot added this to Playground Board Jun 11, 2025

github-project-automation bot moved this to Inbox in Playground Board Jun 11, 2025

ashfame added 3 commits June 11, 2025 11:37

fix comment

acb909b

add test case which currently fails

5441b71

fix failing test

94d5fe3

ashfame requested a review from adamziel June 11, 2025 14:12

ashfame marked this pull request as ready for review June 11, 2025 14:12

ashfame requested review from bgrgicak, brandonpayton, Copilot and zaerl June 11, 2025 14:14

Copilot AI reviewed Jun 11, 2025

View reviewed changes

packages/php-wasm/web/src/lib/directory-handle-mount.ts Outdated Show resolved Hide resolved

Use for loop to avoid shift() call on our big array

76b0a03

Co-authored-by: Copilot <[email protected]>

adamziel reviewed Jun 13, 2025

View reviewed changes

adamziel approved these changes Jun 13, 2025

View reviewed changes

github-project-automation bot moved this from Inbox to Reviewed in Playground Board Jun 13, 2025

ashfame merged commit 8f01a29 into trunk Jun 16, 2025
24 checks passed

ashfame deleted the journal_compression_opfs branch June 16, 2025 12:04

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Normalize filesystem operations when syncing from memory fs to OPFS#2252

Normalize filesystem operations when syncing from memory fs to OPFS#2252
ashfame merged 9 commits intotrunkfrom
journal_compression_opfs

ashfame commented Jun 11, 2025 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

adamziel Jun 13, 2025

Uh oh!

ashfame Jun 13, 2025

Uh oh!

adamziel Jun 15, 2025 •

edited

Loading

Uh oh!

ashfame Jun 16, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

ashfame commented Jun 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Motivation for the change, related issues

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Uh oh!

adamziel Jun 13, 2025

Choose a reason for hiding this comment

Uh oh!

ashfame Jun 13, 2025

Choose a reason for hiding this comment

Uh oh!

adamziel Jun 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ashfame Jun 16, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

ashfame commented Jun 11, 2025 •

edited

Loading

adamziel Jun 15, 2025 •

edited

Loading