IncrementalIDB - MegaChunking by radex · Pull Request #874 · techfort/LokiJS

radex · 2020-12-10T09:29:34Z

(WIP, please don't review the code yet)

Yes, it's me again, with yet another pull request full of strange, complicated code — and another promise that it's worth it for performance 🙃

I think I'm approaching the limits of what IndexedDB can do performance-wise, but it's important for my use case to squeeze all that's possible out of it ;)

TL;DR: It loads the database 22% faster ;)

I made a picture to explain the problem that this PR is trying to solve:

IndexedDB is implemented (in all browsers as far as I can tell, but certainly in Chrome and Safari) with a multi-process architecture, and the cross-process communication is not very efficient. This can be seen above - waiting for IDB to fetch data from disk takes relatively little time, and most of the time is spent waiting for the XPC dance to complete transferring data -- and clearly, it's not very well tuned, as the CPU usage in the browser process is very low.

So the goal is to:

layer enough work at the same time that CPU utilization stays high
reduce the initial wait for IDB when no work happens on main thread
take better advantage of the concurrency opportunity, and try to keep both main/browser and IDB processes busy at the same time.

This is what I achieved:

This achieves 22% improvement on my benchmark, and likely more free performance for apps that didn't opt to manually tune IncrementalIDB by supplying serializeChunk/deserializeChunk.

Instead of calling IDBObjectStore.getAll(), I'm fetching multiple megachunks (chunks of chunks 🙃) - currently 20 requests using adjacent IDBKeyRanges. AFAICT, the IDB process in both Safari and Chrome does the first phase (actual disk/db work) sequentially, so there's no win here, but the XPC is more efficient for some reason. I guess since the IDB process sends more messages to browser process, there are fewer gaps in processing them on browser side, so CPU utilization stays higher.

In a further improvement (I call this megachunk interleaving), I only request first half of the megachunks initially, and then in in onsuccess of each one I request the (i+n/2)th chunk. This reduces the initial wait for IDB to almost nothing, and improves concurrency, as the IDB process is kept busy while JS is processing the first half of its work. (I also moved most of the chunk processing - JSON.parse and optional deserializeChunk from the end of the process much earlier - to each megachunk's onSuccess, so that main and IDB processes can be kept busy at the same time… I think this should also improve GC pressure a little bit, but I haven't yet figured out a good technique for measuring that, since it's very noisy)

I'm almost out of ideas for further improvements for now, and the law of diminishing returns is catching up to me, so it'll probably the last PR in the series for a while...

PS. In case you were wondering about using IDBCursor to maximize concurrency opportunity — I tried that multiple times, and it doesn't work. I tried interleaving multiple IDBCursors, and I got to nearly the same performance as interleaved megachunking, but still slower. There are just too many useless pauses on main thread...

radex · 2021-01-20T10:23:49Z

@techfort We've been running this internally for a while, found no issues so far

techfort · 2021-01-22T08:43:44Z

@radex this looks fantastic, i'm merging and sometime today i'll get round to doing a new release. I should really automate this release crap

radex added 11 commits December 9, 2020 17:05

[IncrementalIDB] populateLoki debug

e922219

[IncrementalIDB] Megachunking PoC - back!

a5d6b4d

[IncrementalIDB] Process megachunks as they come - 15% win

1f6ece2

[IncrementalIDB] Interleaving

24d6aa3

More IDB cursor experiments

2dc821c

[IncrementalIDB] Clean up…

be68cc6

[IncrementalIDB] Clean up

3b0768a

[IncrementalIDB] Document the idea behind MegaChunking briefly

a8a5e36

Merge branch 'master' of github.com:techfort/LokiJS

5f3312e

Merge branch 'master' into megachunking

608102f

[IncrementalIDB] Tweaks & clean up

9369a91

radex mentioned this pull request Dec 11, 2020

[perf] Faster web app launch with MegaChunking Nozbe/WatermelonDB#893

Merged

radex marked this pull request as ready for review January 20, 2021 10:23

radex changed the title ~~[WIP] IncrementalIDB - MegaChunking~~ IncrementalIDB - MegaChunking Jan 20, 2021

techfort closed this Jan 22, 2021

techfort reopened this Jan 22, 2021

techfort merged commit 189e103 into techfort:master Jan 22, 2021

radex deleted the megachunking branch January 22, 2021 09:05

radex mentioned this pull request Oct 13, 2021

Incremental IDB performance improvements #899

Merged

acdcjunior mentioned this pull request Jun 7, 2022

Adding encryption capabilities to incremental-indexeddb-adapter.js Nozbe/WatermelonDB#1096

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Comments

IncrementalIDB - MegaChunking#874

IncrementalIDB - MegaChunking#874
techfort merged 11 commits intotechfort:masterfrom
Nozbe:megachunking

radex commented Dec 10, 2020 •

edited

Loading

Uh oh!

radex commented Jan 20, 2021

Uh oh!

techfort commented Jan 22, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Comments

Conversation

radex commented Dec 10, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

radex commented Jan 20, 2021

Uh oh!

techfort commented Jan 22, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

radex commented Dec 10, 2020 •

edited

Loading