
Fluffy: Implement offer cache to hold content ids of recent offers #3233

Merged

bhartnett merged 19 commits into master from fluffy-offer-contentid-cache on Apr 24, 2025

Conversation

@bhartnett
Contributor

@bhartnett bhartnett commented Apr 23, 2025

Reduces load on the database during the gossip process: it is very common to receive multiple copies of the same content from different peers as content is gossiped through the network.

Changes in this PR:

  • The offer cache holds the content ids of recently stored offers.
  • The offer cache is checked before the database during the offer flow.
  • If the database is pruned, the content ids in the offer cache are invalidated so that we don't incorrectly report pruned content as already stored.
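The three points above can be sketched as follows. This is a minimal, illustrative Python sketch only — the actual implementation is in Nim (using minilru), and all names here (`OfferCache`, `already_stored`) are hypothetical:

```python
from collections import OrderedDict

class OfferCache:
    """Illustrative LRU cache of recently stored content ids.

    Hypothetical sketch; the real Fluffy code is Nim and uses minilru.
    """

    def __init__(self, capacity: int = 256):
        self.capacity = capacity
        self.ids = OrderedDict()  # content_id -> True, insertion-ordered

    def put(self, content_id: bytes) -> None:
        self.ids[content_id] = True
        self.ids.move_to_end(content_id)
        if len(self.ids) > self.capacity:
            self.ids.popitem(last=False)  # evict least recently used

    def contains(self, content_id: bytes) -> bool:
        if content_id in self.ids:
            self.ids.move_to_end(content_id)  # refresh recency on a hit
            return True
        return False

    def clear(self) -> None:
        # Called after a database prune: the cached ids can no longer be
        # trusted to reflect what is actually still stored.
        self.ids.clear()

def already_stored(cache: OfferCache, db: set, content_id: bytes) -> bool:
    # Check the cache first, avoiding a database lookup in the common
    # case of duplicate offers arriving from multiple peers.
    if cache.contains(content_id):
        return True
    return content_id in db
```

The point of the cache-first check is that duplicate offers, which are the common case during gossip, never touch the database at all.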

@bhartnett bhartnett requested a review from kdeme April 23, 2025 06:08
Contributor

@kdeme kdeme left a comment


I'm fine with this cache addition, as offers typically do come in fairly close to each other and the change is quite minimal.

Two things however:

  • Curious to see any actual data on this.
  • I was also wondering how often this occurs versus the scenario where offers come in so close to each other that the actual content of the first offer is not stored yet. Of course, for that scenario we cannot really add a cache, as the offered content could fail to send or turn out to be invalid.

Comment thread fluffy/network/wire/portal_protocol.nim Outdated
Comment on lines +1818 to +1819
for k, v in p.offerCache.mpairs():
v = false
Contributor


Probably faster to just reinitialize the cache? And that way you also don't need the boolean, I think?

Contributor Author


Probably faster to just reinitialize the cache? And that way you also don't need the boolean, I think?

I was looking through the minilru code and I didn't find a clear function, so I went with this method. But yes, reinit is probably better. I'll update.
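The trade-off under discussion can be sketched abstractly (Python, illustrative only; in the Nim code "reinit" would mean re-creating the minilru cache rather than iterating its entries):

```python
# Flag-based invalidation: walk every entry and mark it stale.
# O(n), and every entry has to carry a boolean payload just for this.
def invalidate_flags(cache: dict) -> None:
    for k in cache:
        cache[k] = False

# Reinitialization: drop the whole cache in one step and start fresh.
# No per-entry boolean is needed, since presence in the cache now
# means "recently stored" by itself.
def invalidate_reinit() -> dict:
    return {}
```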

@bhartnett
Contributor Author

bhartnett commented Apr 23, 2025

  • Curious to see any actual data on this.

I guess I can add cache hit/miss metrics for this and gossip some data in a local testnet to see the results.

  • I was also wondering how often this occurs versus the version where offers come in too close to each other that the actual content of the first offer is not stored yet. Of course for that scenario we cannot really add a cache as the content offered could be failed to send or invalid.

This is something we could use metrics to get data on as well. To address this problem I think we should put a limit on the max number of concurrent offers per content id. The current limits are per content id and per peer, but there is no limit on multiple peers sending the same content concurrently.
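The proposed limit could look roughly like this (a hypothetical Python sketch of the idea, not Fluffy code; the class and method names are invented):

```python
class ConcurrentOfferLimiter:
    """Hypothetical sketch: cap the number of in-flight offers being
    processed for the same content id, across all peers combined."""

    def __init__(self, max_per_content_id: int = 1):
        self.max_per_content_id = max_per_content_id
        self.in_flight = {}  # content_id -> number of offers in progress

    def try_acquire(self, content_id: bytes) -> bool:
        count = self.in_flight.get(content_id, 0)
        if count >= self.max_per_content_id:
            return False  # reject: too many concurrent offers for this id
        self.in_flight[content_id] = count + 1
        return True

    def release(self, content_id: bytes) -> None:
        # Called when the offer completes, whether it succeeded or failed.
        count = self.in_flight.get(content_id, 0)
        if count <= 1:
            self.in_flight.pop(content_id, None)
        else:
            self.in_flight[content_id] = count - 1
```

With `max_per_content_id = 1`, a second peer offering the same content while the first transfer is still in flight would be rejected up front, which also covers the "content not stored yet" window the cache itself cannot help with.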

@bhartnett
Contributor Author

bhartnett commented Apr 23, 2025

@kdeme Here are some metrics showing the usage of the offer cache when running a local testnet with 16 nodes and gossiping content using 20 workers for around 10 minutes. All content is sent to one of the fluffy instances (node 2), which gossips the content to its peers.

Node 1:

# HELP portal_offer_cache_hits Portal wire protocol local content lookups that hit the offer cache
# TYPE portal_offer_cache_hits counter
portal_offer_cache_hits_total{protocol_id="500a"} 47330.0
portal_offer_cache_hits_created{protocol_id="500a"} 1745418350.0

# HELP portal_offer_cache_misses Portal wire protocol local content lookups that don't hit the offer cache
# TYPE portal_offer_cache_misses counter
portal_offer_cache_misses_total{protocol_id="500a"} 9918.0
portal_offer_cache_misses_created{protocol_id="500a"} 1745418350.0

Node 2 (the node which the portal bridge is connected to):

# HELP portal_offer_cache_hits Portal wire protocol local content lookups that hit the offer cache
# TYPE portal_offer_cache_hits counter

# HELP portal_offer_cache_misses Portal wire protocol local content lookups that don't hit the offer cache
# TYPE portal_offer_cache_misses counter
portal_offer_cache_misses_total{protocol_id="500a"} 47465.0
portal_offer_cache_misses_created{protocol_id="500a"} 1745418349.0

Node 3:

# HELP portal_offer_cache_hits Portal wire protocol local content lookups that hit the offer cache
# TYPE portal_offer_cache_hits counter
portal_offer_cache_hits_total{protocol_id="500a"} 48479.0
portal_offer_cache_hits_created{protocol_id="500a"} 1745418350.0

# HELP portal_offer_cache_misses Portal wire protocol local content lookups that don't hit the offer cache
# TYPE portal_offer_cache_misses counter
portal_offer_cache_misses_total{protocol_id="500a"} 9175.0
portal_offer_cache_misses_created{protocol_id="500a"} 1745418350.0

Node 4:

# HELP portal_offer_cache_hits Portal wire protocol local content lookups that hit the offer cache
# TYPE portal_offer_cache_hits counter
portal_offer_cache_hits_total{protocol_id="500a"} 17025.0
portal_offer_cache_hits_created{protocol_id="500a"} 1745418350.0

# HELP portal_offer_cache_misses Portal wire protocol local content lookups that don't hit the offer cache
# TYPE portal_offer_cache_misses counter
portal_offer_cache_misses_total{protocol_id="500a"} 18976.0
portal_offer_cache_misses_created{protocol_id="500a"} 1745418350.0

Based on these numbers, roughly half or more of the content lookups on the receiving nodes hit the cache during the gossip process (about 83–84% on nodes 1 and 3, about 47% on node 4; node 2 records no hits since all content originates there). Of course, the other benefit of this change is DoS protection: rejecting recently offered content becomes much faster and does not require a database lookup.
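The per-node hit rates follow directly from the counters reported above:

```python
def hit_rate(hits: float, misses: float) -> float:
    """Cache hit rate as a percentage of all local content lookups."""
    return 100.0 * hits / (hits + misses)

# Counter values taken from the metrics output above.
print(round(hit_rate(47330.0, 9918.0), 1))   # node 1 -> 82.7
print(round(hit_rate(48479.0, 9175.0), 1))   # node 3 -> 84.1
print(round(hit_rate(17025.0, 18976.0), 1))  # node 4 -> 47.3
```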

@bhartnett bhartnett requested a review from kdeme April 23, 2025 15:20
@bhartnett bhartnett merged commit 24d1dcf into master Apr 24, 2025
@bhartnett bhartnett deleted the fluffy-offer-contentid-cache branch April 24, 2025 00:27