execution/stagedsync: handle sync loop block limit exhaustion #16268
Conversation
Maybe, since this is about DB preservation, we should break the loop on:

```go
executor.readState().SizeEstimate() >= commitThreshold
```

rather than on a hard block count:

```go
if blockNum-startBlockNum+1 >= uint64(cfg.syncCfg.LoopBlockLimit) {
	loopBlockLimitExhausted = true
	break
}
```
I think that otherwise we'll need to adjust this by circumstance, which means we'll end up breaking more often than we need to for small blocks.
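As a minimal sketch of how the two break conditions could be combined, assuming the names from the snippets above (`SizeEstimate`, `commitThreshold`, `LoopBlockLimit`) and reducing the decision to a pure helper; none of this is the actual erigon code:

```go
// breakExecLoop reports whether the exec loop should stop. The size-based
// check is primary, so commits track real DB pressure; the block-count cap
// stays as a backstop. All names here are illustrative assumptions.
func breakExecLoop(stateSizeEstimate, commitThreshold,
	blockNum, startBlockNum, loopBlockLimit uint64) bool {
	// Primary: buffered state updates have reached the commit threshold.
	if stateSizeEstimate >= commitThreshold {
		return true
	}
	// Backstop: honour the hard block-count limit if one is configured.
	return loopBlockLimit > 0 && blockNum-startBlockNum+1 >= loopBlockLimit
}
```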
That's something we can improve in the future; I don't think it should be done in this PR.
Does this mean we're lifting this:

```go
err = e.executionPipeline.RunPrune(e.db, tx, initialCycle)
```

If so, doesn't it imply that this loop belongs there rather than here?

One other thing I'm not sure of: the implication here is that we always run execution with an external tx? If that is the case we can probably remove the tx handling from exec. (See my comment below around the loop break arbitrator.)

One thing that occurs to me is that if we do this we could possibly move flushing up to this level as well, so we do flush and prune in the same place. I think this leads to better resource control. It may also mean that we want to pass a shared domain into the exec process, as that is where updates are currently cached prior to flushing.

One thing to note about that: we could move flush/write to a separate goroutine, since all the other actions in exec apart from flushing only need a RO transaction, so there is no need for them to run in the same routine.

I think that the SD could now be contained in a TemporalRO tx, but I'm not completely sure.
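A rough sketch of that split, under the assumption that execution produces update batches while a single writer goroutine flushes them; `update`, `produce`, and `flush` are illustrative stand-ins, not erigon's actual types:

```go
package sketch

import "golang.org/x/sync/errgroup"

// update stands in for a batch of cached state updates.
type update struct{ data []byte }

// execWithAsyncFlush runs the producer (exec, RO work) and the flusher
// (RW work) in separate goroutines, as suggested above.
func execWithAsyncFlush(produce func(chan<- update) error, flush func(update) error) error {
	g := new(errgroup.Group)
	batches := make(chan update, 8)
	g.Go(func() error {
		defer close(batches)
		return produce(batches)
	})
	g.Go(func() error {
		for b := range batches {
			if err := flush(b); err != nil {
				// Drain so the producer doesn't block on a full channel;
				// real code would cancel via a context instead.
				for range batches {
				}
				return err
			}
		}
		return nil
	})
	return g.Wait()
}
```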
> Does this mean we're lifting this: `err = e.executionPipeline.RunPrune(e.db, tx, initialCycle)`
> If so, doesn't it imply that this loop belongs there rather than here?

Not sure what you mean by this; we're already calling e.executionPipeline.RunPrune in FCU (it is done at the end of the request processing, in a background goroutine), so nothing is really lifted.

If we hit the case where hasMore=true, it means that we've exhausted the exec loop and we should run prune after it.
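To make the intended control flow concrete (hasMore and RunPrune come from the thread; the wrapper below is an assumption, not the real FCU code):

```go
// syncUntilDone re-runs the exec loop, pruning whenever the loop reports
// it stopped early. runLoop and prune are illustrative stand-ins.
func syncUntilDone(runLoop func() (hasMore bool, err error), prune func() error) error {
	for {
		hasMore, err := runLoop()
		if err != nil {
			return err
		}
		if !hasMore {
			return nil // caught up; FCU prunes in the background as usual
		}
		// The loop exhausted its budget; prune before resuming.
		if err := prune(); err != nil {
			return err
		}
	}
}
```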
> One other thing I'm not sure of: the implication here is that we always run execution with an external tx? If that is the case we can probably remove the tx handling from exec. (See my comment below around the loop break arbitrator.)

We should probably keep the internal/external tx concept in the loop. I think FCU at the moment always calls with an external tx. However, that is easy to improve/change (in a future PR). For example, if we are processing only 1 block (at chain tip) we can pass tx=db.RwTx (external tx) to the exec loop. If we are not on chain tip we can pass tx=nil (internal tx). But we first need to analyse whether that is actually necessary.
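A small sketch of that tip-vs-backlog decision, assuming tx=nil makes the exec loop open and manage its own internal transaction (the RwTx type here is an illustrative stand-in):

```go
// RwTx is an illustrative stand-in for a read-write database transaction.
type RwTx interface{ Commit() error }

// chooseTx picks the transaction to hand to the exec loop: at chain tip,
// reuse the caller's external tx; during backlog sync, return nil so the
// loop manages its own commit boundaries.
func chooseTx(atChainTip bool, externalTx RwTx) RwTx {
	if atChainTip {
		return externalTx
	}
	return nil
}
```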
fyi 1e97169 stops ExecV3 from terminating earlier.

Not really; that commit just delays the early termination by +32 blocks (changesetSafeRange).
We recently merged #16268, which should fix the flaky test. Let's give it a try and see how it goes.
cherry-pick dc3413d (#16484)
ProcessNewBlocks runs 10,000 blocks on startup after rm chaindata (it doesn't take sync.loop.block.limit into account), which causes quite a lot of bloat on bloatnet.

Also, Mark reported a similar issue while working on parallel exec, but for UpdateForkChoice.

Leaving this in draft for now; need to run some more tests.
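As a sketch of the kind of clamp being discussed, assuming it is applied wherever the batch target is chosen (names are illustrative, not the actual change):

```go
// clampBatchEnd caps how far a single batch may run so that start-up sync
// honours sync.loop.block.limit. Purely illustrative.
func clampBatchEnd(startBlock, targetBlock, loopBlockLimit uint64) uint64 {
	if loopBlockLimit == 0 {
		return targetBlock // limit disabled
	}
	if end := startBlock + loopBlockLimit - 1; end < targetBlock {
		return end
	}
	return targetBlock
}
```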