Prune unnecessary transaction sequences from corpus by samalws-tob · Pull Request #625 · crytic/medusa

samalws-tob · 2025-04-29T17:13:23Z

This PR adds a job, run once a minute, that prunes unnecessary transaction sequences from the corpus. It removes sequences that no longer contribute any new coverage relative to the rest of the corpus. This can happen when, for example, sequence A adds new coverage and sequence B then builds on sequence A and covers strictly more, which makes sequence A unnecessary since it no longer "contributes anything" to the corpus.
The job works as follows:

1. Shuffle the corpus.
2. Initialize a blank coverage map.
3. Run through the corpus one by one. For each sequence, run the sequence and add its coverage to the coverage map. If it adds no new coverage, remove the sequence from the corpus.
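The steps above can be sketched in Go roughly as follows. This is a minimal illustration, not the code in this PR: the `Sequence` type, the `pruneCorpus` function, and the `seed` parameter are hypothetical, and each sequence carries a precomputed list of coverage points instead of being executed against a chain.

```go
package main

import (
	"fmt"
	"math/rand"
)

// Sequence is a hypothetical stand-in for a corpus entry. In a real fuzzer the
// coverage would come from executing the transaction sequence; here each entry
// simply lists the coverage points it is assumed to hit.
type Sequence struct {
	Name     string
	Coverage []int
}

// pruneCorpus performs one pruning pass and returns the sequences to keep.
// The seed parameter is an assumption of this sketch, used to make the
// shuffle reproducible.
func pruneCorpus(corpus []Sequence, seed int64) []Sequence {
	// Step 1: shuffle a copy of the corpus so the caller's slice is untouched.
	shuffled := make([]Sequence, len(corpus))
	copy(shuffled, corpus)
	rng := rand.New(rand.NewSource(seed))
	rng.Shuffle(len(shuffled), func(i, j int) {
		shuffled[i], shuffled[j] = shuffled[j], shuffled[i]
	})

	// Step 2: start from a blank coverage map.
	seen := make(map[int]struct{})

	// Step 3: visit each sequence, merge its coverage into the map, and keep
	// the sequence only if it contributed at least one new coverage point.
	kept := make([]Sequence, 0, len(shuffled))
	for _, seq := range shuffled {
		addsNew := false
		for _, point := range seq.Coverage {
			if _, ok := seen[point]; !ok {
				seen[point] = struct{}{}
				addsNew = true
			}
		}
		if addsNew {
			kept = append(kept, seq)
		}
	}
	return kept
}

func main() {
	corpus := []Sequence{
		{Name: "A", Coverage: []int{1, 2}},
		{Name: "B", Coverage: []int{1, 2, 3}},
		{Name: "C", Coverage: []int{4}},
	}
	for _, seq := range pruneCorpus(corpus, 1) {
		fmt.Println("kept", seq.Name)
	}
}
```

In this sketch a single pass removes A only when the shuffle places B before A. If A comes first, both are kept, because B still adds coverage point 3 on top of A. The total coverage of the kept set always equals that of the original corpus.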

The job takes about 5-10 seconds, during which no fuzzing can occur.

This seems to be pretty effective at reducing corpus size. In a small test I've been running, it removed about 100 entries per minute during the initial phase of fuzzing, when the corpus quickly grows to 1000 entries (first 5 minutes), and has since slowed to removing about 50 entries per minute (15 minutes in, corpus size 1400). Corpus growth is slow at this phase because of pruning; without it, the corpus would probably have 2000-2500 entries by now rather than 1400.


fuzzing/corpus_pruner.go

fuzzing/fuzzer.go

fuzzing/corpus/corpus.go

Co-authored-by: anishnaik <[email protected]>

samalws-tob force-pushed the prune-sequences branch 6 times, most recently from 61ff5c4 to 0761cce Compare May 2, 2025 17:41

samalws-tob added 6 commits May 2, 2025 14:26

prune sequences WIP

4d893c4

keep track of total pruning

4ecd1aa

do it only every 3 minutes

68d49f6

resolve TODOs, change logging format, add config option, move state i…

af01dc9

…nto struct

comments and docs

341c95f

Prune in parallel

41a8e8e

samalws-tob force-pushed the prune-sequences branch from 0761cce to 41a8e8e Compare May 2, 2025 18:28

add comments

e1748b0

samalws-tob marked this pull request as ready for review May 5, 2025 13:08

samalws-tob requested review from Xenomega, anishnaik and bsamuels453 as code owners May 5, 2025 13:08