fix: multi-chunk batched bucket sums folding error by jacobtrombetta · Pull Request #262 · spaceandtimefdn/blitzar

jacobtrombetta · 2025-05-30T15:11:23Z

Rationale for this change

Batch commitments with elements that are represented in 32 bytes, have 256-384 elements in the sequence, and are the same length will attempt bucket method multiexponentiation. The bucket method has a max chunk size of 2^20. In cases where the batch size and element length go above the max chunk size, the commitments are incorrect. For example, batch_size = 1<<3 and element_length = 1<<17 returns the expected result, while element_length = (1<<17) + 1 returns an incorrect result. Test cases that reproduce the error are in commit 6d6c2e4.

The issue is in the mtxbk::accumulate_buckets_impl method. When the number of chunks, num_chunks, is greater than 1, the partial bucket sums are split into pieces of at most the max chunk size number of elements. After all the buckets are accumulated, the call to the combine_partial_bucket_sums kernel does not adjust its data offset and stride calculations to account for the multi-chunk partial bucket sums array.

To solve this issue, fold_kernel is added. Its purpose is to take the chunked partial sums and fold them into a single bucket sums array. fold_kernel performs a segmented left fold of the partial sums:

out[index] = sum(partial_bucket_sums[index + i * out_size]) for i in [0, num_of_folds)

where num_of_folds = partial_bucket_sums.size() / out.size().

The tests will have to be updated when the max chunk size is increased or decreased from its 1<<20 limit.

What changes are included in this PR?

fold_kernel with a segmented_left_fold_partial_bucket_sums method is added to the multiexp package with tests.
Accumulation in multi-chunk cases now uses segmented_left_fold_partial_bucket_sums once all the partial buckets are accumulated.
Tests that reproduce the error state are added to multiexponentiation. They add <2 seconds to the overall test time.

Are these changes tested?

Yes. Also tested by replacing the libblitzar-linux-x86_64.so in the blitzar-rs and nova projects and confirming the failing tests pass.

Copilot

Pull Request Overview

This PR fixes incorrect bucket sums when the number of partial-bucket chunks exceeds the GPU kernel’s max chunk size by introducing a folding kernel and updating the accumulation logic and tests.

Adds segmented_left_fold_partial_bucket_sums kernel and its unit tests to fold multi-chunk partial sums.
Updates accumulate_buckets_impl to use the new fold kernel instead of combine_partial_bucket_sums.
Extends multiexponentiation tests to verify results when input sizes exceed the max chunk threshold.

Reviewed Changes

Copilot reviewed 6 out of 6 changed files in this pull request and generated 2 comments.

File	Description
sxt/multiexp/bucket_method/fold_kernel.h	Defines the new segmented left‐fold GPU kernel
sxt/multiexp/bucket_method/fold_kernel.t.cc	Adds tests for single‐ and multi‐bucket folding
sxt/multiexp/bucket_method/accumulation.h	Replaces old combination kernel launch with fold kernel launch
sxt/multiexp/bucket_method/multiexponentiation.t.cc	Adds tests for multiexponentiation with element counts over chunk size
sxt/multiexp/bucket_method/BUILD	Registers the `fold_kernel` component

Comments suppressed due to low confidence (2)

sxt/multiexp/bucket_method/fold_kernel.h:45

[nitpick] The variables bucket_group_size and num_bucket_groups map to gridDim.x and blockDim.x but their names are confusing. Consider renaming them to num_buckets and num_chunks (and chunk_index for threadIdx.x) to make the folding logic clearer.

auto bucket_group_size = gridDim.x;

sxt/multiexp/bucket_method/fold_kernel.t.cc:60

The unit tests cover single‐output scenarios only. Add a SECTION that launches the kernel with gridDim.y > 1 to verify folding across multiple outputs.

// end of tests

tlovell-sxt

Sorry to ask for this, just not as familiar w/ this code and its terms. I think it'd help me understand a little if I had some mental model for how a multi-exponentiation maps to "buckets" and "bucket groups" and "elements" and "chunks" and "folds", etc. Maybe we can hop on a call or maybe text + a diagram would be sufficient

jacobtrombetta · 2025-06-05T16:08:21Z

@tlovell-sxt a diagram is a good idea. I'll create one and set up a call to walk through it.

SxT-Release · 2025-06-18T15:59:51Z

🎉 This PR is included in version 1.115.1 🎉

The release is available on GitHub release

Your semantic-release bot 📦🚀

jacobtrombetta marked this pull request as ready for review May 30, 2025 17:12

jacobtrombetta requested a review from Copilot May 30, 2025 17:12

Copilot AI reviewed May 30, 2025

jacobtrombetta requested review from stuarttimwhite and tlovell-sxt May 30, 2025 19:34

tlovell-sxt reviewed Jun 4, 2025

jacobtrombetta added 6 commits June 16, 2025 11:33

666f693 test: add multiexponentiation tests identifying batched bucket method folding error
1bec910 feat: add fold_kernel to multiexp package
0e9564a fix: update accumulation algorithm to use folding kernel when working on multiple chunks
a96d7a5 doc: update fold_kernel docs
b6d5a80 chore: fix format
a5a5ee9 chore: address Copilot's code review

jacobtrombetta force-pushed the fix/batched-bucket-method-folding-error branch from 24ff343 to a5a5ee9 Compare June 16, 2025 15:33

tlovell-sxt reviewed Jun 17, 2025

tlovell-sxt approved these changes Jun 18, 2025

jacobtrombetta merged commit 6b5203d into main Jun 18, 2025
8 checks passed

jacobtrombetta deleted the fix/batched-bucket-method-folding-error branch June 18, 2025 14:31

SxT-Release added the released label Jun 18, 2025
