Support Predicate Pushdown for Parquet Lists (#2108)#2999

Merged

tustvold merged 5 commits intoapache:masterfrom

tustvold:add-buffer-to-column-level-decoder

Nov 5, 2022

Contributor

tustvold commented Nov 2, 2022 •

edited

Loading

Which issue does this PR close?

Closes #2108

Rationale for this change

This PR adds support for skipping repetition levels by adding a buffer to ColumnLevelDecoderImpl. This buffer is also used to speed up skipping definition levels, as it allows for vectorised decoding.

What changes are included in this PR?

Are there any user-facing changes?


          Add buffer to ColumnLevelDecoderImpl (apache#2108)

574c4e0

github-actions bot added the parquet label

tustvold marked this pull request as draft

November 2, 2022 19:05

Contributor Author

tustvold commented Nov 2, 2022

Will update with rep_levels support


          Implement skip_rep_levels

ff36781

tustvold changed the title ~~Add buffer to ColumnLevelDecoderImpl (#2108)~~ Implement skip_rep_levels for ColumnLevelDecoderImpl (#2108)


          Add integration test

2287b24

tustvold changed the title ~~Implement skip_rep_levels for ColumnLevelDecoderImpl (#2108)~~ Support Predicate Pushdown for Parquet Lists (#2108)

tustvold requested a review from Ted-Jiang

November 4, 2022 02:39

tustvold marked this pull request as ready for review

November 4, 2022 02:39

tustvold requested a review from alamb

November 4, 2022 02:39

Contributor Author

tustvold commented Nov 4, 2022

https://arrow.apache.org/blog/2022/10/08/arrow-parquet-encoding-part-2/ may be helpful here

tustvold added 2 commits

November 4, 2022 15:59


          Merge remote-tracking branch 'upstream/master' into add-buffer-to-col…

eca720b

…umn-level-decoder


          Clippy

b84b51f

Member

Ted-Jiang commented Nov 4, 2022

I will review this carefully tomorrow morning!

Ted-Jiang approved these changes

View reviewed changes

Member

Ted-Jiang left a comment

Nice improvement! ❤️ @tustvold

parquet/src/column/reader/decoder.rs

-                  fn read(&mut self, out: &mut Self::Slice, range: Range<usize>) -> Result<usize> {
+                  fn read(&mut self, out: &mut Self::Slice, mut range: Range<usize>) -> Result<usize> {
+                      let read_from_buffer = match self.buffer.is_empty() {

Member

Ted-Jiang Nov 5, 2022

👍

parquet/src/column/reader/decoder.rs

+                      loop {
+                          if self.buffer.is_empty() {
+                              // Read SKIP_BUFFER_SIZE as we don't know how many to read

Member

Ted-Jiang Nov 5, 2022

👍

parquet/src/column/reader/decoder.rs

+                          // Find end of record
+                          while to_skip < self.buffer.len() && self.buffer[to_skip] != 0 {
+                              to_skip += 1;

Member

Ted-Jiang Nov 5, 2022

Nice check !

parquet/src/column/reader/decoder.rs

+                          }
+                          level_skip += to_skip;
+                          if to_skip >= self.buffer.len() {

Member

Ted-Jiang Nov 5, 2022

I think here only need ==

Contributor Author

tustvold Nov 5, 2022

Yeah, the hope is that >= helps LLVM elide bound checks that follow. Not sure it makes any difference, just habit 😅

parquet/src/column/reader/decoder.rs

+                      let mut read = 0;
+                      let mut decoded = vec![];
+                      let mut expected = vec![];
+                      while read < encoded.len() {

Member

Ted-Jiang Nov 5, 2022

Nice fuzz tests ! 👍

Ted-Jiang reviewed

View reviewed changes

parquet/src/column/reader/decoder.rs

                   }
               }
+              const SKIP_BUFFER_SIZE: usize = 1024;

Member

Ted-Jiang Nov 5, 2022

@tustvold choose SKIP_BUFFER_SIZE: usize = 1024, 1024* i16 just 2Kb, which supported by AVX_256, so this is the best choice 🤔 (i am not familiar with this, could you give me some inforamtion😂)

Contributor Author

tustvold Nov 5, 2022

It's a somewhat arbitrary number, it tries to strike a balance between small enough to not be overly wasteful but large enough that we don't end up repeatedly reading too few values

tustvold merged commit e2c4199 into apache:master

ursabot commented Nov 5, 2022

Benchmark runs are scheduled for baseline = fc58036 and contender = e2c4199. e2c4199 is a master commit associated with this PR. Results will be available as each benchmark for each run completes.
Conbench compare runs links:
[Skipped ⚠️ Benchmarking of arrow-rs-commits is not supported on ec2-t3-xlarge-us-east-2] ec2-t3-xlarge-us-east-2
[Skipped ⚠️ Benchmarking of arrow-rs-commits is not supported on test-mac-arm] test-mac-arm
[Skipped ⚠️ Benchmarking of arrow-rs-commits is not supported on ursa-i9-9960x] ursa-i9-9960x
[Skipped ⚠️ Benchmarking of arrow-rs-commits is not supported on ursa-thinkcentre-m75q] ursa-thinkcentre-m75q
Buildkite builds:
Supported benchmarks:
ec2-t3-xlarge-us-east-2: Supported benchmark langs: Python, R. Runs only benchmarks with cloud = True
test-mac-arm: Supported benchmark langs: C++, Python, R
ursa-i9-9960x: Supported benchmark langs: Python, R, JavaScript
ursa-thinkcentre-m75q: Supported benchmark langs: C++, Java

tustvold mentioned this pull request

Implement skip_rep_levels #2122

Closed

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

parquet