Conversation

@richardstartin (Member) commented Dec 19, 2021

Following on from #7930, it's possible to speed up building of V4 raw indexes slightly (~10%) by using a compression buffer and only using memory mapping for huge chunks. The size of the compression buffer is fixed because the size of the chunk buffer itself is fixed, so the potential for very large compression buffers which exists with V2 and V3 does not apply here.
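As a sketch of the idea (not the actual PR code — all names here are hypothetical, and java.util.zip's Deflater stands in for Pinot's LZ4 compressor): because the chunk buffer size is bounded, one fixed-size direct buffer can be allocated up front and reused for every chunk's compression, with a fresh allocation only in the rare huge-chunk case.

```java
import java.nio.ByteBuffer;
import java.util.zip.Deflater;
import java.util.zip.Inflater;

// Hypothetical sketch only: Deflater stands in for the LZ4 chunk compressor,
// and the class/method names are illustrative, not Pinot's actual code.
final class V4StyleChunkCompressor {
    private final int maxChunkSize;
    private final ByteBuffer reusable; // fixed-size direct buffer, allocated once

    V4StyleChunkCompressor(int maxChunkSize) {
        this.maxChunkSize = maxChunkSize;
        // Conservative worst-case bound; a real compressor exposes an exact
        // maxCompressedLength(n) to size this precisely.
        this.reusable = ByteBuffer.allocateDirect(2 * maxChunkSize + 64);
    }

    /** Compresses a chunk, reusing the direct buffer in the common case. */
    byte[] compress(byte[] chunk) {
        ByteBuffer target = chunk.length <= maxChunkSize
                ? reusable.duplicate()                        // common case: no allocation
                : ByteBuffer.allocate(2 * chunk.length + 64); // "huge chunk" fallback
        Deflater deflater = new Deflater();
        deflater.setInput(chunk);
        deflater.finish();
        byte[] scratch = new byte[8192];
        while (!deflater.finished()) {
            target.put(scratch, 0, deflater.deflate(scratch));
        }
        deflater.end();
        byte[] compressed = new byte[target.position()];
        target.flip();
        target.get(compressed);
        return compressed;
    }

    /** Inverse of {@link #compress}, for round-trip verification. */
    static byte[] decompress(byte[] compressed, int originalLength) {
        Inflater inflater = new Inflater();
        inflater.setInput(compressed);
        byte[] out = new byte[originalLength];
        try {
            int off = 0;
            while (off < originalLength) {
                off += inflater.inflate(out, off, originalLength - off);
            }
        } catch (java.util.zip.DataFormatException e) {
            throw new IllegalStateException(e);
        } finally {
            inflater.end();
        }
        return out;
    }
}
```

The point of the reuse is that the common path performs no allocation and no memory mapping per chunk; only a chunk larger than the bounded buffer pays for a fresh allocation.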

master

Benchmark                                  (_chunkCompressionType)      (_distribution)  (_maxChunkSize)  (_records)  Mode  Cnt           Score     Error  Units
BenchmarkRawForwardIndexWriter.writeV4                         LZ4  UNIFORM(1000,10000)          1048576      100000    ss    5        9857.203 ± 167.084  ms/op
BenchmarkRawForwardIndexWriter.writeV4:b                       LZ4  UNIFORM(1000,10000)          1048576      100000    ss    5  7781048545.000                #
BenchmarkRawForwardIndexWriter.writeV4:kb                      LZ4  UNIFORM(1000,10000)          1048576      100000    ss    5     7598680.000                #
BenchmarkRawForwardIndexWriter.writeV4:mb                      LZ4  UNIFORM(1000,10000)          1048576      100000    ss    5        7420.000                #
BenchmarkRawForwardIndexWriter.writeV4                         LZ4           EXP(0.001)          1048576      100000    ss    5        1802.487 ± 112.871  ms/op
BenchmarkRawForwardIndexWriter.writeV4:b                       LZ4           EXP(0.001)          1048576      100000    ss    5  1416542875.000                #
BenchmarkRawForwardIndexWriter.writeV4:kb                      LZ4           EXP(0.001)          1048576      100000    ss    5     1383340.000                #
BenchmarkRawForwardIndexWriter.writeV4:mb                      LZ4           EXP(0.001)          1048576      100000    ss    5        1350.000                #

branch

Benchmark                                  (_chunkCompressionType)      (_distribution)  (_maxChunkSize)  (_records)  Mode  Cnt           Score     Error  Units
BenchmarkRawForwardIndexWriter.writeV4                         LZ4  UNIFORM(1000,10000)          1048576      100000    ss    5        8788.638 ± 256.238  ms/op
BenchmarkRawForwardIndexWriter.writeV4:b                       LZ4  UNIFORM(1000,10000)          1048576      100000    ss    5  7781048545.000                #
BenchmarkRawForwardIndexWriter.writeV4:kb                      LZ4  UNIFORM(1000,10000)          1048576      100000    ss    5     7598680.000                #
BenchmarkRawForwardIndexWriter.writeV4:mb                      LZ4  UNIFORM(1000,10000)          1048576      100000    ss    5        7420.000                #
BenchmarkRawForwardIndexWriter.writeV4                         LZ4           EXP(0.001)          1048576      100000    ss    5        1625.317 ±  48.117  ms/op
BenchmarkRawForwardIndexWriter.writeV4:b                       LZ4           EXP(0.001)          1048576      100000    ss    5  1416542875.000                #
BenchmarkRawForwardIndexWriter.writeV4:kb                      LZ4           EXP(0.001)          1048576      100000    ss    5     1383340.000                #
BenchmarkRawForwardIndexWriter.writeV4:mb                      LZ4           EXP(0.001)          1048576      100000    ss    5        1350.000                #
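The improvement implied by the ms/op scores above can be checked arithmetically (the helper below is purely illustrative):

```java
// Relative improvement from the JMH single-shot scores above
// (ms/op, lower is better); helper name is illustrative.
final class SpeedupCheck {
    static double improvement(double beforeMs, double afterMs) {
        return (beforeMs - afterMs) / beforeMs;
    }
}
```

The UNIFORM(1000,10000) case improves by about 10.8% (9857.203 → 8788.638 ms/op) and the EXP(0.001) case by about 9.8% (1802.487 → 1625.317 ms/op), consistent with the ~10% claim in the description.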

@richardstartin changed the title from "don't use mmap for compression except for huge chunks" to "don't use mmap for compression except for huge chunks in V4 raw index" on Dec 19, 2021
@codecov-commenter commented Dec 19, 2021

Codecov Report

Merging #7931 (319bd58) into master (6037cac) will decrease coverage by 1.18%.
The diff coverage is 100.00%.

Impacted file tree graph

@@             Coverage Diff              @@
##             master    #7931      +/-   ##
============================================
- Coverage     71.15%   69.97%   -1.19%     
- Complexity     4111     4118       +7     
============================================
  Files          1593     1593              
  Lines         82365    82377      +12     
  Branches      12270    12273       +3     
============================================
- Hits          58609    57642     -967     
- Misses        19806    20799     +993     
+ Partials       3950     3936      -14     
Flag          Coverage Δ
integration1  ?
integration2  27.66% <0.00%> (+0.07%) ⬆️
unittests1    68.09% <100.00%> (+0.02%) ⬆️
unittests2    14.32% <0.00%> (-0.01%) ⬇️

Flags with carried forward coverage won't be shown.

Impacted Files Coverage Δ
...riter/impl/VarByteChunkSVForwardIndexWriterV4.java 96.69% <100.00%> (+0.36%) ⬆️
...pinot/minion/exception/TaskCancelledException.java 0.00% <0.00%> (-100.00%) ⬇️
...nverttorawindex/ConvertToRawIndexTaskExecutor.java 0.00% <0.00%> (-100.00%) ⬇️
...e/pinot/common/minion/MergeRollupTaskMetadata.java 0.00% <0.00%> (-94.74%) ⬇️
...plugin/segmentuploader/SegmentUploaderDefault.java 0.00% <0.00%> (-87.10%) ⬇️
.../transform/function/MapValueTransformFunction.java 0.00% <0.00%> (-85.30%) ⬇️
...ot/common/messages/RoutingTableRebuildMessage.java 0.00% <0.00%> (-81.82%) ⬇️
...data/manager/realtime/DefaultSegmentCommitter.java 0.00% <0.00%> (-80.00%) ⬇️
...ache/pinot/common/lineage/SegmentLineageUtils.java 22.22% <0.00%> (-77.78%) ⬇️
...ore/startree/executor/StarTreeGroupByExecutor.java 0.00% <0.00%> (-77.78%) ⬇️
... and 101 more

Continue to review the full report at Codecov.

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 6037cac...319bd58.

@siddharthteotia (Contributor) commented Dec 20, 2021

@richardstartin, can we please have a short discussion around the raw index formats? I didn't get a chance to review the V4 format and want to be on the same page w.r.t. the changes/improvements being made.

I also want to make sure that V4 subsumes the V3 format changes I had introduced, since V3 supports a critical production use case of ours. It is configurable through FieldConfig. I wanted to make V3 the default going forward, but it looks like you want to make V4 the default.

So, let's please have a short discussion to get clarity on the path forward. I can then help review this and PR #7934.

Will follow up offline on the discussion.

@mcvsubbu (Contributor)

Thanks, @siddharthteotia

@siddharthteotia (Contributor) commented Dec 21, 2021

Discussed offline with @richardstartin:

  • The change here is limited to the V4 format, and I am OK with the reasoning behind it. With V4, the use of a memory-mapped buffer for compression is essentially eliminated; the direct buffer introduced in this PR improves performance for the vast majority of cases.

  • Regarding V3 vs. V4: they have similarities, but the main difference is the 4GB limitation imposed by V4. V3 has no such limitation because the chunk offsets in the file header are tracked using longs. V3 also derives numDocsPerChunk (as opposed to using a fixed 1000 rows per chunk), or packs a single row into a chunk if the row exceeds 1MB. In V3 we never considered memory-mapping the compression buffer, even though our chunks are potentially huge (in the production use case where V3 is used).

  • Regarding changing the default from V2: I didn't make V3 the default initially since I wanted to test it in production first. It has been in use for over a year now without any issues. We discussed that we can bridge the functionality gap between V3 and V4 as a new change, and then hopefully remove the V3 code (ensuring the V4+ reader can still read it) if possible, since V3 and V4 had common goals.
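The V3-style chunk sizing described above can be sketched as follows. The class name and the 1MB target are assumptions for illustration, not Pinot's actual implementation:

```java
// Illustrative sketch of V3-style chunk sizing: derive the rows per chunk
// from a target chunk size and the longest row observed, falling back to a
// single row per chunk when one row exceeds the target. The name and the
// 1MB constant are assumptions, not Pinot's actual code.
final class V3StyleChunkSizing {
    static final int TARGET_CHUNK_BYTES = 1 << 20; // assumed 1MB target

    /** Rows per chunk: always at least 1, even when a row exceeds the target. */
    static int numDocsPerChunk(int maxRowLengthBytes) {
        if (maxRowLengthBytes >= TARGET_CHUNK_BYTES) {
            return 1; // a huge row gets a chunk to itself
        }
        return TARGET_CHUNK_BYTES / maxRowLengthBytes; // assumes maxRowLengthBytes > 0
    }
}
```

This adaptive sizing is one half of the gap to bridge; the other is V3's use of long chunk offsets in the file header, which is what avoids the 4GB limit that V4's narrower offsets impose.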

@mcvsubbu (Contributor)

Thanks for reaching consensus; we should do more of these interactions. They are way faster than comments in PRs.

@richardstartin (Member, Author)

For posterity, feedback was solicited on the design proposal for the V4 index in #7616.

@siddharthteotia merged commit 0eb63c5 into apache:master on Dec 21, 2021