don't use mmap for compression except for huge chunks in V4 raw index #7931
Codecov Report
@@             Coverage Diff              @@
##             master    #7931      +/-   ##
============================================
- Coverage     71.15%   69.97%   -1.19%
- Complexity     4111     4118       +7
============================================
  Files          1593     1593
  Lines         82365    82377      +12
  Branches      12270    12273       +3
============================================
- Hits          58609    57642     -967
- Misses        19806    20799     +993
+ Partials       3950     3936      -14
@richardstartin, can we please have a short discussion about the raw index? I didn't get a chance to review the V4 format and want to be on the same page regarding the changes and improvements being made. I also want to make sure that V4 subsumes the V3 format changes I introduced, since V3 supports a critical production use case of ours and is configurable through FieldConfig. I wanted to make V3 the default going forward, but it looks like you want to make V4 the default, so let's have a short discussion to get clarity on the path forward. I can then help review this and PR #7934. I will follow up offline on the discussion.
Thanks, @siddharthteotia
Discussed offline with @richardstartin
Thanks for reaching consensus, we should do more of these interactions. Way faster than comments in PRs.
For posterity, feedback was solicited on the design proposal for the V4 index in #7616.
Following on from #7930, building V4 raw indexes can be made slightly faster (~10%) by compressing into a reusable compression buffer and using memory mapping only for huge chunks. The compression buffer has a fixed size because the chunk buffer itself has a fixed size, so the risk of very large compression buffers that exists with V2 and V3 does not apply here.
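To make the idea concrete, here is a minimal sketch of the buffer-selection strategy described above. It is not the code from this PR: the class name `ChunkCompressionBuffers`, the `maxCompressedSize` bound, the rule that a chunk is "huge" when it exceeds the target chunk size, and the use of `java.util.zip.Deflater` as a stand-in codec are all assumptions made for illustration. It needs Java 11 or later for the `ByteBuffer` overloads on `Deflater`.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.ByteBuffer;
import java.nio.channels.FileChannel;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardOpenOption;
import java.util.zip.Deflater;

// Hypothetical sketch: none of these names come from the Pinot codebase.
final class ChunkCompressionBuffers {
    private final int targetChunkSize;
    // Allocated once; its size is bounded because the chunk size is fixed.
    private final ByteBuffer reusableBuffer;

    ChunkCompressionBuffers(int targetChunkSize) {
        this.targetChunkSize = targetChunkSize;
        this.reusableBuffer = ByteBuffer.allocateDirect(maxCompressedSize(targetChunkSize));
    }

    // Assumed, deliberately generous worst-case bound for the stand-in codec.
    static int maxCompressedSize(int uncompressedSize) {
        return uncompressedSize + uncompressedSize / 255 + 64;
    }

    // Regular chunks share one buffer; only oversized chunks get a mapped file.
    ByteBuffer outputBufferFor(int chunkSize) {
        if (chunkSize <= targetChunkSize) {
            reusableBuffer.clear();
            return reusableBuffer;
        }
        try {
            Path tmp = Files.createTempFile("huge-chunk", ".buf");
            tmp.toFile().deleteOnExit();
            try (FileChannel channel = FileChannel.open(
                    tmp, StandardOpenOption.READ, StandardOpenOption.WRITE)) {
                // The mapping stays valid after the channel is closed.
                return channel.map(FileChannel.MapMode.READ_WRITE, 0,
                        maxCompressedSize(chunkSize));
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    // Compresses the chunk and returns a buffer positioned at the compressed bytes.
    ByteBuffer compress(ByteBuffer chunk) {
        ByteBuffer out = outputBufferFor(chunk.remaining());
        Deflater deflater = new Deflater();
        try {
            deflater.setInput(chunk);
            deflater.finish();
            while (!deflater.finished()) {
                if (!out.hasRemaining()) {
                    throw new IllegalStateException("compression buffer too small");
                }
                deflater.deflate(out);
            }
        } finally {
            deflater.end();
        }
        out.flip();
        return out;
    }
}
```

Because regular chunks never exceed the target size in this sketch, the shared buffer is allocated once up front and the cost of creating a mapping is paid only on the rare oversized chunk. The buffer returned for a regular chunk is overwritten by the next call, so a caller would need to write it out before compressing the next chunk.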