ARROW-5880: [C++][Parquet] Use TypedBufferBuilder instead of ArrayBuilder in writer.cc#4827
ARROW-5880: [C++][Parquet] Use TypedBufferBuilder instead of ArrayBuilder in writer.cc#4827emkornfield wants to merge 11 commits intoapache:masterfrom
Conversation
emkornfield
commented
Jul 9, 2019
- Other small cleanups/comments based on my grokking of the code.
- Vectorize inner loop of List based LevelBuilder where possible
cpp/src/parquet/arrow/writer.cc
Outdated
There was a problem hiding this comment.
| // arrays otherwise we would preallocate. An upper boun on memory | |
| // arrays otherwise we would preallocate. An upper bound on memory |
|
@bkietz thanks for the review, all good suggestions. |
|
Added some missing exports. Beware as you make more changes on the Parquet codebase that my patch #4841 is slightly disruptive to the src/parquet/arrow directory (but should not be too bad to rebase) |
|
@wesm thanks for the fixes. I will look more closely as I modify code to watch out for potential merge conflicts (I have enough other things on my plate that it might be a few days anyways before I get back to this). I restarted the failing travis-CI job which looked spurious |
a14fbe5 to
94ce3e5
Compare
Co-Authored-By: Benjamin Kietzman <[email protected]>
Co-Authored-By: Benjamin Kietzman <[email protected]>
Co-Authored-By: Benjamin Kietzman <[email protected]>
94ce3e5 to
2f77313
Compare
Codecov Report
@@ Coverage Diff @@
## master #4827 +/- ##
==========================================
+ Coverage 87.31% 89.05% +1.74%
==========================================
Files 894 718 -176
Lines 134066 100316 -33750
==========================================
- Hits 117055 89334 -27721
+ Misses 16659 10982 -5677
+ Partials 352 0 -352
Continue to review full report at Codecov.
|
…lder in writer.cc - Other small cleanups/comments based on my grokking of the code. - Vectorize inner loop of List based LevelBuilder where possible Closes #4827 from emkornfield/parquet_cleanup and squashes the following commits: 2f77313 <Wes McKinney> Export operator<< symbols for DataType, TimeUnit on Windows e1698e5 <Micah Kornfield> fix compilation issues 152a6ee <Micah Kornfield> change back to signed it seems to be what the API uses 67ccf3e <Micah Kornfield> update references ef7fade <Micah Kornfield> remove unneeded nesting 6776e61 <emkornfield> Update cpp/src/parquet/arrow/writer.cc 7cbc8e4 <emkornfield> Update cpp/src/parquet/arrow/writer.cc 83ee5ae <emkornfield> Update cpp/src/parquet/arrow/writer.cc 325af52 <Micah Kornfield> undo reserve 4516110 <Micah Kornfield> Small style issues 7795352 <Micah Kornfield> change from array builder to buffer builde Lead-authored-by: Micah Kornfield <[email protected]> Co-authored-by: emkornfield <[email protected]> Co-authored-by: Wes McKinney <[email protected]> Signed-off-by: Wes McKinney <[email protected]>