PARQUET-157: Divide by zero fix#102
PARQUET-157: Divide by zero fix#102jimfcarroll wants to merge 2 commits intoapache:masterfrom jimfcarroll:divide-by-zero-fix
Conversation
There was a problem hiding this comment.
Does this need to be logged if the total time processing and reading is 0ms? I think it would be better to do this:
if (totalTime > 0) {
long percentReading = ...;
long percentProcessing = ...;
LOG.info("time spent so far"...);
}There was a problem hiding this comment.
I'll make that change.
I have a .gitignore change in a subsequent commit. Do you want that removed?
|
Just one comment and I'll commit this. In the future, could you also explain any changes you include that aren't related to the fix? Wrapping the info messages in a check for |
…so avoid compiling log statements in some cases where it's unnecessary.
|
Okay. I updated the PR so it incorporates your suggestion. It makes sense to me. If you want me to pull the .gitignore commit, let me know. |
|
+1 LGTM |
|
Thanks @jimfcarroll! |
|
Thanks! I'm looking forward to 1.6.0. |
There is a divide by zero error in logging code inside the InternalParquetRecordReader. I've been running with this fixed for a while but everytime I revert I hit the problem again. I can't believe anyone else hasn't had this problem. I submitted a Jira ticket a few weeks ago but didn't hear anything on the list so here's the fix. This also avoids compiling log statements in some cases where it's unnecessary inside the checkRead method of InternalParquetRecordReader. Also added a .gitignore entry to clean up a build artifact. Author: Jim Carroll <[email protected]> Closes apache#102 from jimfcarroll/divide-by-zero-fix and squashes the following commits: 423200c [Jim Carroll] Filter out parquet-scrooge build artifact from git. 22337f3 [Jim Carroll] PARQUET-157: Fix a divide by zero error when Parquet runs quickly. Also avoid compiling log statements in some cases where it's unnecessary.
There is a divide by zero error in logging code inside the InternalParquetRecordReader. I've been running with this fixed for a while but everytime I revert I hit the problem again. I can't believe anyone else hasn't had this problem. I submitted a Jira ticket a few weeks ago but didn't hear anything on the list so here's the fix. This also avoids compiling log statements in some cases where it's unnecessary inside the checkRead method of InternalParquetRecordReader. Also added a .gitignore entry to clean up a build artifact. Author: Jim Carroll <[email protected]> Closes apache#102 from jimfcarroll/divide-by-zero-fix and squashes the following commits: 423200c [Jim Carroll] Filter out parquet-scrooge build artifact from git. 22337f3 [Jim Carroll] PARQUET-157: Fix a divide by zero error when Parquet runs quickly. Also avoid compiling log statements in some cases where it's unnecessary.
There is a divide by zero error in logging code inside the InternalParquetRecordReader. I've been running with this fixed for a while but everytime I revert I hit the problem again. I can't believe anyone else hasn't had this problem. I submitted a Jira ticket a few weeks ago but didn't hear anything on the list so here's the fix.
This also avoids compiling log statements in some cases where it's unnecessary inside the checkRead method of InternalParquetRecordReader.
Also added a .gitignore entry to clean up a build artifact.