ARROW-4563: [Python] Validate decimal128() precision input by pitrou · Pull Request #3647 · apache/arrow

pitrou · 2019-02-14T16:06:31Z

Also add a debug check on the C++ side.

pitrou · 2019-02-14T16:06:42Z

cc @pravindra for the gandiva changes.


pravindra · 2019-02-14T16:47:30Z

cpp/src/arrow/type.cc

+Decimal128Type::Decimal128Type(int32_t precision, int32_t scale)
+    : DecimalType(16, precision, scale) {
+  DCHECK_GE(precision, 1);
+  DCHECK_LE(precision, 38);


Also, check for scale >= 0 && scale <= precision?

I'm not sure. Why can't scale be arbitrary? It's simply an exponent.

From this

Precision is the number of digits in a number. Scale is the number of digits to the right of the decimal point in a number. For example, the number 123.45 has a precision of 5 and a scale of 2.

The number of digits after the decimal point must be >= 0 and <= the number of digits in the number. Am I missing something?

This is Microsoft-specific. There is no a priori reason why scale should be limited. Apparently Oracle allows scales between -128 and 127 (?):
https://docs.oracle.com/cd/A81042_01/DOC/server.816/a76965/c10datyp.htm#743
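To make the two readings of scale concrete, here is a minimal sketch using Python's standard `decimal` module (not Arrow or pyarrow). The helper name `precision_and_scale` is hypothetical, and it assumes a finite value. It treats scale as the negated exponent, which shows how a scale can exceed the precision or go negative when it is read purely as an exponent.

```python
from decimal import Decimal
from typing import Tuple


def precision_and_scale(value: Decimal) -> Tuple[int, int]:
    """Hypothetical helper: (precision, scale) of a finite Decimal.

    Precision is the number of stored digits; scale is the negated exponent.
    """
    _sign, digits, exponent = value.as_tuple()
    return len(digits), -exponent


print(precision_and_scale(Decimal("123.45")))   # (5, 2): the example quoted above
print(precision_and_scale(Decimal("0.00123")))  # (3, 5): scale larger than precision
print(precision_and_scale(Decimal("12E+3")))    # (2, -3): negative scale
```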

The 0 to 38 thing is pretty ubiquitous in the database landscape:

Spark SQL https://spark.apache.org/docs/2.0.2/api/java/org/apache/spark/sql/types/DecimalType.html

Impala https://github.com/apache/impala/blob/master/be/src/udf/udf.h#L664

Hive https://cwiki.apache.org/confluence/display/Hive/LanguageManual+Types#LanguageManualTypes-DecimalsdecimalDecimals

Presto https://prestodb.github.io/docs/current/functions/decimal.html

MapD seems to be capped at 19 digits of precision instead of 38, presumably to fit in 64 bits https://www.omnisci.com/docs/latest/5_datatypes.html

@pitrou - thanks, I didn't realize that Oracle allows scale to be greater than precision, and also negative. The decimal functions in Gandiva don't handle this.

However, I checked the links from @wesm - Spark SQL, Impala and Presto (and, of course, SQL Server) all require scale to be <= precision.
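The two positions in this thread can be sketched as a single Python-side check. This is an illustration only, not the code merged in this PR: the function name `check_decimal128_args` and the `strict_scale` flag are hypothetical. The 1 to 38 precision bound mirrors the `DCHECK_GE` / `DCHECK_LE` lines in the diff, and the optional scale rule is the stricter one proposed in review.

```python
MAX_DECIMAL128_PRECISION = 38  # same upper bound as the DCHECK_LE in the diff


def check_decimal128_args(precision: int, scale: int, strict_scale: bool = False) -> None:
    """Hypothetical helper: raise ValueError for out-of-range decimal128 arguments."""
    if not 1 <= precision <= MAX_DECIMAL128_PRECISION:
        raise ValueError(
            f"precision should be between 1 and {MAX_DECIMAL128_PRECISION}, got {precision}"
        )
    # Stricter rule suggested in review (0 <= scale <= precision); off by default
    # because the thread leaves open whether scale should be constrained at all.
    if strict_scale and not 0 <= scale <= precision:
        raise ValueError(
            f"scale should be between 0 and precision ({precision}), got {scale}"
        )


check_decimal128_args(5, 2)   # accepted
check_decimal128_args(3, 5)   # accepted: scale is unconstrained by default
try:
    check_decimal128_args(39, 0)
except ValueError as exc:
    print(exc)
```

With `strict_scale=True` the second call would be rejected, which matches the behaviour described for Spark SQL, Impala, Presto and SQL Server.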

ARROW-4563: [Python] Validate decimal128() precision input

5a4cd6a


pitrou force-pushed the ARROW-4563-py-validate-decimal128-inputs branch from 26e2dcc to 5a4cd6a Compare February 14, 2019 16:29

pravindra reviewed Feb 14, 2019


pravindra approved these changes Feb 14, 2019


pitrou closed this in b9819e8 Feb 14, 2019

pitrou deleted the ARROW-4563-py-validate-decimal128-inputs branch February 14, 2019 20:20

asfimport mentioned this pull request Apr 11, 2019

[Python] pa.decimal128 should validate inputs #21109

Closed

pitrou wants to merge 1 commit into apache:master from pitrou:ARROW-4563-py-validate-decimal128-inputs
