[SPARK-42089][CONNECT][PYTHON] Fix variable name issues in nested lambda functions #39619

zhengruifeng · 2023-01-17T03:55:10Z

What changes were proposed in this pull request?

1, #39068 reused the UnresolvedAttribute for the UnresolvedNamedLambdaVariable, but then Column('x') and UnresolvedNamedLambdaVariable('x') are mixed in lambda x: x + cdf.x (since we use x/y/z as augment names); this PR adds the UnresolvedNamedLambdaVariable back to distinguish between Column('x') and UnresolvedNamedLambdaVariable('x');

2, the refreshVarName logic in PySpark was added in #32523 to address similar issue in PySpark's Lambda Function, this PR adds a similar function in the Python Client to avoid rewriting the function expression in the server side, which is unnecessary and prone to error .

Why are the changes needed?

before this PR, the nested lambda function doesn't work properly

Does this PR introduce any user-facing change?

no

How was this patch tested?

enabled UT and added UT

init init

zhengruifeng · 2023-01-17T04:04:16Z

cc @HyukjinKwon @cloud-fan @hvanhovell

zhengruifeng · 2023-01-17T05:08:01Z

thanks for the reviews, merged into master

github-actions bot added CONNECT CORE PYTHON SQL labels Jan 17, 2023

init

767498b

init init

zhengruifeng force-pushed the connect_fix_nested_lambda branch from ac2814d to 767498b Compare January 17, 2023 03:59

cloud-fan approved these changes Jan 17, 2023

View reviewed changes

HyukjinKwon approved these changes Jan 17, 2023

View reviewed changes

zhengruifeng closed this in bf80aa4 Jan 17, 2023

zhengruifeng deleted the connect_fix_nested_lambda branch January 17, 2023 05:08

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[SPARK-42089][CONNECT][PYTHON] Fix variable name issues in nested lambda functions #39619

[SPARK-42089][CONNECT][PYTHON] Fix variable name issues in nested lambda functions #39619

Uh oh!

zhengruifeng commented Jan 17, 2023

Uh oh!

zhengruifeng commented Jan 17, 2023

Uh oh!

zhengruifeng commented Jan 17, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

[SPARK-42089][CONNECT][PYTHON] Fix variable name issues in nested lambda functions #39619

[SPARK-42089][CONNECT][PYTHON] Fix variable name issues in nested lambda functions #39619

Uh oh!

Conversation

zhengruifeng commented Jan 17, 2023

What changes were proposed in this pull request?

Why are the changes needed?

Does this PR introduce any user-facing change?

How was this patch tested?

Uh oh!

zhengruifeng commented Jan 17, 2023

Uh oh!

zhengruifeng commented Jan 17, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants