ARROW-1758: [Python] Remove pickle=True option for object serialization by Licht-T · Pull Request #1347 · apache/arrow

Licht-T · 2017-11-22T16:20:42Z

wesm · 2017-11-22T17:52:51Z

python/pyarrow/serialization.py

If no custom serializer/deserializer is passed, should this default to pickle?

Currently it defaults to using __dict__, which is more efficient, so it seems like a good default. We use that in Ray for pretty much all types (though I have occasionally seen cases where it doesn't work). I'd prefer to keep __dict__ as the default, but it's no big deal since it can easily be changed.
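For readers unfamiliar with the __dict__ approach, here is a minimal sketch in plain Python. The names serialize_via_dict, deserialize_via_dict and Point are hypothetical and are not part of pyarrow; the sketch only illustrates the idea of capturing an instance's attribute dictionary and restoring it without calling __init__.

```python
class Point:
    """Hypothetical user-defined type used only for illustration."""

    def __init__(self, x, y):
        self.x = x
        self.y = y


def serialize_via_dict(obj):
    # Copy the attribute dictionary so later mutation of obj
    # does not change the captured state.
    return dict(obj.__dict__)


def deserialize_via_dict(cls, state):
    # Create the instance without running __init__, then restore attributes.
    obj = cls.__new__(cls)
    obj.__dict__.update(state)
    return obj


state = serialize_via_dict(Point(1, 2))
restored = deserialize_via_dict(Point, state)
```

One case where this would not work as written is a class that defines __slots__ and therefore has no per-instance __dict__; such types would need a custom serializer.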

What about lambdas?

In Ray, lambdas are handled with cloudpickle, as are builtin types like "type"; most user-defined types are handled via __dict__; namedtuples are special-cased.

One reasonable way to handle this is to provide default callbacks (using cloudpickle or a custom serializer/deserializer) for the types we encounter that don't work with __dict__, and to use __dict__ for everything else. There are already examples of this in the repo. That has worked well for us so far. If this strategy doesn't work for a user, they have full flexibility by providing their own serialization context.
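A rough sketch of that strategy, in plain Python rather than against the actual pyarrow API: SketchContext, register and Config are hypothetical names, and the standard-library pickle stands in for cloudpickle so the example has no third-party dependency. Types with a registered callback use it; everything else falls back to __dict__.

```python
import pickle


class SketchContext:
    """Hypothetical stand-in for a serialization context (not pyarrow's API)."""

    def __init__(self):
        self._callbacks = {}

    def register(self, cls, serializer, deserializer):
        # Types that do not work with __dict__ get explicit callbacks.
        self._callbacks[cls] = (serializer, deserializer)

    def serialize(self, obj):
        cls = type(obj)
        if cls in self._callbacks:
            serializer, _ = self._callbacks[cls]
            return ("custom", cls, serializer(obj))
        # Default path: capture the instance attribute dictionary.
        return ("dict", cls, dict(obj.__dict__))

    def deserialize(self, record):
        kind, cls, payload = record
        if kind == "custom":
            _, deserializer = self._callbacks[cls]
            return deserializer(payload)
        obj = cls.__new__(cls)
        obj.__dict__.update(payload)
        return obj


class Config:
    """Hypothetical user-defined type handled by the __dict__ default."""

    def __init__(self, name, retries):
        self.name = name
        self.retries = retries


ctx = SketchContext()
# Class objects such as int are instances of "type"; pickle handles them
# by reference. Lambdas would need cloudpickle instead of pickle here.
ctx.register(type, pickle.dumps, pickle.loads)

restored_type = ctx.deserialize(ctx.serialize(int))
restored_config = ctx.deserialize(ctx.serialize(Config("job", 3)))
```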

What do you think?

This is sufficiently advanced stuff that I think it's not unreasonable to ask people to explicitly use cloudpickle in a case like this. I would be OK if we changed this patch to do that to get the build passing, and we can always move on later.
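As an illustration of opting into cloudpickle explicitly while tolerating its absence, here is a minimal sketch. The helper names dumps_callable and loads_callable are hypothetical and do not reflect the code in this patch; the sketch assumes cloudpickle is an optional third-party dependency and falls back to the standard-library pickle, which can only handle importable functions, not lambdas.

```python
import pickle

try:
    import cloudpickle  # optional third-party package
    HAVE_CLOUDPICKLE = True
except ImportError:
    cloudpickle = None
    HAVE_CLOUDPICKLE = False


def dumps_callable(fn):
    # cloudpickle can serialize lambdas by value; plain pickle only
    # stores a reference to an importable function.
    if HAVE_CLOUDPICKLE:
        return cloudpickle.dumps(fn)
    return pickle.dumps(fn)


def loads_callable(data):
    # Bytes produced by cloudpickle are loaded with the regular pickle loader.
    return pickle.loads(data)


roundtripped = loads_callable(dumps_callable(len))
```

Without cloudpickle installed, passing a lambda to dumps_callable would raise a pickling error, which is consistent with asking users to depend on cloudpickle explicitly for that case.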

wesm · 2017-11-23T02:07:23Z

The tests fail here; I think we need to use cloudpickle.

Licht-T · 2017-11-23T02:09:53Z

Okay, I'll check.

Change-Id: Id4423a228ae2388c3e3f75d5650f0f0126fa9cc8

wesm · 2017-11-26T21:13:46Z

@pcmoritz @robertnishihara does this look ok?

robertnishihara · 2017-11-26T21:19:16Z

Looks good to me.

pcmoritz · 2017-11-26T22:11:40Z

+1 LGTM

Licht-T · 2017-11-27T23:26:22Z

Thanks @wesm!

wesm reviewed Nov 22, 2017


wesm force-pushed the clean-pickle-option-for-object-serialization branch from ebfc414 to 4e71bd3 on November 22, 2017 22:33

Licht-T and others added 2 commits November 26, 2017 13:58

CLN: Remove pickle=True option for object serialization

ba998dd

Use cloudpickle for lambda serialization if available

927f154

Change-Id: Id4423a228ae2388c3e3f75d5650f0f0126fa9cc8

wesm force-pushed the clean-pickle-option-for-object-serialization branch from 4e71bd3 to 927f154 on November 26, 2017 19:01

pcmoritz closed this in 85e2d89 Nov 26, 2017

Licht-T wants to merge 2 commits into apache:master from Licht-T:clean-pickle-option-for-object-serialization
