[pytorch] Buffer in Pickler to improve performance. #28043

jjlilley · 2019-10-15T21:23:46Z

Stack from ghstack:

[pytorch] Buffer in Pickler to improve performance. #28043 [pytorch] Double-buffer in Pickler to reduce #calls.

This change adds a small fixed-size buffer to Pickler to
avoid calling writer_() and the associated downstream checks
on a per-opcode/per-byte basis.

We end up still doing a bounds check in the common case,
but the memcpy() is a fixed size. And we reduce the number
of backend calls.

In practice, this change speeds up the Pickle1MInts benchmark
for me locally from roughly 56msec to 22msec.

Additionally, in this change we convert a few pushIValue() on
typed lists, where we know the type to be double/int/boot to be
pushInt() to bypass a bit of logic.

We should additionally change the Unpickler, though keeping
this separate, since the std::function<> prototype needs to be
changed to do buffering (return value needs to change from bool
to size_t)

Differential Revision: D17939311

This change adds a small fixed-size buffer to Pickler to avoid calling writer_() and the associated downstream checks on a per-opcode/per-byte basis. We end up still doing a bounds check in the common case, but the memcpy() is a fixed size. And we reduce the number of backend calls. In practice, this change speeds up the Pickle1MInts benchmark for me locally from roughly 56msec to 22msec. Additionally, in this change we convert a few pushIValue() on typed lists, where we know the type to be double/int/boot to be pushInt() to bypass a bit of logic. We should additionally change the Unpickler, though keeping this separate, since the std::function<> prototype needs to be changed to do buffering (return value needs to change from bool to size_t) Differential Revision: [D17939311](https://our.internmc.facebook.com/intern/diff/D17939311/) [ghstack-poisoned]

This change adds a small fixed-size buffer to Pickler to avoid calling writer_() and the associated downstream checks on a per-opcode/per-byte basis. We end up still doing a bounds check in the common case, but the memcpy() is a fixed size. And we reduce the number of backend calls. In practice, this change speeds up the Pickle1MInts benchmark for me locally from roughly 56msec to 22msec. Additionally, in this change we convert a few pushIValue() on typed lists, where we know the type to be double/int/boot to be pushInt() to bypass a bit of logic. We should additionally change the Unpickler, though keeping this separate, since the std::function<> prototype needs to be changed to do buffering (return value needs to change from bool to size_t) Differential Revision: [D17939311](https://our.internmc.facebook.com/intern/diff/D17939311/) ghstack-source-id: 91964964 Pull Request resolved: #28043

jjlilley · 2019-10-16T00:16:06Z

This change is identical to (approved) 27720, except that

I added a static assert per the review comments.
the one-line TODO comment related to the LONG4 is removed

There was a tooling failure this afternoon which made it tricky for me to update 27720 (particularly, I used the web-based export-from-phabricator rather than ghexport, which has an infra issue now, and precludes using ghexport on followup diffs).

If it's possible to stamp this, that would be great!

jjlilley requested a review from apaszke as a code owner October 15, 2019 21:23

facebook-github-bot added the oncall: jit Add this issue/PR to JIT oncall triage queue label Oct 15, 2019

jjlilley requested review from resistor and zdevito October 16, 2019 00:09

jjlilley changed the title ~~[pytorch] Double-buffer in Pickler to reduce #calls.~~ [pytorch] Buffer in Pickler to improve performance. Oct 16, 2019

jjlilley mentioned this pull request Oct 16, 2019

Buffer in Pickler to improve performance. #27720

Closed

jjlilley closed this Oct 16, 2019

facebook-github-bot deleted the gh/jjlilley/2/head branch November 16, 2019 15:17

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[pytorch] Buffer in Pickler to improve performance. #28043

[pytorch] Buffer in Pickler to improve performance. #28043

Uh oh!

jjlilley commented Oct 15, 2019 •

edited

Loading

Uh oh!

jjlilley commented Oct 16, 2019

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

[pytorch] Buffer in Pickler to improve performance. #28043

[pytorch] Buffer in Pickler to improve performance. #28043

Uh oh!

Conversation

jjlilley commented Oct 15, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jjlilley commented Oct 16, 2019

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

jjlilley commented Oct 15, 2019 •

edited

Loading