Improve legacy QuantizedLinear functions to reduce overhead #29773
Conversation
This pull request was exported from Phabricator. Differential Revision: D18494988
Force-pushed 735d147 to 060b10a.
Force-pushed 060b10a to 2969fb5.
Force-pushed 2969fb5 to 0fce0d9.
Force-pushed 0fce0d9 to 0f4de7d.
jianyuh left a comment:
LGTM! Any optimizations applied here can also be applied to https://github.com/pytorch/pytorch/blob/master/aten/src/ATen/native/quantized/cpu/qlinear_dynamic.cpp? Maybe in another PR.
Nit: about the spaces added in the comments: is that some C++ coding style standard? Changing this might mess up the blame.
Done.
I wonder whether the bias.contiguous() implementation already has such checks under the hood?
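For what it's worth, the check the reviewer is asking about does exist at the library level in comparable APIs. As a hedged illustration (using NumPy as a stand-in for ATen, not ATen's actual implementation), requesting a contiguous version of an already-contiguous array is a no-op that returns the same object:

```python
import numpy as np

# Analogy only: np.ascontiguousarray plays the role of Tensor::contiguous().
a = np.arange(6, dtype=np.float32).reshape(2, 3)
assert a.flags["C_CONTIGUOUS"]

b = np.ascontiguousarray(a)
assert b is a  # already contiguous: no copy is made

# A transposed view is not C-contiguous, so a real copy is produced.
t = a.T
assert not t.flags["C_CONTIGUOUS"]
c = np.ascontiguousarray(t)
assert c is not t and c.flags["C_CONTIGUOUS"]
```

So an explicit pre-check before calling contiguous() is usually redundant; the call itself is cheap when nothing needs to be done.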
Actually, the difference is that if we use a reference here, there is no tensor shallow copy at all. But we can change it back anyway, since it is not a serious issue.
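The distinction being discussed is C++ pass-by-value versus pass-by-const-reference: passing an ATen Tensor by value makes a shallow copy (a refcount bump on the shared implementation), while a reference does no work at all. A toy Python simulation of that C++ semantics (ToyTensor and both function names are hypothetical, not ATen code) makes the difference countable:

```python
# Toy stand-in for an ATen-style refcounted tensor handle.
class ToyTensor:
    shallow_copies = 0  # counts handle copies across all instances

    def __init__(self, storage):
        self.storage = storage  # shared buffer; never deep-copied here

    def shallow_copy(self):
        # Mimics C++ pass-by-value: same storage, new handle, refcount++.
        ToyTensor.shallow_copies += 1
        return ToyTensor(self.storage)

def linear_by_value(bias):   # like C++ `void f(Tensor bias)`
    return bias.shallow_copy().storage

def linear_by_ref(bias):     # like C++ `void f(const Tensor& bias)`
    return bias.storage      # no copy of any kind

b = ToyTensor([0.0] * 4)
linear_by_value(b)
linear_by_ref(b)
assert ToyTensor.shallow_copies == 1  # only the by-value path copied
```

Per call the cost is tiny, which is why the reviewer agrees it is fine to change back; it only matters on hot paths.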
…29773)

Summary: Pull Request resolved: pytorch#29773

Improve legacy QuantizedLinear functions to reduce overhead. Separate from the stack of D18381988.

Test Plan: buck test mode/dev-nosan //caffe2/test:jit -- "quant"

Reviewed By: lly-zero-one

Differential Revision: D18494988

fbshipit-source-id: 081687ea8d1e9c67f1213930a40a482e0090a029
Force-pushed 0f4de7d to d05b479.
This pull request has been merged in 0995929.
Summary: Pull Request resolved: pytorch/pytorch#29773

Improve legacy QuantizedLinear functions to reduce overhead. Separate from the stack of D18381988.

Test Plan: buck test mode/dev-nosan //caffe2/test:jit -- "quant"

Reviewed By: lly-zero-one

Differential Revision: D18494988

fbshipit-source-id: 5627d7e8b0b7a750852eead9e28c5a9b3fa70559
jamesr66a left a comment:
All the unrelated style changes make it very hard to tell what the actual functional changes in this PR are, and they mess up the blame. Please either refrain from making such changes or separate them out into their own PR in the future.
```cpp
Tensor quantized = at::native::empty_like(
    weight_contig, weight_contig.options().dtype(at::kChar));
// Tensor quantized = at::native::empty_cpu(
//     weight_contig.sizes(), weight_contig.options().dtype(at::kChar));
```
This commented-out code probably shouldn't be here.
Summary:
Improve legacy QuantizedLinear functions to reduce overhead.
The legacy QuantizedLinear functions contain many unnecessary zero-initialization and view ops, which introduce a large amount of overhead. This PR removes them and improves the performance of existing models.
Differential Revision: D18494988
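The core saving in the diff above is switching zeros-style allocations to empty_like for buffers that are overwritten immediately afterwards: a zeroed allocation pays for a full write of every element, while an uninitialized one only allocates. A hedged NumPy analogy (not ATen itself, but the same allocate-vs-fill distinction):

```python
import numpy as np

w = np.ones((128, 256), dtype=np.float32)

# zeros_like must write every element (a memset-style fill).
q_zeroed = np.zeros_like(w, dtype=np.int8)

# empty_like only allocates; its contents are arbitrary until the
# quantization kernel fills them, so no initialization cost is paid.
q_empty = np.empty_like(w, dtype=np.int8)

assert q_zeroed.shape == q_empty.shape == (128, 256)
assert not q_zeroed.any()  # zeros_like really did initialize everything
```

The trade-off is that an empty buffer must be fully written before it is read, which holds here because the subsequent quantization step overwrites the whole tensor.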