
Conversation

@ailzhang (Contributor):

Fixes #18057 according to @colesbury's suggestion. Thanks!
cc: @ezyang

@facebook-github-bot (Contributor) left a comment:

@ailzhang has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@ailzhang requested a review from @ezyang on March 19, 2019 at 14:24.
@ailzhang (Contributor, Author):

@pytorchbot retest this please

@ailzhang (Contributor, Author):

@pytorchbot rebase this please

@ezyang (Contributor) left a comment:

Nice! I didn't know this was a trick you could do. Do you know how it's justified?
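For readers without the inline diff context: the trick in question is presumably the one visible in the hunk further down, computing the product of the two norms with a single square root. With the squared norms $w_1 = \sum_i x_{1,i}^2$ and $w_2 = \sum_i x_{2,i}^2$, it is justified by

$$\|x_1\|\,\|x_2\| = \sqrt{w_1}\,\sqrt{w_2} = \sqrt{w_1 w_2},$$

which holds because both factors are nonnegative, and it replaces two elementwise square roots with one.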

@ezyang (Contributor) commented Mar 19, 2019:

Oh, I see Sam's comment now. If you wanna be nice, put it in a comment :)

@facebook-github-bot (Contributor) left a comment:

@ailzhang has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

```diff
-  return w12.div_((w1 * w2).clamp_min_(eps));
+  Tensor w1 = at::sum(x1 * x1, dim);
+  Tensor w2 = at::sum(x2 * x2, dim);
+  Tensor n12 = (w1 * w2).sqrt_().clamp_min(eps);
```
Collaborator:

Arguably we should use rsqrt here for better precision :)

@colesbury (Member):

This makes the precision worse.

On CPU, rsqrt(x) is implemented as 1/sqrt(x), so that is now three operations with rounding instead of two. (There is no std::rsqrt in C++, and the x86 rsqrt instructions are low-precision.)

With CUDA it's a little different, but sqrt(x) is generally more precise on modern GPUs.
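A standalone scalar sketch of the CPU rounding count (a hypothetical illustration in plain C++, not the ATen kernel; `w12` stands for the dot product and `w1w2` for the product of squared norms):

```cpp
#include <cmath>
#include <cstdio>

// sqrt path: two rounded operations after forming w1w2.
double via_sqrt(double w12, double w1w2) {
  double n = std::sqrt(w1w2);  // rounding 1
  return w12 / n;              // rounding 2
}

// "rsqrt" path on CPU: no std::rsqrt exists, so it is spelled
// 1/sqrt(x), which costs an extra rounded operation.
double via_rsqrt(double w12, double w1w2) {
  double r = 1.0 / std::sqrt(w1w2);  // roundings 1 and 2
  return w12 * r;                    // rounding 3
}

int main() {
  // The two paths can differ in the last bit for the same inputs.
  std::printf("%.17g\n%.17g\n", via_sqrt(0.3, 0.1), via_rsqrt(0.3, 0.1));
}
```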

@ailzhang (Contributor, Author):

Thanks @colesbury! I will send a patch to fix this.

```diff
-  return w12.div_((w1 * w2).clamp_min_(eps));
+  Tensor w1 = at::sum(x1 * x1, dim);
+  Tensor w2 = at::sum(x2 * x2, dim);
+  Tensor n12 = (w1 * w2).rsqrt_().clamp_min(eps);
```
Collaborator:

Thanks! Although you probably now want to do either clamp_min(eps * eps) before rsqrt or clamp_max(1.0 / eps) after rsqrt.

@ailzhang (Contributor, Author):

Ah nice catch! Thanks!
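To spell out the suggestion above (a minimal scalar sketch with hypothetical helper names, assuming eps > 0 and IEEE doubles): because rsqrt is monotone decreasing, clamping the squared-norm product below by eps*eps before the rsqrt yields exactly min(rsqrt(w), 1/eps), the same bound as clamping the result above by 1/eps afterwards. Either form keeps the inverse norm, and hence the cosine quotient, finite for near-zero vectors, whereas the clamp_min(eps) after rsqrt_ in the hunk above bounds the wrong side and never engages when the inverse is huge.

```cpp
#include <algorithm>
#include <cmath>
#include <cstdio>

// Hypothetical scalar versions of the two equivalent fixes.
// w1w2 = ||x1||^2 * ||x2||^2, eps > 0.
double inv_norm_clamp_before(double w1w2, double eps) {
  // clamp_min(eps * eps) before the rsqrt
  return 1.0 / std::sqrt(std::max(w1w2, eps * eps));
}

double inv_norm_clamp_after(double w1w2, double eps) {
  // rsqrt, then clamp_max(1.0 / eps)
  return std::min(1.0 / std::sqrt(w1w2), 1.0 / eps);
}

int main() {
  const double eps = 1e-8;
  const double samples[] = {0.0, 1e-20, 1.0};
  for (double w : samples) {
    // Both variants agree, and both cap the inverse norm at 1/eps.
    std::printf("w=%g  before=%g  after=%g\n", w,
                inv_norm_clamp_before(w, eps), inv_norm_clamp_after(w, eps));
  }
}
```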

@facebook-github-bot (Contributor) left a comment:

@ailzhang has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

zdevito pushed a commit to zdevito/ATen that referenced this pull request on Mar 20, 2019:
Summary:
fixes #18057 according to colesbury's suggestion. Thanks!
cc: ezyang
Pull Request resolved: pytorch/pytorch#18168

Differential Revision: D14520953

Pulled By: ailzhang

fbshipit-source-id: 970e6cfb482d857a81721ec1d0ee4a4df84a0450

Labels: none yet
Projects: none yet

Successfully merging this pull request may close these issues:

- cosine_similarity function produces results more than 1.0 (#18057)

6 participants