Implement torch.nn.Embedding / EmbeddingBag in PyTorch C++ API #26358
Conversation
… var names, updated get var calls
yf225
left a comment
Thanks a lot @anjali411 ! The PR looks awesome as a start. I left some comments regarding the module options and how we implement the constructor and forward.
struct TORCH_API EmbeddingOptions {
  EmbeddingOptions(int64_t count, int64_t dimension);
  /// The number of embeddings (number of rows in the table).
  // The number of embeddings (number of rows in the table).
We should actually use /// instead of // because we need all the parameter comments to show up as docs in https://pytorch.org/cppdocs/api/structtorch_1_1nn_1_1_embedding_options.html#exhale-struct-structtorch-1-1nn-1-1-embedding-options.
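As a rough illustration only (a sketch; the num_embeddings / embedding_dim names follow the rename discussed below), the documented fields would look like:

#include <torch/arg.h>
#include <cstdint>

struct EmbeddingOptionsSketch {
  /// The size of the dictionary of embeddings -- a `///` comment is exported to cppdocs.
  TORCH_ARG(int64_t, num_embeddings) = 0;
  /// The size of each embedding vector -- a plain `//` comment would not show up there.
  TORCH_ARG(int64_t, embedding_dim) = 0;
};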
Also I think we should change the comment to match the Python side:
pytorch/torch/nn/modules/sparse.py
Line 17 in 9181b9c
num_embeddings (int): size of the dictionary of embeddings
// The number of embeddings (number of rows in the table).
TORCH_ARG(int64_t, count);
/// The size of each embedding vector (number of columns in the table).
// The size of each embedding vector (number of columns in the table).
ditto for matching Python side:
pytorch/torch/nn/modules/sparse.py
Line 18 in 9181b9c
embedding_dim (int): the size of each embedding vector
TORCH_ARG(int64_t, num_embeddings);
// The size of each embedding vector (number of columns in the table).
TORCH_ARG(int64_t, embedding_dim);
// If given, pads the output with the embedding vector at :attr:`padding_idx (initialized to zeros) whenever it encounters the index.
:attr: probably doesn't work in C++ API docs now, and we can change it to:
- // If given, pads the output with the embedding vector at :attr:`padding_idx (initialized to zeros) whenever it encounters the index.
+ // If given, pads the output with the embedding vector at `padding_idx` (initialized to zeros) whenever it encounters the index.
// The p of the p-norm to compute for the :attr:`max_norm` option. Default ``2``
TORCH_ARG(float, norm_type)=2.;
// If given, this will scale gradients by the inverse of frequency of the words in the mini-batch. Default ``False``.
TORCH_ARG(bool, scale_grad_by_freq)=false;
nit:
- TORCH_ARG(bool, scale_grad_by_freq)=false;
+ TORCH_ARG(bool, scale_grad_by_freq) = false;
// If given, pads the output with the embedding vector at :attr:`padding_idx (initialized to zeros) whenever it encounters the index.
TORCH_ARG(c10::optional<int64_t>, padding_idx)=c10::nullopt;
// If given, each embedding vector with norm larger than :attr:`max_norm` is renormalized to have norm :attr:`max_norm`.
TORCH_ARG(c10::optional<float>, max_norm)=c10::nullopt;
We might not need to specify c10::nullopt for padding_idx and max_norm, because the default value of c10::optional should already be c10::nullopt.
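For example, a minimal sketch (hypothetical struct name) that relies on the default-constructed optional being c10::nullopt:

#include <torch/arg.h>
#include <c10/util/Optional.h>
#include <cstdint>

struct OptionalArgsSketch {
  // A default-constructed c10::optional is already c10::nullopt,
  // so no explicit "= c10::nullopt" is needed on these fields.
  TORCH_ARG(c10::optional<int64_t>, padding_idx);
  TORCH_ARG(c10::optional<float>, max_norm);
};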
else{
assert((padding_idx >= -num_embeddings) && "Padding_idx must be within num_embedding");
*padding_idx = *padding_idx+num_embeddings;
options.padding_idx_ = padding_idx;
- options.padding_idx_ = padding_idx;
+ options.padding_idx(padding_idx);
}
else{
assert((*options.padding_idx() >= -(*options.weight()).size(0)) && "Padding_idx must be within num_embedding");
options.padding_idx(*options.padding_idx() + (*options.weight_).size(0));
minor nit: we should likely swap the order here to match the Python implementation:
- options.padding_idx(*options.padding_idx() + (*options.weight_).size(0));
+ options.padding_idx((*options.weight_).size(0) + *options.padding_idx());
also options.weight_ can be changed to options._weight(), assuming we rename weight to _weight.
}

if(options.max_norm() != c10::nullopt){
input.contiguous();
.contiguous() is not an in-place function, and we should likely do:
- input.contiguous();
+ input = input.contiguous();
This also matches the Python implementation.
I also think we need to implement and call _no_grad_embedding_renorm_ here.
for _no_grad_embedding_renorm_, we can do this:
#include <torch/utils.h>

{
  torch::NoGradGuard no_grad;
  torch::embedding_renorm(...);
}

if(options.max_norm() != c10::nullopt){
input.contiguous();
}
return torch::embedding(*options.weight(), /*indices=*/input, *options.padding_idx(), options.scale_grad_by_freq(), options.sparse());
This should pass weight instead of *options.weight(), once we put weight back as the module's attribute.
"weight", torch::empty({options.count_, options.dimension_}));
NoGradGuard guard;
weight.normal_(0, 1);
(*(options.weight_)).Tensor::normal_(0, 1);
We can use torch::nn::init::normal_(weight), if we include the torch/nn/init.h header.
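A minimal sketch of that reset step (assuming weight is the module's registered parameter tensor):

#include <torch/nn/init.h>
#include <torch/torch.h>

void reset_weight_sketch(torch::Tensor& weight) {
  // Fills weight in-place from N(0, 1); the init function runs without
  // tracking gradients, so no explicit NoGradGuard is needed here.
  torch::nn::init::normal_(weight);
}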
yf225
left a comment
Thanks @anjali411 ! I left some comments
test/cpp/api/modules.cpp
Outdated
ASSERT_EQ(
c10::str(Embedding(10, 2)),
"torch::nn::Embedding(count=10, dimension=2)");
c10::str(Embedding(num_embeddings=10, embedding_dim=2)),
keyword arguments don't work here because we are dealing with C++ :/ The expectation is that the user knows the first two arguments are num_embeddings and embedding_dim, and they can just call:
- c10::str(Embedding(num_embeddings=10, embedding_dim=2)),
+ c10::str(Embedding(10, 2)),
test/cpp/api/modules.cpp
Outdated
c10::str(Embedding(num_embeddings=10, embedding_dim=2)),
"torch::nn::Embedding(num_embeddings=10, embedding_dim=2)");
ASSERT_EQ(
c10::str(Embedding(num_embeddings=10, embedding_dim=2, padding_idx=3, max_norm=2)),
Since this one involves optional arguments, we would write:
- c10::str(Embedding(num_embeddings=10, embedding_dim=2, padding_idx=3, max_norm=2)),
+ c10::str(Embedding(EmbeddingOptions(10, 2).padding_idx(3).max_norm(2))),
test/cpp/api/modules.cpp
Outdated
c10::str(Embedding(num_embeddings=10, embedding_dim=2, padding_idx=3, max_norm=2)),
"torch::nn::Embedding(num_embeddings=10, embedding_dim=2, padding_idx=3, max_norm=2)");
ASSERT_EQ(
c10::str(Embedding(num_embeddings=10, embedding_dim=2, padding_idx=3, max_norm=2, norm_type=2.5, scale_grad_by_freq=true, sparse=true)),
ditto here, we would write:
- c10::str(Embedding(num_embeddings=10, embedding_dim=2, padding_idx=3, max_norm=2, norm_type=2.5, scale_grad_by_freq=true, sparse=true)),
+ c10::str(Embedding(EmbeddingOptions(10, 2).padding_idx(3).max_norm(2).norm_type(2.5).scale_grad_by_freq(true).sparse(true))),
test/cpp/api/modules.cpp
Outdated
fc(register_module("fc", torch::nn::Linear(4, 5))),
table(register_module("table", torch::nn::Embedding(10, 2))),
table(register_module("table", torch::nn::Embedding(10, 2, padding_idx=3, max_norm=2))),
table(register_module("table", torch::nn::Embedding(10, 2, padding_idx=3, max_norm=2, norm_type=2.5, scale_grad_by_freq=true, sparse=true))),
ditto here, we would need to use EmbeddingOptions to express both the required arguments and optional arguments.
test/cpp/api/modules.cpp
Outdated
" (table): torch::nn::Embedding(count=10, dimension=2)\n"
" (table): torch::nn::Embedding(num_embeddings=10, embedding_dim=2)\n"
" (table): torch::nn::Embedding(num_embeddings=10, embedding_dim=2, padding_idx=3, max_norm=2)"
" (table): torch::nn::Embedding(num_embeddings=10, embedding_dim=2, padding_idx=3, max_norm=2, norm_type=2.5, scale_grad_by_freq=true, sparse=true)"
ditto here, we would need to use EmbeddingOptions to express both the required arguments and optional arguments.
return torch::embedding(weight, /*indices=*/input);
if(options.padding_idx() != c10::nullopt){
if(*options.padding_idx() > 0){
assert((*options.padding_idx() < (weight.size(0)) && "Padding_idx must be within num_embeddings");
ditto for TORCH_CHECK(condition, message)
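i.e. roughly like this (a sketch of the shape of the check; the exact message wording is up to the author):

#include <torch/torch.h>

// Hypothetical helper illustrating TORCH_CHECK(condition, message) for the
// padding_idx bounds check, mirroring the Python-side validation.
void check_padding_idx(int64_t padding_idx, const torch::Tensor& weight) {
  if (padding_idx > 0) {
    TORCH_CHECK(padding_idx < weight.size(0),
                "Padding_idx must be within num_embeddings");
  } else if (padding_idx < 0) {
    TORCH_CHECK(padding_idx >= -weight.size(0),
                "Padding_idx must be within num_embeddings");
  }
}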
assert((*options.padding_idx() < (weight.size(0)) && "Padding_idx must be within num_embeddings");
}
else{
assert((*options.padding_idx() >= -(weight.size(0)) && "Padding_idx must be within num_embedding");
ditto for TORCH_CHECK(condition, message)
if(*options.padding_idx() > 0){
assert((*options.padding_idx() < (weight.size(0)) && "Padding_idx must be within num_embeddings");
}
else{
same situation here, we should check `} else if (*options.padding_idx() < 0) {` instead, to match the Python implementation
if(options.max_norm() != c10::nullopt){
input = input.contiguous();
torch::NoGradGuard no_grad;
torch::embedding_renorm(weight, input, *options.max_norm(), options.norm_type());
We likely want torch::embedding_renorm_ instead of torch::embedding_renorm
if(options.scale_grad_by_freq()){
stream << ",scale_grad_by_freq=" << options.scale_grad_by_freq();
}
if(options.sparse()){
A note on formatting: in general we would want to write `if (options.sparse()) {` instead of `if(options.sparse()){`, and `} else {` instead of putting `else {` on its own line after the closing brace, to be consistent with the formatting of the other parts of the C++ API.
@pytorchbot rebase this please
yf225
left a comment
I did an initial pass, and will do another one tomorrow.
Tensor forward(const Tensor& indices);

static EmbeddingImpl& from_pretrained(Tensor embeddings, bool freeze = true, c10::optional<int64_t> padding_idx = c10::nullopt,
    c10::optional<float> max_norm = c10::nullopt, float norm_type = 2., bool scale_grad_by_freq = false, bool sparse = false);
Sorry for giving the wrong idea about how to implement from_pretrained: I think we will need to move from_pretrained to the Embedding class, not EmbeddingImpl, because we want people to be able to use it with torch::nn::Embedding::from_pretrained(...). I wrote a gist to illustrate the idea: https://gist.github.com/yf225/8eee0ef3f6afd2317092900927a43994.
We should also use EmbeddingOptions instead of writing out the parameters explicitly, as shown in the gist.
Also the return type shouldn't be a reference, and should just be Embedding (after we move this to the Embedding class).
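Putting the last few comments together, the shape of the API would be roughly as follows. This is only a sketch following the gist's idea; the option handling and the weight member access are assumptions, not the final code:

#include <torch/torch.h>

namespace sketch {

// Instead of generating the holder with the TORCH_MODULE(Embedding) macro,
// hand-write it so a static from_pretrained can live on the Embedding class.
class Embedding : public torch::nn::ModuleHolder<torch::nn::EmbeddingImpl> {
 public:
  using torch::nn::ModuleHolder<torch::nn::EmbeddingImpl>::ModuleHolder;

  // Returns Embedding by value (not a reference), so users can write
  // torch::nn::Embedding::from_pretrained(...).
  static Embedding from_pretrained(const torch::Tensor& embeddings, bool freeze = true) {
    TORCH_CHECK(embeddings.dim() == 2,
                "Embeddings parameter is expected to be 2-dimensional");
    Embedding embedding(
        torch::nn::EmbeddingOptions(embeddings.size(0), embeddings.size(1)));
    {
      torch::NoGradGuard no_grad;
      embedding->weight.copy_(embeddings);  // assumes a public weight member
    }
    embedding->weight.set_requires_grad(!freeze);
    return embedding;
  }
};

} // namespace sketch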
c10::optional<torch::Tensor> per_sample_weights = c10::nullopt);

static EmbeddingBagImpl& EmbeddingBagImpl::from_pretrained(Tensor embeddings, bool freeze = true, c10::optional<float> mex_norm = c10::nullopt,
    float norm_type = 2., bool scale_grad_by_freq = false, string mode = "sum", bool sparse = false);
ditto here: we would need to move from_pretrained to EmbeddingBag class, so that people can do torch::nn::EmbeddingBag::from_pretrained(...).
ditto for using EmbeddingBagOptions as well.
torch::nn::init.normal_(weight);
}
else {
TORCH_CHECK((weight.size(0) == options.num_embeddings()) && (weight.size(1) == options.embedding_dim()), "Shape of _weight does not match num_embeddings and embedding_dim");
I think there are two issues here:
- We should be checking the sizes of `_weight` instead of `weight`, to match the Python implementation.
- In the error message, it should say "Shape of weight" instead of "Shape of _weight", to match the Python implementation.
We can also improve the size checking with the following, to better match Python side:
- TORCH_CHECK((weight.size(0) == options.num_embeddings()) && (weight.size(1) == options.embedding_dim()), "Shape of _weight does not match num_embeddings and embedding_dim");
+ TORCH_CHECK((*options._weight()).sizes() == torch::IntArrayRef({options.num_embeddings(), options.embedding_dim()}), "Shape of weight does not match num_embeddings and embedding_dim");
std::tuple<Tensor, Tensor, Tensor, Tensor> EmbeddingBagImpl::forward(const Tensor& input, c10::optional<torch::Tensor> offsets,
    c10::optional<torch::Tensor> per_sample_weights) {

TORCH_CHECK(per_sample_weights == c10::nullopt || ((input.size(0) == per_sample_weights.size(0)) && input.size(1) == per_sample_weights.size(1)),
We can simplify this as:
- TORCH_CHECK(per_sample_weights == c10::nullopt || ((input.size(0) == per_sample_weights.size(0)) && input.size(1) == per_sample_weights.size(1)),
+ TORCH_CHECK(per_sample_weights == c10::nullopt || input.sizes() == per_sample_weights.sizes())
TORCH_CHECK(per_sample_weights == c10::nullopt || ((input.size(0) == per_sample_weights.size(0)) && input.size(1) == per_sample_weights.size(1)),
"embedding_bag: If per_sample_weights ({", per_sample_weights.size(0), ", ", per_sample_weights.size(1), "}) is not null,
then it must have the same shape as the input ({", input.size(0), ", ", input.size(1), "})\n");
I think we might want to write the error message as follows:
"embedding_bag: If per_sample_weights (", per_sample_weights.sizes(), ") is not null, ",
"then it must have the same shape as the input (", input.sizes(), ")");
Specifically there are a few issues with the original error message:
- The spaces before "then it must" will be printed to the screen when the error message shows, which is likely not what we want.
- We don't need `\n` at the end of the message because `TORCH_CHECK` will handle it automatically.
- We can use `.sizes()` to simplify the size printing.
if(input.dim() == 2) {
TORCH_CHECK(offsets == c10::nullopt,
"if input is 2D, then offsets has to be null, as input is treated is a mini-batch of
fixed length sequences. However, found an offsets Tensor"); //check about adding type
ditto here: we likely shouldn't put spaces before "fixed length sequences", and should do
"if input is 2D, then offsets has to be null, as input is treated is a mini-batch of ",
"fixed length sequences ...");
We can probably just write "However, found offsets of type Tensor", since this is enough information for the user to fix the issue.
}
}

if (!options._weight().has_value()) {
We might want to do `if (options._weight() == c10::nullopt) {` to look more similar to the Python implementation.
stream << ",norm_type=" << options.norm_type();
}
if(options.scale_grad_by_freq()) {
stream << ",scale_grad_by_freq=" << options.scale_grad_by_freq();
- stream << ",scale_grad_by_freq=" << options.scale_grad_by_freq();
+ stream << ", scale_grad_by_freq=" << options.scale_grad_by_freq();
if(options.scale_grad_by_freq()) {
stream << ",scale_grad_by_freq=" << options.scale_grad_by_freq();
}
stream << ",mode="<<mode<<")";
- stream << ",mode="<<mode<<")";
+ stream << ", mode=" << mode << ")";
EmbeddingBagImpl& EmbeddingBagImpl::from_pretrained(Tensor embeddings, bool freeze = true, c10::optional<float> mex_norm = c10::nullopt,
    float norm_type = 2., bool scale_grad_by_freq = false, string mode = "sum", bool sparse = false) {
TORCH_CHECK(embeddings.dim() == 2, "Embeddings parameter is expected to be 2-embedding_dimal");
- TORCH_CHECK(embeddings.dim() == 2, "Embeddings parameter is expected to be 2-embedding_dimal");
+ TORCH_CHECK(embeddings.dim() == 2, "Embeddings parameter is expected to be 2-dimensional");
norm_type=norm_type,
scale_grad_by_freq=scale_grad_by_freq,
mode=mode,
sparse=sparse);
We need to pass the EmbeddingBag options to the constructor, because keyword arguments are not supported in C++. See the sketch below.
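e.g. along these lines (a sketch only; the EmbeddingBagOptions names and types are the ones proposed in this PR, so treat them as assumptions):

#include <torch/torch.h>
#include <string>

// Hypothetical helper: collect the settings in an EmbeddingBagOptions object
// built from the pretrained tensor's shape, then hand it to the constructor.
torch::nn::EmbeddingBag make_embedding_bag(const torch::Tensor& embeddings,
                                           float norm_type,
                                           bool scale_grad_by_freq,
                                           const std::string& mode,
                                           bool sparse) {
  auto options =
      torch::nn::EmbeddingBagOptions(embeddings.size(0), embeddings.size(1))
          .norm_type(norm_type)
          .scale_grad_by_freq(scale_grad_by_freq)
          .mode(mode)
          .sparse(sparse);
  return torch::nn::EmbeddingBag(options);
}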
Summary:
With this PR, we establish the following conventions:
1. Options in C++ module / optimizer constructors should always be `const SomeOptions&` type, not `SomeOptions` type.
2. The options constructor arg should always be named `options_`, not `options`, to not be confused with the module / optimizer's internal field `options`.
3. We never use `std::move` to assign `options_` to the module / optimizer's internal field `options` in the constructor definition. Instead, we simply use `options(options_)`.
Here is the reasoning: We might be tempted to declare the constructor as `SomeModule(SomeOptions options_)` and have `options(std::move(options_))` in the member initialization list. However, this can be a dangerous design because the constructor might use `options_` to set values for other member fields in the member initialization list (e.g. https://github.com/pytorch/pytorch/blob/8317f75b79fb78ceeeb928aa23a901d57274b9e1/torch/csrc/api/include/torch/optim/lbfgs.h#L30-L34), and use-after-move can cause hard-to-debug problems. Instead, we choose to explicitly use `const SomeOptions&` type for `options_`, and never use `std::move` to assign it to the internal `options` field. This way we have stronger guarantee on the validity of `options_` at any point in the constructor.
Notable exceptions to the above conventions:
1. C++ Embedding module doesn't adhere to the conventions now, which will be fixed after #26358 is landed.
2. C++ dataloader and dataset classes likely need similar changes. We will do it when we start to work on dataloader/dataset parity.
Thanks ShahriarSS for discovering the options usage inconsistency! 🚀
Pull Request resolved: #26483
Differential Revision: D17500451
Pulled By: yf225
fbshipit-source-id: 49361a3519e4ede933789db75731d40144f0b617
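In code, the convention described above amounts to something like this (a generic sketch, not taken from the PR):

#include <torch/torch.h>

struct SomeOptions {
  TORCH_ARG(int64_t, size) = 0;
};

struct SomeModuleImpl : torch::nn::Module {
  // Take the options by const reference, name the argument options_, and copy
  // it into the internal field without std::move, so options_ stays valid for
  // any other member initializers that read it.
  explicit SomeModuleImpl(const SomeOptions& options_) : options(options_) {}

  SomeOptions options;
};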
@anjali411 There seems to be a conflict with master - please feel free to just use your version :)
test/cpp/api/modules.cpp
Outdated
" (table): torch::nn::Embedding(num_embeddings=10, embedding_dim=2, padding_idx=3, max_norm=2, norm_type=2.5, scale_grad_by_freq=true, sparse=true)"
" (inner): InnerTestModule(\n"
" (fc): torch::nn::Linear(in=3, out=4, with_bias=true)\n"
" (table): torch::nn::Embedding(count=10, dimension=2)\n"
We probably need to add tests for EmbeddingBag and Embedding::from_pretrained to the test suite. Some examples are as follows, copied from Python docs (https://pytorch.org/docs/stable/nn.html#embedding):
>>> # an Embedding module containing 10 tensors of size 3
>>> embedding_sum = nn.EmbeddingBag(10, 3, mode='sum')
>>> # a batch of 2 samples of 4 indices each
>>> input = torch.LongTensor([1,2,4,5,4,3,2,9])
>>> offsets = torch.LongTensor([0,4])
>>> embedding_sum(input, offsets)
tensor([[-0.8861, -5.4350, -0.0523],
        [ 1.1306, -2.5798, -1.0044]])

>>> # FloatTensor containing pretrained weights
>>> weight = torch.FloatTensor([[1, 2.3, 3], [4, 5.1, 6.3]])
>>> embeddingbag = nn.EmbeddingBag.from_pretrained(weight)
>>> # Get embeddings for index 1
>>> input = torch.LongTensor([[1, 0]])
>>> embeddingbag(input)
tensor([[ 2.5000, 3.7000, 4.6500]])

>>> # FloatTensor containing pretrained weights
>>> weight = torch.FloatTensor([[1, 2.3, 3], [4, 5.1, 6.3]])
>>> embedding = nn.Embedding.from_pretrained(weight)
>>> # Get embeddings for index 1
>>> input = torch.LongTensor([1])
>>> embedding(input)
tensor([[ 4.0000, 5.1000, 6.3000]])
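A C++ counterpart in test/cpp/api/modules.cpp could look roughly like this (a sketch that assumes the from_pretrained API added in this PR; the real test file defines its own ModulesTest fixture):

#include <gtest/gtest.h>
#include <torch/torch.h>

// Stand-in fixture for this sketch only.
struct ModulesTestSketch : ::testing::Test {};

TEST_F(ModulesTestSketch, EmbeddingFromPretrained) {
  // Mirrors the Python docs example: pretrained weights, then look up index 1.
  auto weight = torch::tensor({1.0, 2.3, 3.0, 4.0, 5.1, 6.3}).reshape({2, 3});
  auto embedding = torch::nn::Embedding::from_pretrained(weight);
  auto input = torch::tensor({1}, torch::kLong);
  ASSERT_TRUE(torch::allclose(embedding(input),
                              torch::tensor({4.0, 5.1, 6.3}).reshape({1, 3})));
}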
@pytorchbot rebase this please

There's nothing to do! This branch is already up to date with master (46539ee). (To learn more about this bot, see Bot commands.)
…ruct a module which has no default constructor
facebook-github-bot
left a comment
@anjali411 has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
@pytorchbot rebase this please
yf225
left a comment
Thanks so much for the awesome work @anjali411 !
Note to self: write BC-breaking notes before landing this PR.
facebook-github-bot
left a comment
@yf225 has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
…ch#26358)
Summary: added more variables to EmbeddingOptions and updated EmbeddingImpl reset, forward functions. Also added EmbeddingBag.
-----
This PR is BC-breaking in the following way:
Previously, `EmbeddingOptions` supports `count` and `dimension` as options arguments. After this PR, they are renamed to `num_embeddings` and `embedding_dim` respectively.
Pull Request resolved: pytorch#26358
Differential Revision: D17714337
Pulled By: yf225
fbshipit-source-id: f9f969c68e4bece106b92f8e2e02ac39c8455fb7
added more variables to EmbeddingOptions and updated EmbeddingImpl reset, forward functions. Also added EmbeddingBag.
This PR is BC-breaking in the following way:
Previously, `EmbeddingOptions` supported `count` and `dimension` as options arguments. After this PR, they are renamed to `num_embeddings` and `embedding_dim` respectively.