Conversation

@mrityunjay-tripathi
Member

In accordance with #2338.
Any other suggestions/ideas are most welcome.

@mrityunjay-tripathi
Member Author

Hi, can anyone tell me what has happened to the Linux builds?

@zoq
Member

zoq commented Mar 27, 2020

I think what you have seen is related to: https://azure.microsoft.com/en-us/blog/our-commitment-to-customers-and-microsoft-cloud-services-continuity/

@mrityunjay-tripathi mrityunjay-tripathi requested a review from zoq March 28, 2020 13:57
Member

@zoq zoq left a comment


Just some really minor style issues, if you can fix those, this is ready from my side.

Member

@favre49 favre49 left a comment


LGTM, could you also add to HISTORY.md?

@kartikdutt18
Member

Hey @mrityunjay-tripathi, @favre49, @zoq, do you think there should be a way to test this?
That is, we could test it for the float datatype (or any other datatype) so that we can ensure everything works, but more importantly also show a new contributor, or someone who likes looking at mlpack code, why we did this (since this aims to solve #2338).
Testing generally makes me feel safer about the code that gets merged. I hope this is okay. Let me know what you think. Thanks a lot.

@mrityunjay-tripathi
Member Author

Hey @mrityunjay-tripathi, @favre49, @zoq, do you think there should be a way to test this?
That is, we could test it for the float datatype (or any other datatype) so that we can ensure everything works, but more importantly also show a new contributor, or someone who likes looking at mlpack code, why we did this (since this aims to solve #2338).
Testing generally makes me feel safer about the code that gets merged. I hope this is okay. Let me know what you think. Thanks a lot.

Sure @kartikdutt18. Testing the precision of all the loss functions is going to be a lot of work, and the currently added loss function tests don't even have precision sufficient for float, let alone double. Or are you saying we should just test whether we get the correct return type? We can check that with a single line, typeid(x).name(), or by checking the precision of the returned value with something like std::numeric_limits<InputType::elem_type>::max_digits10. The point is that if the user feeds in float matrices as input and target, a float value will be returned by armadillo. This seems more like an armadillo thing to me. I'm still trying to figure out what kind of test would make sense.

@kartikdutt18
Member

I don't think we need to add tests for all layers. A single test to show that various datatypes like float can be used as both input and output would be enough. Let me know what you think.

@kartikdutt18
Member

Hey @mrityunjay-tripathi, I looked around and found issue #506. So this is what I think you can do: in the loss function tests, use arma::fmat in some cases (not all, so that we preserve double coverage as well). We need not check the precision or datatype of the output; if this works correctly, which I think it does, it will return the correct output. The way we test is something like proof by contradiction: since arma::fmat shouldn't work with the current loss functions but does after your PR, we can say that templating solved the issue. I hope this makes sense. @zoq, @favre49, I think there might be a better way to test this; could you help us out with the same? Thanks a lot.

@mrityunjay-tripathi
Member Author

It's good that builds are passing :)

@favre49
Member

favre49 commented Mar 31, 2020

Personally, I'm not a fan of the idea of testing float on some tests and double on others; it makes the tests more convoluted and confusing, and doesn't feel like good testing practice. To me, the point of these tests is ensuring the loss functions return the correct "answer". I agree with @mrityunjay-tripathi that the datatype in these tests feels like something we should be trusting Armadillo on.

We can test it for a float datatype (or any other datatype) so that we can ensure everything is perfect but more importantly also show a new contributor or someone who likes looking at mlpack code why we did this

I'm not sure I understood this. Ideally, the fact that this works should be obvious from the documentation, not from the tests.


@mlpack-bot mlpack-bot bot left a comment


Second approval provided automatically after 24 hours. 👍

@kartikdutt18
Member

kartikdutt18 commented Mar 31, 2020

I'm not sure I understood this. Ideally, the fact that this works should be obvious from the documentation, not from the tests.

Generally I feel many people learn about the codebase by looking at the tests and at why certain things are implemented the way they are. Since what we had worked and we are adding another feature, it made sense to me to test the problem it aimed to solve (in this case, using the float datatype) before we merged it.
The way I would have tested it would be:

Create a separate test case, say LossFunctionTemplateTest, and add one or two loss function objects to show they work on float or another datatype. This is what was done for the Valid and Same padding options for convolution layers. The original tests weren't changed at all.
Sample Code:

/**
 * Simple test to show support for various datatypes supported by loss
 * functions. (The MeanSquaredError instantiation below is illustrative.)
 */
BOOST_AUTO_TEST_CASE(LossFunctionTemplateTest)
{
  // Shows that loss functions support various datatypes including float.
  typedef arma::fmat MatType;
  typedef MatType::elem_type ElemType;  // float

  MeanSquaredError<> module;
  MatType input("17.45 12.91 13.63 29.01 7.12 15.47 31.52 31.97");
  MatType target("16.52 13.11 13.67 29.51 24.31 15.03 30.72 34.07");
  ElemType loss = module.Forward(input, target);
  BOOST_REQUIRE_CLOSE_FRACTION(loss, 2.410631F, 0.00001F);

  // We also support uint or any other datatype that is supported by
  // armadillo.
  arma::umat uintInput = arma::ones<arma::umat>(10, 1);
  arma::umat uintTarget = arma::ones<arma::umat>(10, 1);
  loss = module.Forward(uintInput, uintTarget);
}

Maybe it's a problem with me; I consider testing to be absolutely necessary for any feature addition, but maybe it's unnecessary here. If you feel that it is unnecessary, @mrityunjay-tripathi can use git revert to go back to the previous state.
Thanks a lot.

@mrityunjay-tripathi
Member Author

That's okay. No test is unnecessary. Let's wait and see what other members think.

@zoq
Member

zoq commented Mar 31, 2020

I like the general idea; in ensmallen, we run each test against arma::mat and arma::fmat. Here is an example: https://github.com/mlpack/ensmallen/blob/master/tests/ftml_test.cpp.

So ideally we could do the same here in mlpack, and that includes everything, not only the ann methods. That said, I think it's out of scope for this particular PR. Also, we have to make sure the datatype used makes sense: using arma::umat might work but the output is probably not what we want without proper quantization.

@zoq
Member

zoq commented Mar 31, 2020

So personally I would revert the change; the changes work with our main datatype right now, so it doesn't break anything.

@kartikdutt18
Member

I like the general idea, in ensmallen, we run each test against arma::mat and arma::fmat. Here is an example: https://github.com/mlpack/ensmallen/blob/master/tests/ftml_test.cpp.

This looks nice. It should not be hard to extend the tests for fmat as well; I think we would be able to close #1062 once methods (other than ann) also support float and are tested. Do you mind if I open an issue for new contributors to get involved with the codebase through testing? I could also create a PR if this seems like something we can add.

using arma::umat might work but the output is probably not what we want without proper quantization.

Agreed, that makes sense.

@kartikdutt18
Member

Till then, @mrityunjay-tripathi, kindly revert the changes and we can merge this. Hopefully tests for fmat etc. will follow soon. Sorry that you and @favre49 had to deal with me. Thanks a lot.

Member

@zoq zoq left a comment


Thanks for the contribution, no more comments from my side.

@zoq zoq merged commit 0382c3e into mlpack:master Apr 1, 2020
@zoq
Member

zoq commented Apr 1, 2020

Thanks again, that is a good basis we can build on.
