Document `core/math/` #3615

rcurtin · 2024-01-31T15:32:14Z

I went through and documented all the functions in core/math/. The result is this updated core.md page where all examples are tested:

https://www.ratml.org/misc/mlpack-markdown-doc/core.html

There were quite a number of changes I made here, and many things are removed:

ClampNonNegative(), ClampNonPositive(), and ClampRange() are removed, preferring instead std::max() and std::min(). That code dates back to ~2007 or so, and the code claimed empirical improvements. I couldn't reproduce any of them, and these implementations were generally noticeably slower than, say, std::max(lo, std::min(x, hi)).
RemoveRows() can be expressed as a simple call to .rows() in a matrix, so I removed that. I had to refactor LocalCoordinateCoding and SparseCoding to track the active atoms instead of the inactive atoms as a result; basically, RemoveRows() let you give a vector of row indexes to remove, and .rows() requires the rows you want to keep. So I had to "invert" what we were keeping track of.
VectorPower() was never used anywhere, so I removed it. (I forget what used it years ago.)
Center() is so simple that I just inlined its implementation.
WhitenUsingSVD(), Orthogonalize(), Svec(), SvecIndex(), SymKronId(), and Sign() weren't used anywhere, so I removed them. I believe a couple of those are used in ensmallen and are in that codebase now.
ObtainDistinctSamples() can be replaced with arma::randperm() and so it is now removed.

I also made a couple small bugfixes:

Loading grayscale images had a bug; STB will return the actual number of channels in the image even if loading was done in grayscale, and in that case we shouldn't set info.Channels(). See the STB documentation.
ColumnsToBlocks had a minor offset calculation bug when the margin size is greater than 1 pixel. Easy fix.

Lastly, I chose not to document MakeAlias() at this time, because there is the outstanding issue of how it should be refactored, since there is another implementation (slightly different) of the same function in src/mlpack/methods/ann/.

Whatever empirical improvements were seen in 2007 aren't seen today; a simpler implementation with std::max() and std::min() appears to always outperform by a relatively small margin. So, prefer the less-maintenance approach instead, and depend on the standard library.

…Col more carefully.

shrit

Everything looks good, we should merge this as soon as possible.

Three minor observations:

In this PR I can see load.md and index.md files but I remember that I have reviewed these previously, also if not I did not find the link in here.
We have the Mahalanobis distance, but it is not here, maybe not yet ?
For the RBF kernel I did not know we could just compute it based on the distance, does this mean that it will work on any arbitrary distance or do we need to have the point vectors already declared, if this is the case we should mention that these are related.

shrit · 2024-02-06T12:58:57Z

src/mlpack/methods/sparse_coding/sparse_coding_impl.hpp

  for (size_t j = 0; j < atoms; ++j)
  {
-    if (arma::accu(codes.row(j) != 0) == 0)
-      inactiveAtoms.push_back(j);
+    if (arma::any(codes.row(j) != 0))
+      activeAtoms.push_back((arma::uword) j);
  }


I would just define j as arma::uword instead of casting it later, but eventually it should be the same

Agreed, that's easier; done in 9b8a4ec. 👍

mlpack-bot

Second approval provided automatically after 24 hours. 👍

rcurtin · 2024-02-21T21:42:17Z

Everything looks good, we should merge this as soon as possible.

Ha, then life happened to me... 😄

Three minor observations:

In this PR I can see load.md and index.md files but I remember that I have reviewed these previously, also if not I did not find the link in here.

Those were actually part of #3603, which should be merged very shortly.

We have the Mahalanobis distance, but it is not here, maybe not yet ?

Yep, this class was actually documented as part of #3603, and in this PR I only took care of core/math/ (not yet core/metrics/), so MahalanobisDistance will be added, just haven't done it yet.

For the RBF kernel I did not know we could just compute it based on the distance, does this mean that it will work on any arbitrary distance or do we need to have the point vectors already declared, if this is the case we should mention that these are related.

Hm, I thought maybe this was already clear enough given how it's written---you can call Evaluate(3.0) and it will give a result, no need to even know about the points. Maybe the example that's already there demonstrates this well enough?

// Evaluate the kernel value when the distance between two points is already
// computed.
const double distance = 1.5;
const double k3 = g.Evaluate(distance);

rcurtin · 2024-02-21T21:43:57Z

I'll merge this one too once the build passes. Thanks for the review @shrit, sorry it took me so long to get back to 😄

rcurtin added 23 commits January 23, 2024 08:57

Remove entirely unused VectorPower().

9c87ae9

Remove unused functionality from lin_alg.hpp.

46e0221

Replace RemoveRows() with a simple call to rows().

1b3849b

Don't use math:: namespace for Range; it doesn't exist anymore.

0820672

Document ColumnCovariance().

13b4d53

Document ColumnsToBlocks.

163f224

Basic documentation outline for digamma/trigamma; still needs some work.

70cc7ae

Remove Center() as its implementation is so simple.

5be9a90

Fix minor errors in inlined Center() implementation.

1105235

Fix includes after lin_alg.hpp move.

9e2a7fb

Document logarithmic utilities.

50c475c

Document random number utilities.

81c59a1

Document RandomBasis() and add examples for random number utilities.

45f3bf8

Document last few remaining functions in core/math/.

21b4440

Fix bug when loading grayscale: make sure info.Channels() is 1.

1accad6

Fix offset bug: when bufSize > 1, we have to calculate maxRow and max…

b674f5e

…Col more carefully.

Fix minor compilation issues in examples.

cee933d

Handle some TODOs.

b5b8a13

Minor wording changes and other fixes.

08b3cd1

Merge branch 'main-docs' into core-docs

4c32f66

Merge remote-tracking branch 'origin/master' into core-docs

7f95dcb

Remove lin_alg_impl.hpp (accidentally re-added).

be93b1b

rcurtin added c: core c: documentation t: added feature labels Jan 31, 2024

rcurtin added 2 commits January 31, 2024 10:35

Remove ObtainDistinctSamples().

c2d0ce7

Oops, fix typo in Center() change.

0b3e6f3

shrit approved these changes Feb 6, 2024

View reviewed changes

mlpack-bot bot approved these changes Feb 7, 2024

View reviewed changes

Merge remote-tracking branch 'rcurtin/main-docs' into core-docs

78a934d

Use arma::uword instead.

9b8a4ec

Merge remote-tracking branch 'origin/master' into core-docs

6cf2287

rcurtin mentioned this pull request Feb 23, 2024

Using, std:: for standard library functions that does not have any. #3629

Merged

rcurtin merged commit 23dc135 into mlpack:master Feb 23, 2024

rcurtin deleted the core-docs branch February 23, 2024 22:20

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Document `core/math/` #3615

Document `core/math/` #3615

Uh oh!

rcurtin commented Jan 31, 2024 •

edited

Loading

Uh oh!

shrit left a comment •

edited

Loading

Uh oh!

shrit Feb 6, 2024

Uh oh!

rcurtin Feb 21, 2024

Uh oh!

mlpack-bot bot left a comment

Uh oh!

rcurtin commented Feb 21, 2024 •

edited

Loading

Uh oh!

rcurtin commented Feb 21, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Document core/math/ #3615

Document core/math/ #3615

Uh oh!

Conversation

rcurtin commented Jan 31, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

shrit left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

shrit Feb 6, 2024

Choose a reason for hiding this comment

Uh oh!

rcurtin Feb 21, 2024

Choose a reason for hiding this comment

Uh oh!

mlpack-bot bot left a comment

Choose a reason for hiding this comment

Uh oh!

rcurtin commented Feb 21, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

rcurtin commented Feb 21, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Document `core/math/` #3615

Document `core/math/` #3615

rcurtin commented Jan 31, 2024 •

edited

Loading

shrit left a comment •

edited

Loading

rcurtin commented Feb 21, 2024 •

edited

Loading