Conversation

@eendebakpt (Contributor):
We can improve performance of np.linalg.det for small arrays by up to 40% with 3 changes:

  • Move _assert_stacked_2d check into _assert_stacked_square
  • Improve performance of _commonType by using a cache
  • Avoid `r.astype(...)` making a copy of the data for scalar arguments (it internally converts to an array and back)

In this PR we perform the first step.

@eendebakpt eendebakpt marked this pull request as draft April 4, 2025 15:18
@eendebakpt eendebakpt marked this pull request as ready for review April 4, 2025 16:01
@tylerjereddy (Contributor) left a comment:
So far asv isn't showing much perf improvement on this branch on x86_64 Linux (i9-13900K):

```shell
asv continuous -E virtualenv -e -b "time_det.*" main linalg_refactor
```

BENCHMARKS NOT SIGNIFICANTLY CHANGED.

It may be because you're only making the first of the series of proposed changes.

Regardless of the performance changes, I suppose this is a reduction in lines of code, so maybe "ok" on its own anyway.

```diff
 if issubclass(type_, inexact):
-    if isComplexType(type_):
-        is_complex = True
+    is_complex = is_complex or isComplexType(type_)
```
Contributor:
Is this really worth doing? It takes my brain longer to process and shouldn't matter much performance-wise?

Contributor:
I'm also not sure this routine is worth it, but if one goes for it, I'd start with something like

```python
types = set(a.dtype.type for a in arrays)
```

which will generally reduce the number of types to check, and then do

```python
is_complex = any(isComplexType(type_) for type_ in types)
```

and something along similar lines for `result_type` (but with a check whether one can really not simply use the built-in `np.result_type` - not obvious to me).

But I'd do it in a separate PR.
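The set-based suggestion above can be sketched as a standalone helper. This is hypothetical code, not the PR's implementation: `is_complex_type` here is a simplified stand-in for numpy.linalg's internal `isComplexType`, and `common_info` is an invented name:

```python
import numpy as np


def is_complex_type(t):
    # Simplified stand-in for numpy.linalg's internal isComplexType.
    return issubclass(t, np.complexfloating)


def common_info(*arrays):
    # Deduplicate dtypes first, as suggested above, then scan the
    # (typically very small) set instead of all input arrays.
    types = set(a.dtype.type for a in arrays)
    is_complex = any(is_complex_type(t) for t in types)
    result_type = np.result_type(*types)
    return is_complex, result_type
```

As noted in the reply below, for the common case of one or two input arrays the `set` construction itself may cost more than it saves, so this would need benchmarking.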

Contributor (Author):
Interesting idea! It might not work out, because in the common use case len(arrays) is just 1 or 2 and the set overhead is too large. I will remove this change here and test it out in a new PR.

@mhvk (Contributor) left a comment:
I like the cleanup, though I'm not surprised it doesn't have that much of an effect on speed. A suggestion in-line for an extra (if very minor) performance boost.

Also, I'd suggest doing just the removal of _assert_stacked_square here.


```python
w = gufunc(a, signature=signature)
return w.astype(_realType(result_t), copy=False)

def _convertarray(a):
```
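As an aside on the snippet above: `astype(..., copy=False)` returns the input array itself when the dtype already matches, which is what makes it cheap on the no-conversion path. A quick illustration (standalone, not part of the PR):

```python
import numpy as np

w = np.ones(4, dtype=np.float64)

# dtype already matches: no copy is made, the same object comes back
same = w.astype(np.float64, copy=False)

# dtype differs: a converted copy is still made despite copy=False
changed = w.astype(np.float32, copy=False)

assert same is w
assert changed is not w
```

This is the behavior step 3 of the plan relies on when avoiding copies for scalar results.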
Contributor:
Nice catch that this is not actually used!


```python
def _assert_stacked_square(*arrays):
    for a in arrays:
        if a.ndim < 2:
```
Contributor:
I like the combination. If one really wants to get the most out of it, it could be

```python
try:
    m, n = a.shape[-2:]
except ValueError:
    raise LinAlgError(f"{a.ndim}-dimensional...") from None
if m != n:
    ...
```

This uses the fact that these days try/except has no cost if no exception is raised.
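The suggestion above can be fleshed out into a self-contained sketch of the combined check. This is an illustration, not the exact code merged in the PR; the error messages are assumptions based on numpy's usual wording:

```python
import numpy as np
from numpy.linalg import LinAlgError


def assert_stacked_square(*arrays):
    # Combined dimensionality + squareness check. Unpacking a.shape[-2:]
    # raises ValueError for 0-d and 1-d inputs, and the try/except costs
    # nothing on the common, non-raising path.
    for a in arrays:
        try:
            m, n = a.shape[-2:]
        except ValueError:
            raise LinAlgError(
                f"{a.ndim}-dimensional array given. Array must be "
                "at least two-dimensional") from None
        if m != n:
            raise LinAlgError(
                "Last 2 dimensions of the array must be square")
```

For a valid input like `np.eye(3)` this falls straight through; a 1-d input fails the tuple unpacking and is converted to a `LinAlgError`.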

Contributor:
Sorry, pushed this after submitting the review - above is the most relevant comment! (and not very relevant at that!)

Contributor (Author):
With your suggestion, _assert_stacked_square is about 10% faster; I updated the PR accordingly.

@mhvk (Contributor) left a comment:
Looks good to me, thanks! Let's get it in.

@mhvk mhvk merged commit 422ca44 into numpy:main Apr 9, 2025
69 of 71 checks passed
@eendebakpt (Contributor, Author):
Thanks for reviewing! Next PR is #28686

MaanasArora pushed a commit to MaanasArora/numpy that referenced this pull request Apr 11, 2025
…8649)

* ENH: Improve np.linalg.det performance

* Update numpy/linalg/_linalg.py

* revert change to complex detection

* use suggestion

* whitespace

* add more small array benchmarks

* trigger build
