
Implement cupy.linalg.eig / cupy.linalg.eigvals clone of PR #8854 #8980

Merged
asi1024 merged 37 commits into cupy:main from mfoerste4:eig_eigvals_support
May 29, 2025

Conversation

@mfoerste4
Contributor

This is PR #8854 with an updated main branch
CC @leofang

@leofang leofang added the takeover Pull-requests taken over from other contributor label Feb 24, 2025
@leofang
Member

leofang commented Feb 24, 2025

/test mini

@leofang leofang added the cat:feature New features/APIs label Feb 24, 2025
@leofang
Member

leofang commented Feb 25, 2025

@mfoerste4 found that the OOM issue is due to unguarded calls to the new cuSOLVER routines, which are only available in recent CUDA 12.x (I forget which one; it's in the release notes and the linked issue). The evidence is that the CUDA 12.8 CIs passed. We need to add a version guard like what we did in the past for other cuSOLVER routines.

@asi1024 asi1024 self-assigned this Feb 25, 2025
@leofang
Member

leofang commented Feb 25, 2025

/test mini

@asi1024
Member

asi1024 commented Feb 25, 2025

@mfoerste4 Thank you for taking over the pull request! Considering the _geev function is called repeatedly by the eig function, I think some redundant calculations can be eliminated.

  • My understanding is that work_device_size and work_host_size return the same value for all _geev calls. If that is correct, we could allocate work_device and work_host in advance and reuse them.
  • We can eliminate the copy when concatenating the return values of _geev with cupy.stack. Could you allocate the concatenated memory in advance?
  • Please avoid repeatedly calling all(w.imag == 0.0). It should be checked against the combined result. Also, instead of using the Python primitive all, rewrite it to use the cupy.ndarray method, like (w.imag == 0.0).all().
  • Could you convert the input array to Fortran format in advance before calling _geev?
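The third point above is worth a quick illustration. A minimal sketch, using NumPy as a stand-in for CuPy since the relevant ndarray API is the same: Python's built-in `all` iterates the array element by element (which on a device array would force one device-to-host transfer per element), while the `.all()` method reduces on the device and transfers a single boolean.

```python
import numpy as np  # stand-in for cupy; the ndarray API is identical here

w = np.array([1.0 + 0.0j, 2.0 + 0.0j, 3.0 + 1.0j])

# Python's built-in all() would iterate element by element; on CuPy that
# means one DtoH transfer per element. The ndarray method reduces
# on-device and transfers a single boolean:
is_real = bool((w.imag == 0.0).all())
```

With CuPy, `cupy.ndarray` exposes the same `.all()` reduction, so the pattern carries over unchanged.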

@leofang
Member

leofang commented Feb 26, 2025

We need to add a version guard like what we did in the past for other cuSOLVER routines.

Looks like it's fixed now! (The remaining CI failures are irrelevant.)

I think some redundant calculations can be eliminated.

@mfoerste4 could you kindly take a look at Akifumi-san's comments?

@EarlMilktea
Member

EarlMilktea commented Feb 27, 2025

@mfoerste4 @leofang Sorry for making you wait so long (I've been very busy recently). Please keep me up to date!

@mfoerste4
Contributor Author

mfoerste4 commented Feb 27, 2025

@asi1024, thanks for your response; comments inline.

@mfoerste4 Thank you for taking over the pull request! Considering the _geev function is called repeatedly by the eig function, I think some redundant calculations can be eliminated.

* My understanding is that `work_device_size` and `work_host_size` return the same value for all `_geev` calls. If that is correct, we could allocate `work_device` and `work_host` in advance and reuse them.

I don't think we can make any assumptions about these in general. For stacked computations you are probably right.

* Please avoid repeatedly calling `all(w.imag == 0.0)`. It should be checked against the combined result. Also, instead of using the Python primitive all, rewrite it to use the `cupy.ndarray` method, like `(w.imag == 0.0).all()`.

Yes, I will add that

* Could you convert the input array to Fortran format in advance before calling `_geev`?

I don't think that is possible. IIUC we need each matrix in Fortran format, but each matrix should remain coalesced in memory. The strides for a (B x M x M) input would then be (M*M, 1, M), which is not Fortran style. So I guess the best we can do is the local transpose for each matrix in _geev (we need a copy anyway, as cuSOLVER might overwrite the input).
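The layout argument can be checked numerically. A minimal sketch, using NumPy as a stand-in for CuPy (the stride semantics are the same): per-matrix Fortran order inside a coalesced batch corresponds to a batched transpose view, which is neither C- nor Fortran-contiguous as a whole.

```python
import numpy as np  # the layout reasoning applies equally to cupy arrays

B, M = 2, 3
a = np.zeros((B, M, M))  # C-contiguous batch: element strides (M*M, M, 1)

# Per-matrix Fortran order within a coalesced batch means element strides
# (M*M, 1, M) -- i.e. a batched transpose view, which as a whole is
# neither C- nor Fortran-contiguous:
at = a.transpose(0, 2, 1)
strides_in_elements = tuple(s // a.itemsize for s in at.strides)
```

This is why a single up-front `cupy.asfortranarray` on the batched input cannot produce the layout each individual cuSOLVER call wants.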

* We can eliminate the copy when concatenating the return values of `_geev` with `cupy.stack`. Could you allocate the concatenated memory in advance?

The stacking for multiple _geev computations seems to be broken in the current PR; the tests only cover 2D inputs. For pre-allocation, the same data-layout issue arises as mentioned above.

As this is my first contact with cupy - if you have a proposal on how to implement the stacking I would be happy to hear it.
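For reference, the pre-allocation idea asi1024 suggested can be sketched as follows, again with NumPy standing in for CuPy and `np.linalg.eigvals` standing in for the per-matrix `_geev` call (both names here are illustrative, not the PR's actual code): allocate the batched output once and let each per-matrix call write into its own slice, instead of collecting results and paying an extra copy in a final `stack`.

```python
import numpy as np  # stand-in for cupy

B, M = 4, 3
rng = np.random.default_rng(0)
a = rng.standard_normal((B, M, M))

# Allocate the batched result once, then fill it slice by slice.
# This avoids the extra copy that np.stack(results) would perform.
w_all = np.empty((B, M), dtype=np.complex128)
for i in range(B):
    w_all[i] = np.linalg.eigvals(a[i])  # each call writes into its slice
```

The same pattern works for the eigenvector output by pre-allocating a (B, M, M) array and assigning into `v_all[i]`.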

@mfoerste4
Contributor Author

@asi1024 , I moved the batching into the inner loop and used the pre-allocated output(s) as much as possible. For real input we still need a local instance for real eigenvectors because of the way the cusolver API is designed.

@kmaehashi
Member

@asi1024 kindly ping

@ev-br
Contributor

ev-br commented May 19, 2025

It's great to see this; I'm looking forward to being able to use it!

For the dtype of eigenvalues: it can be argued that numpy's decision to have value-dependent dtype is a mistake. Other array libraries (notably, jax.numpy, pytorch) always return eigenvalues as a complex array, and it's up to a user to decide if they want to check for zero imaginary parts.
So maybe CuPy could consider doing the same and avoid numpy's "return w.real if w.imag==0 else w" contraption.
For completeness, here's a link to an Array API discussion: data-apis/array-api#935
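NumPy's value-dependent behavior is easy to demonstrate: the dtype of the eigenvalue array depends on the computed values, not just the input dtype.

```python
import numpy as np

# Purely real eigenvalues: NumPy casts the result down to float64.
w_real, _ = np.linalg.eig(np.eye(2))

# A 90-degree rotation matrix has eigenvalues +1j and -1j,
# so NumPy returns complex128 for the same float64 input dtype.
w_cplx, _ = np.linalg.eig(np.array([[0.0, -1.0], [1.0, 0.0]]))
```

An always-complex convention (as in jax.numpy and pytorch) makes the output dtype a function of the input dtype alone, which is also what lets a GPU library return without synchronizing.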

@mfoerste4
Contributor Author

@asi1024 , I don't think there are any open items left. Would you mind triggering the main CI tests?

@asi1024
Member

asi1024 commented May 28, 2025

/test mini

@asi1024
Member

asi1024 commented May 28, 2025

@ev-br @mfoerste4

So maybe CuPy could consider to do the same and avoid numpy's "return w.real if w.imag==0 else w" contraption.

Our team concluded that it is acceptable not to change the output array's dtype based on the output values. This is also reasonable in the sense that it avoids DtoH synchronization.

@kmaehashi
Member

Could you skip the eig/eigvals tests on older CUDA using this?

@pytest.mark.skipif(
    cupy.cuda.runtime.runtimeGetVersion() < 12060, reason='Requires CUDA 12.6+')

Also let's uncomment these two lines to list these APIs on docs:

# linalg.eig
linalg.eigh
# linalg.eigvals

@mergify
Contributor

mergify bot commented May 28, 2025

This pull request is now in conflicts. Could you fix it @mfoerste4? 🙏

@mfoerste4
Contributor Author

Thanks @kmaehashi for the review. I have made the requested changes. Please note that the tests had to be adjusted to account for cupy now always returning complex types.
I also changed the host buffer to pinned memory, as suggested by the cuSOLVER documentation.

@kmaehashi
Member

Thanks @mfoerste4! The change looks good to me, let me kick the CI again.
@asi1024 Do you have any other suggestions on this PR?

/test mini

Member

@asi1024 asi1024 left a comment


Sorry for my late review. LGTM!

@asi1024 asi1024 merged commit 1cbf094 into cupy:main May 29, 2025
62 checks passed
@kmaehashi kmaehashi added this to the v14.0.0a2 milestone May 29, 2025

Labels

cat:feature New features/APIs prio:high takeover Pull-requests taken over from other contributor

Projects

None yet

Development

Successfully merging this pull request may close these issues.

7 participants