Adjoint matmul in QobjEvo by amilsted · Pull Request #2802 · qutip/qutip

amilsted · 2026-01-14T00:14:26Z

This PR introduces a matrix-form master-equation solver variant, which has dramatically better performance when operators are dense, helping to close the gap to other packages such as QuantumOptics.jl and dynamiqs. It also adds some new "cast-complex-to-double" C++ low-level matmul routines, inspired by existing qutip matvec code, that vectorize better than the existing cython code (at least they are significantly faster on my arm Macbook Pro and, for CSR matrices, help to bring performance closer to the CSC routines used by QuantumOptics.jl).

The matrix-form master-equation solver introduces a new "rhs" class LindbladMatrixForm that uses matmul operations to implement the action of the Liouvillian on a density matrix without constructing explicit superoperator matrices. For dense operators of dimension n, this action scales as O(n^3) versus O(n^4) for the current "explicit superoperator matrix" approach. I have also found cases where using CSR representation with the matrix-form solver is faster than both CSR explicit superoperators and fully-dense matrix-form solves.

The matrix-form solver is currently opt-in via the mesolve API and is also usable via the new MESolverMatrixForm Solver class.

I am happy to provide benchmark code for both claimed improvements.

I worked with gen AI tools to produce this PR. I reviewed AI-generated changes in detail myself and iterated on the changes to ensure a good fit with existing paradigms in the qutip codebase and its tests.

I realize this is a fairly significant addition and fully expect to make possibly major adjustments during the review process! For example, should it be "LindbladMatrixForm" or "LiouvillianMatrixForm"? : )

Update: I split the original PR into two. This PR now just introduces underlying matmul routine changes. A second PR will bring in the master-equation solver.

- Add C++ implementations: matmul_csr_dense, matmul_diag_vector - Update Cython interface with new function declarations - Add matmul_dag_dense_dia_dense for adjoint diagonal operations - Update test_mathematics.py with test refactoring

- Add matmul_data and adjoint_rmatmul_data methods - Support scale parameters for efficient accumulation - Update _brtensor.pyx for signature compatibility

- Implement matmul_data for efficient matrix multiplication - Add adjoint_rmatmul_data for right multiplication with adjoint - Support scale parameters for accumulation operations - Update test_qobjevo.py with test refactoring

pmenczel

Hi @amilsted, thank you very much for your contribution. It will take time to look at this in detail, but it looks like very good and thorough work. Benchmark code would be nice to have (and could potentially be repackaged into a tutorial notebook at some point?)

Storing superoperators not as a big Liouvillian matrix but in a more "analytic" form has been discussed internally for a while, and is a big goal for one of the next major releases. That would be a deeper change, applying not only to Lindblad-type superoperators... however, it will take some time. I imagine that it would make sense to add your feature to QuTiP now, even if it might become obsolete at some later time. We need to have some discussion within the admin team.

The following are just a few things I noticed scrolling through the code for the first time. (Also, please have a look at the warnings generated in the test runs; we treat warnings as errors.)

amilsted · 2026-01-15T18:37:57Z

Looks like tests are failing because of warnings. I added coverage for the in-place behavior of the matmul routines, including cases where input and output matrices have different C/Fortran ordering choices (some of the existing code was actually broken for this case). These paths are slower and currently warn. Maybe these should actually be error paths? In any case, at the moment it looks like I just need to teach the tests to expect warnings here.

amilsted · 2026-01-15T18:53:05Z

Storing superoperators not as a big Liouvillian matrix but in a more "analytic" form has been discussed internally for a while, and is a big goal for one of the next major releases. That would be a deeper change, applying not only to Lindblad-type superoperators... however, it will take some time. I imagine that it would make sense to add your feature to QuTiP now, even if it might become obsolete at some later time. We need to have some discussion within the admin team.

Thank you for taking a look! It did occur to me that it might make sense to have a general matrix-form Liouvillian class. I hope that the matmul op work here is useful for future changes along these lines. It wouldn't be hard to generalize LindbladMatrixForm to be a general superoperator class. Could keep the same matmul efficiency if we e.g. handle [H_nh, None] and [None, H_nh] by treating None as a noop identity.

amilsted · 2026-01-15T18:54:19Z

Maybe I should also add: the explicit superoperator form for the Liouvillian is still often the fastest for sparse martrices, a according to my benchmarks, so you may not want to drop it entirely!

Ericgig · 2026-01-15T19:13:16Z

@amilsted
With the size of the PR, would you accept to split it into chunk for easier review?
As a first step, only include the core changes with tests, (except the new lindblad_matrix_form.pyx.)
This should cut it in about half.

Ericgig · 2026-01-15T20:58:32Z

Thank you for adding mixed C, F order in the tests.

It would be good to have tests that check the warnings are properly raised:
with pytest.warns(OrderEfficiencyWarning) as warning:

I don't think they should be error.
The design of the low level function is that it should just work, even if not optimal.

amilsted · 2026-01-16T22:53:57Z

Thank you for adding mixed C, F order in the tests.

It would be good to have tests that check the warnings are properly raised: with pytest.warns(OrderEfficiencyWarning) as warning:

I don't think they should be error. The design of the low level function is that it should just work, even if not optimal.

I added some code to just swallow these warnings in all cases. It's currently difficult to detect when they should or shouldn't occur, but I can work on doing that if you prefer.

I also did some refactoring as I wasn't quite happy with how the tests were structured. Now there is a separate test variant with its own id for out-of-place as well as each in-place ordering.

amilsted · 2026-01-16T23:35:08Z

@amilsted With the size of the PR, would you accept to split it into chunk for easier review? As a first step, only include the core changes with tests, (except the new lindblad_matrix_form.pyx.) This should cut it in about half.

I have now done this. This PR now only includes the matmul stuff. I will make a second for the mesolve stuff. Would you like me to do that now or wait until we are done with this?

For now, I have put the remaining changes here: amilsted#1

amilsted · 2026-01-21T21:51:29Z

Added some benchmark results in a branch here: https://github.com/amilsted/qutip/tree/matmul_bench

See the full results here: https://github.com/amilsted/qutip/blob/matmul_bench/BENCHMARKING.md

In matmul microbenchmarks:

CSR matrix-matrix operations get significantly faster on Linux x86_64 and Apple Silicon (15-30%). This is important for the matrix-form solver in the upcoming second PR!
CSR matrix-vector operations get significantly faster on Apple Silicon (15-20%) and are about the same on x86_64 (and shouldn't change as the SSE specialized routine is still used unchanged here)
DIA matrix-matrix and matrix-vector gets significantly faster on Apple Silicon (15-30%). Matrix-vector is somewhat slower on x86_64 (~5%) and matrix-matrix is about the same. The slowdown seems to be due to use of intermediate variables to avoid writing to vectors all the time in the inner loop. This significantly helps Apple Silicon (more registers?), but slightly hurts x86_64.

In JC model dynamics (sesolve and vectorized mesolve, so only exercising mat-vec routines):

~5% performance improvement for CSR on Apple Silicon, up to ~15% with DIA.
Essentially no change on x86_64 for most cases. Perhaps a small slowdown for very small Hilbert spaces, but could be noise.

Ericgig

Thank you for the benchmarks. I will try running them tomorrow.

nwlambert · 2026-01-22T06:16:32Z

Just wanted to add, thanks for this, looks cool! I had a quick question, I remember looking at what dynamiqs were doing a while back, and they had a nice trick in the Lindblad construction to slightly reduce the number of matrix-matrix multiplications, at the the cost of some small numerical error (https://github.com/dynamiqs/dynamiqs/blob/fdc6c8913bba2b0a03eb0ad314ca01b5a519030b/dynamiqs/integrators/core/diffrax_integrator.py#L297 ). Just curious if you tried this in your benchmarks, and if it was worthwhile.

amilsted · 2026-01-22T18:22:47Z

Just wanted to add, thanks for this, looks cool! I had a quick question, I remember looking at what dynamiqs were doing a while back, and they had a nice trick in the Lindblad construction to slightly reduce the number of matrix-matrix multiplications, at the the cost of some small numerical error (https://github.com/dynamiqs/dynamiqs/blob/fdc6c8913bba2b0a03eb0ad314ca01b5a519030b/dynamiqs/integrators/core/diffrax_integrator.py#L297 ). Just curious if you tried this in your benchmarks, and if it was worthwhile.

Oh man! Yes, I had come across this, but only vaguely remembered it existed. I think it's worth doing for the second PR :)

Slight changes to the number of matmul ops can indeed determine the answer to "which solver variant is best". There are many cases where it is a close call!

Ericgig

I am still running benchmarks, but I am happy with everything except the tests for cases with inputs. Auto detection from signature and adding an argument that you must check that it's not used anywhere except one specific case is not a clean solution.

Ericgig · 2026-01-22T20:35:45Z

+    # Check if op supports out parameter and out_type is Dense.
+    # If so, we will add in-place test variants for both C and F ordered
+    # output arrays.
+    import inspect
+    supports_out = False
+    if out_type is Dense:
+        try:
+            sig = inspect.signature(op)
+            supports_out = 'out' in sig.parameters
+        except (ValueError, TypeError):
+            # Can't introspect (e.g., built-in function), assume no out support
+            pass
+
+    # Determine out_prealloc values
+    out_prealloc_values = [None, "C", "F"] if supports_out else [None]
+


This is not the way...
This will cause problem if the out argument was ever used in another way or named something else.
It would also skip test were the output is not Dense ,cupy , jax, cuQuantum backends uses non-Dense states in our solvers and we should test them too.

Why not use TernaryOpMixin (or ScaledTernaryOpMixin) with out as simply the third input:
the operation is really A @ B * scale + out. With the output being the same instance as out a bonus.
The function tested will have to be wrapped since scale comes before out (or we could inverse them in the functions).
It would allow to add tests form other specialization easily (says we want to test with a CSR out) and keep test_mathematically_correct and cases_type_shape_product simpler.

Point taken. That sounds like a better approach!

I have done this now with a InPlaceMatmulMixin, since the semantics of TernaryOpMixin were a bit different. It does seem a lot cleaner. What do you think?

This time I have verified that the rest of the tests also pass :)

Ericgig

Benchmarks look good on my side.
If you could just remove the unused op_matmul, it's ready.

I saw you removed the towncrier entry. Knowing that there is a follow up PR, it's fine, but this alone is lot of work and deserve a mention.

coveralls · 2026-01-27T14:40:38Z

coverage: 86.98%. first build
when pulling 428e9b6 on amilsted:matrix_mesolve_pr
into 7589bf3 on qutip:master.

amilsted · 2026-01-27T20:52:51Z

Benchmarks look good on my side. If you could just remove the unused op_matmul, it's ready.

I saw you removed the towncrier entry. Knowing that there is a follow up PR, it's fine, but this alone is lot of work and deserve a mention.

Removed the op and added the towncrier entry.

Ashley Milsted added 3 commits January 13, 2026 15:39

Add enhanced matmul implementations with tests

18063b6

- Add C++ implementations: matmul_csr_dense, matmul_diag_vector - Update Cython interface with new function declarations - Add matmul_dag_dense_dia_dense for adjoint diagonal operations - Update test_mathematics.py with test refactoring

Enhance Element class with new matmul methods

1020550

- Add matmul_data and adjoint_rmatmul_data methods - Support scale parameters for efficient accumulation - Update _brtensor.pyx for signature compatibility

Add matmul_data and adjoint_rmatmul_data to QobjEvo

d2c18f3

- Implement matmul_data for efficient matrix multiplication - Add adjoint_rmatmul_data for right multiplication with adjoint - Support scale parameters for accumulation operations - Update test_qobjevo.py with test refactoring

pmenczel reviewed Jan 15, 2026

View reviewed changes

Comment thread qutip/solver/mesolve.py Outdated

Comment thread doc/guide/dynamics/dynamics-options.rst Outdated

Comment thread qutip/core/cy/_element.pyx

Comment thread qutip/core/cy/lindblad_matrix_form.pyx Outdated

Comment thread qutip/solver/mesolve.py Outdated

Ericgig reviewed Jan 15, 2026

View reviewed changes

Comment thread qutip/core/cy/_element.pyx Outdated

Comment thread qutip/core/cy/_element.pyx Outdated

Comment thread qutip/core/cy/_element.pyx

Comment thread qutip/core/data/matmul.pyx

Comment thread qutip/core/data/matmul.pyx Outdated

Fix PEP8

602c9fb

amilsted force-pushed the matrix_mesolve_pr branch from 7c64c4b to 602c9fb Compare January 16, 2026 19:52

Suppress OrderEfficiencyWarning

3f36767

amilsted changed the title ~~Matrix-form master-equation solver~~ Adjoint matmul in QobjEvo Jan 16, 2026

Ashley Milsted added 2 commits January 16, 2026 13:52

Fix build

5ff7484

Refactor matmul tests to better use parameterization.

6501567

Ashley Milsted added 3 commits January 16, 2026 15:24

Use matmul_dag

439e247

Fix tests

1e210ba

remove unicode daggers

5c18ce3

more daggers

91965bb

amilsted mentioned this pull request Jan 17, 2026

Matrix mesolve amilsted/qutip#1

Draft

Ashley Milsted added 2 commits January 19, 2026 17:38

Special-case scale==1 DIA routines to restore perf

3bb86b1

restore informational comments

7e7dcf4

Ericgig reviewed Jan 21, 2026

View reviewed changes

Comment thread qutip/core/cy/_element.pyx Outdated

Ericgig reviewed Jan 22, 2026

View reviewed changes

Ericgig reviewed Jan 23, 2026

View reviewed changes

Comment thread qutip/core/data/src/matmul_diag_vector.cpp Outdated

Ericgig reviewed Jan 23, 2026

View reviewed changes

Comment thread qutip/core/data/matmul.pyx

Ashley Milsted added 7 commits January 23, 2026 15:36

Switch to using a ternary op class

32c9b53

Make intermediate vars conditional on arm64

b9b3d4c

Merge ternary in-place matmul test refactor

89e7aae

NULL -> None

f159bee

cast coeff before *

f19d5b4

revert to tmp-buffer approach for DIA matmul scaling

3652b16

Remove now-unused scaling branches

dfaada9

amilsted commented Jan 26, 2026

View reviewed changes

Comment thread qutip/core/data/matmul.pyx

Remove duplicate def

9dc6db8

Ericgig approved these changes Jan 27, 2026

View reviewed changes

Comment thread qutip/core/cy/_element.pyx Outdated

Comment thread qutip/tests/core/data/test_mathematics.py Outdated

Ashley Milsted added 3 commits January 27, 2026 12:04

remove vestigal op_numpys

70ac9e3

formatting

198cd50

towncrier

428e9b6

Ericgig merged commit 860bc98 into qutip:master Jan 28, 2026
15 of 16 checks passed

amilsted mentioned this pull request Jan 28, 2026

Matrix-form master-equation solver #2811

Merged

Uh oh!

Conversation

amilsted commented Jan 14, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pmenczel left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

amilsted commented Jan 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

amilsted commented Jan 15, 2026

Uh oh!

amilsted commented Jan 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Ericgig commented Jan 15, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Ericgig commented Jan 15, 2026

Uh oh!

amilsted commented Jan 16, 2026

Uh oh!

amilsted commented Jan 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

amilsted commented Jan 21, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Ericgig left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

nwlambert commented Jan 22, 2026

Uh oh!

amilsted commented Jan 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Ericgig left a comment

Choose a reason for hiding this comment

Uh oh!

Ericgig Jan 22, 2026

Choose a reason for hiding this comment

Uh oh!

amilsted Jan 23, 2026

Choose a reason for hiding this comment

Uh oh!

amilsted Jan 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Ericgig left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

coveralls commented Jan 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

amilsted commented Jan 27, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

amilsted commented Jan 14, 2026 •

edited

Loading

pmenczel left a comment •

edited

Loading

amilsted commented Jan 15, 2026 •

edited

Loading

amilsted commented Jan 15, 2026 •

edited

Loading

amilsted commented Jan 16, 2026 •

edited

Loading

amilsted commented Jan 21, 2026 •

edited

Loading

amilsted commented Jan 22, 2026 •

edited

Loading

amilsted Jan 24, 2026 •

edited

Loading

coveralls commented Jan 27, 2026 •

edited

Loading