ENH: ensure intent(inplace) does not mangle input array #29929

mhvk · 2025-10-11T20:23:45Z

While looking at trying to store data on instances, I ran into problems with f2py assuming one could just swap data between array instances. This is pretty bad form, but was probably necessary at the time, since we did not yet have WRITEBACKIFCOPY. But now we do. So, this PR uses it.

The commits are

Make the required changes and test them with a new piece of fortran code;
Adjust the tests that check the wrapper directly. This needed a small addition to wrap;
Adjust the documentation;
Changelog entry.

I'm quite pleased with how this turned out!

mhvk · 2025-10-12T00:54:01Z

@charris, @HaoZeke - not sure you are the right ones, but most fixes for f2py recently seem to have been by you, so I was hoping you might be willing to have a look at this fairly involved PR, which makes intent(inplace) behave much more reasonably. Thanks!

seberg · 2025-10-12T08:24:47Z

doc/release/upcoming_changes/29929.improvement.rst

+presumably modified by the fortran routine, is copied back to the
+input routine. This means one no longer has the risk that pre-existing
+views or slices of the input array start pointing to unallocated
+memory.


Sorry, but will keep it at a bit of a fly-by comment for now. This is scary (mostly the old code, it's scary enough that I suspect we should just change it... although I do wonder if we should maybe go towards an error)!
It seems to me that the swapping for outputs would lead to out=float_arr for integer array functions to just swap the float_arr to be an integer one!?

I know, the old way meant that if you put in an integer array, your own instance would become a float array! With my new mechanism, you'd keep an integer array, with the copied-back float stuff truncated and possibly overflowing. So, definitely an aspect of "don't do that, then!".

I do think it is correct that if you put in a float32 to a fortran routine taking only float64, you just get reduced precision output (instead of your array becoming float64!). It is like the ufunc would do it (with the same mechanism, I think...).

Anyway, much of this predates having machinery to deal with these cases, and I find it quite remarkable how well f2py has held up -- I've quite happily used it myself when I started with python, and wanted not to lose all the fortran code I had written.

At first glance, I think this was a long standing problem that prevented full API compliance. Or maybe I am thinking of another one where data pointers were swapped ...

mhvk · 2025-11-11T02:16:30Z

@HaoZeke - would you have time to look at this PR for f2py. I think it does solve a longstanding issue (and is needed for some of my other work... we really shouldn't be mangling arrays, swapping pointers to memory, etc.). I'm also fairly confident the code itself is right, but would want to be sure the changes I made to the documentation are useful/good.

seberg · 2025-11-12T18:55:23Z

We just brought it up, and I think the main reason this might fail I think is that before there was never any casts, out was basically ignored (i.e. replaced) if it doesn't fit perfectly.
But with this, we cast the result to the given out with its dtype.

That is clearly better, but will it break downstream code? We could also just disable inplace or just raise an error when the dtype doesn't match and a copy would be needed?

To me that last bit makes sense anyway, the user wanted in-place and didn't get it? That way we don't have to worry about changed behavior only about new errors.

mattip · 2025-11-12T18:55:28Z

We discussed this at a recent triage meeting and would prefer to just disallow this whole construct. Is that possible? There were some other opinions too.

HaoZeke · 2025-11-12T19:28:04Z

Sorry for being AWOL I have penciled time in this weekend to review this properly and test some older projects.

mhvk · 2025-11-12T19:40:20Z

Yes, seems certainly to make sense to error on bad cases, I should have considered that...

I'll wait with doing anything until @HaoZeke has had a chance to have a look, but FWIW, removing the option to have "inplace" would seem bad, in that it would somewhat gratuitously break code that worked perfectly well. However, refusing to run when the dtype changes seems reasonable, i.e., there would be a run-time failure if you put in, e.g., an integer array when a float was expected (compiled with "inplace").

The question then arises what we consider a "dtype change". Effectively, this is a casting question, and it seems one would definitely like to allow "equiv" (byte-order changes). The "safe" option is not relevant, since casts will happen in both directions. I think one would also want to allow "same_kind", as it seems OK to put in, e.g., a float32 array in float64 code (or vice versa), and it would probably not materially affect the result. But I think we would then error on anything requiring "unsafe". (Ideally, I guess it would be an option, but I do not know how to implement that...)

p.s. I do hope we can make a decision that does not, effectively, prevent changes to how ndarray organizes its associated memory, though! Ideally, those become proper implementation details...

seberg · 2025-11-13T08:07:10Z

Yeah, we should move on, it's just would be nicer to know how much and how badly it might break things. It seemed like this isn't used much (thus why "do we need this" came up).

I tend towards it being OK to require exact (equivalent) dtype. I do think we do it in a few places for out= parameters (but would have to try to remember where).
UFuncs are a bit different, in that they at least use buffered casts, so there is no full copy with out=.

To me an "in-place" use-case should almost always be truly in-place anyway and it may actually surprise some users if it's allowed to not be, so I would tend towards the easy way out.

We could deprecate it even, but with f2py being transient and this apparently almost never used (and most use-cases probably unaffected), I would be happy to try to flip the switch for 2.5. (We should add a release note).

mhvk · 2025-12-10T14:47:00Z

@HaoZeke, @seberg - I've updated my fix to intent(inplace) to ensure the input array is the same kind as the array needed by the fortran routine. I decided against enforcing strictly the same dtype, as I worried that greatly increased the likelihood of breaking existing code (in particular, for 32 and 64 bit float and int). Obviously, happy to change it if you think it is better (e.g., we could explicitly allow only 32 and 64 bit versions, not 8 and 16, which are much less likely to have been used in any fortran code, and much more likely to give wrong results).

Note that I think there are other pieces that might break. E.g., it doesn't look like the current code checks byte order.

I think it would be reasonable to deprecate the use of intent(inplace) overall, but probably best as follow-up.

seberg · 2025-12-10T16:47:54Z

I would prefer the strict version since that will shout at you in a useful way, while the other one could in principle just do the wrong thing silently.
(The main issue I see is that I am not sure if downstream builds newer versions with newer NumPy versions, allowing for updating a downstream library in a bug-fix release, which is undesireable either way though)

mhvk · 2025-12-10T17:21:04Z

I can certainly do that, but as I mentioned, I worry about breaking reasonable code (float32 code being fed float64 in particular). Let's see what @HaoZeke thinks...

mhvk added 01 - Enhancement component: numpy.f2py labels Oct 11, 2025

mhvk force-pushed the fortranobject-no-array-swap branch from e0cacc6 to 32ed872 Compare October 12, 2025 00:11

seberg reviewed Oct 12, 2025

View reviewed changes

mhvk mentioned this pull request Oct 13, 2025

ENH, PERF: allocate memory as part of the array object for scalars #29878

Draft

seberg added the 56 - Needs Release Note. Needs an entry in doc/release/upcoming_changes label Nov 13, 2025

seberg added the triage review Issue/PR to be discussed at the next triage meeting label Dec 10, 2025

mhvk added 5 commits December 10, 2025 11:00

MAINT: Use WRITEBACKIFCOPY for in-place arrays.

ebfe36b

MAINT,TST,BUG: Adjust tests for using WRITEBACKIFCOPY

e68bd21

DOC: update description of the effects of intent(inplace)

36788f2

DOC: what's new entry

cd745b6

MAINT: Enforce same kind array for intent(inplace)

33bf19c

mhvk force-pushed the fortranobject-no-array-swap branch from cba18c0 to 33bf19c Compare December 10, 2025 16:00

Uh oh!

ENH: ensure intent(inplace) does not mangle input array #29929

Are you sure you want to change the base?

ENH: ensure intent(inplace) does not mangle input array #29929

Uh oh!

Conversation

mhvk commented Oct 11, 2025

Uh oh!

mhvk commented Oct 12, 2025

Uh oh!

seberg Oct 12, 2025

Choose a reason for hiding this comment

Uh oh!

mhvk Oct 12, 2025

Choose a reason for hiding this comment

Uh oh!

charris Oct 12, 2025

Choose a reason for hiding this comment

Uh oh!

mhvk commented Nov 11, 2025

Uh oh!

seberg commented Nov 12, 2025

Uh oh!

mattip commented Nov 12, 2025

Uh oh!

HaoZeke commented Nov 12, 2025

Uh oh!

mhvk commented Nov 12, 2025

Uh oh!

seberg commented Nov 13, 2025

Uh oh!

mhvk commented Dec 10, 2025

Uh oh!

seberg commented Dec 10, 2025

Uh oh!

mhvk commented Dec 10, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants