
Conversation

@seberg
Member

@seberg seberg commented Sep 26, 2013

Current status: Everything works. Some reorganizing is likely necessary, but pending input on that.

@njsmith
Member

njsmith commented Sep 27, 2013

Fast paths are horrible beasts -- they increase the number of code paths that need test coverage (probably a bunch of ours actually aren't tested, since we have no coverage tool to check with), they increase the maintenance burden, often they don't even help, etc.

So just a thought: as you're implementing them, you'll probably want to be systematically running microbenchmarks to validate each one. And maybe it would be good to write these microbenchmarks in the form of vbench benchmarks?
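A minimal `timeit` stand-in for the kind of vbench-style microbenchmark suggested here (the array sizes and index pattern are illustrative, not taken from this PR):

```python
import timeit

import numpy as np

# Time fancy getitem in the small and large regimes -- the two cases
# where a fast path may or may not pay for its extra code path.
for n in (10, 10_000):
    a = np.arange(n)
    idx = np.arange(0, n, 2, dtype=np.intp)
    per_call = timeit.timeit(lambda: a[idx], number=1000) / 1000
    print(f"n={n:>6}: {per_call * 1e6:.2f} us per fancy getitem")
```

A real vbench benchmark would wrap the statement and setup strings instead, but the measurement is the same idea.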

On Thu, Sep 26, 2013 at 4:57 PM, seberg [email protected] wrote:

Current status: Everything works, but there is cleanup to do, new tests to add, and optimizations to make. If anyone is interested, everything that is not GetMap/SetMap could already be reviewed, probably.

The code has a lot of TODOs, some of them marking smaller pending decisions...

- Parsing itself should be fine; cleanup is mostly necessary for MapIter (also fixing binary compatibility)
- Still needs tests for the new deprecations (non-integer array-likes)
- Speed improvements/fast path necessary for SetMap
- Speed improvements/cleanups possibly for GetMap
- New 0-d boolean index support can't be tested/used until the deprecation is done...


You can merge this Pull Request by running

git pull https://github.com/seberg/numpy new-index-machinery

Or view, comment on, or merge it at:

#3798

Commit Summary

ENH: Attempt to rewrite the index parsing.
WIP: handle output dim calculation immediately
WIP/BUG: Add missing index DECREF.
WIP: Smaller fixes/moves
WIP: Clean up the error messages
WIP: Always return a view, never self
WIP: Fix error returns up.
MERGE: include old bug fix lost in merge conflict
WIP: Free view correctly for fancy indexing
WIP: Speed improvements by using EXTERNAL_LOOP
WIP: Test fixups
WIP: Implement fast_take and some other speed things
WIP: Fixups, no-fancy indexes fancy iteration, probably more
WIP: Fixup DeprecationWarnings. Only issue remaining...
WIP,TST: Small fixes for the tests
WIP: Py3k fixes
WIP: Change field check and string check order
WIP,TST: Fixup python3 tests
WIP: really fix the order of hasfields...
WIP,BUG: Do not go too far in PyArray_MapIterNext, fix pyobj size
WIP,TST,DOC: tiny fix
WIP: Move Ellipsis check before subclass special case
WIP: Remove debug comments
WIP: Fix mapping.c warnings

File Changes

M numpy/core/include/numpy/ndarraytypes.h (42)
M numpy/core/src/multiarray/mapping.c (2587)
M numpy/core/src/multiarray/mapping.h (5)
M numpy/core/src/multiarray/scalartypes.c.src (29)
M numpy/core/tests/test_deprecations.py (5)
M numpy/core/tests/test_indexing.py (31)
M numpy/core/tests/test_multiarray.py (8)
M numpy/core/tests/test_numerictypes.py (2)
M numpy/core/tests/test_regression.py (7)
M numpy/lib/tests/test_function_base.py (2)

Patch Links:

https://github.com/numpy/numpy/pull/3798.patch
https://github.com/numpy/numpy/pull/3798.diff

@seberg
Member Author

seberg commented Oct 19, 2013

OK, there is a test failing, but that is not a big deal. I have now replaced the whole MapIter with one that specializes no-subspace (iterating everything in one, even the operand) and subspace iteration (as nested iters). The new MapIter will do iteration order optimization (if an index is visited twice, the order is not guaranteed) and for getitem will not keep the original array order.

The code is now generally faster for larger arrays, though for very small advanced indexing operations it is around 1.5x slower. And it is a lot faster for larger subspaces and non-C-order arrays. The big remaining issue is that shape mismatch errors are even more cryptic than before (they are reported by NpyIter with axis remapping and all...), and I think I need a dedicated function to replace the errors with something human readable.

Anyway, overall, it has converged enough that you can look at it/play with it. Technically I don't think the code will need to change much, unless someone has some comments.
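The duplicate-index caveat above can be made concrete with a small, illustrative snippet (values made up):

```python
import numpy as np

# When a fancy index contains duplicates, the value that "wins" in an
# assignment depends on which write happens last -- and with iteration
# order optimization, that order is no longer guaranteed.
a = np.zeros(3)
a[np.array([0, 0, 1])] = [10.0, 20.0, 30.0]
assert a[1] == 30.0
assert a[0] in (10.0, 20.0)   # which of the two wins is unspecified
```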

@seberg
Member Author

seberg commented Oct 21, 2013

For anyone interested: this has converged quite far. It probably needs a few readability fixes for the MapIterNew code, and new tests are still missing. Also, GetMap/SetMap should probably be condensed (they could get more inner loop specializations too, but I am not really interested in that right now). BUT: it is definitely at a stage where you can look at it, and should be in a state ready for usage. NumPy tests run through fine, SciPy tests run through fine, and the broadcasting errors are now nice and clean.

The only disadvantage of the code is that it is actually slower for fancy indexing with fewer than about 100 elements, because of the longer setup time of the NpyIter. I will probably do some timings to show the differences more clearly.

@seberg
Member Author

seberg commented Oct 25, 2013

I added some fast paths (1-d trivial iteration), and if they can be taken it is insanely fast. Uploaded some timings at http://sebastian.sipsolutions.net/indexing_speedup/. Basically, the new stuff is much faster unless buffering is used (or, for smaller arrays, when the special cases cannot be used, i.e. non-contiguous indexes). Especially non-np.intp indexes are slower unless you are indexing with more than around 1000 elements (this might be slower on Windows; also, I just realized that the typenum == NPY_INTP check might miss np.int64). Still needs tests, I admit. The comparisons are a bit unfair due to inner loop optimizations; maybe I will retest with the fairer complex double type (larger itemsize, which is not specialized yet).
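One practical consequence of the non-np.intp slowdown, shown as a hedged sketch (sizes illustrative): if an index array is reused many times, converting it to np.intp once up front avoids the slower path on every lookup.

```python
import numpy as np

a = np.arange(10_000, dtype=np.float64)
idx32 = np.arange(0, 10_000, 7, dtype=np.int32)  # not the native index type
idx = idx32.astype(np.intp)                      # one-time conversion

# Both index types give identical results; only the speed differs.
assert np.array_equal(a[idx32], a[idx])
```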

@pv
Member

pv commented Oct 25, 2013

What happens with indexing with integer scalars? Maybe out of scope for this PR, but I think that case has quite severe overheads.

@seberg
Member Author

seberg commented Oct 25, 2013

It doesn't have any severe overheads that I am aware of (maybe a bit of it was fixed when I sped up the PyArray_IntpFromPyInt parsing function). Indexing with only integers (the full integer special case) is a tiny inkling slower, I think, simply because the old machinery did a special case early on, but that is negligible. Slicing is faster. If scalar indexing (assuming non-fancy here; scalar fancy indexes I optimized away) is slow, it probably needs checking whether PyArray_GetItem can be improved, but I doubt it.

@pv
Member

pv commented Oct 25, 2013

The speedups here look great. What I'm a bit concerned about is the test coverage. It would be very useful if it were possible to run this through some C code coverage tool to see whether all branches are covered.

@seberg
Member Author

seberg commented Oct 25, 2013

The current test coverage is actually not too bad, except maybe for things like non-native byte order, object references, etc., and broadcasting in assignments. I know I have to add some tests there. I uploaded a few timings for simple slicing (basically what I said). You are indeed right about one thing: the current master (and probably also 1.8.x) has a performance bug when indexing one_dimensional_arr[(1,)] (a one-element tuple into a one-dimensional array; arr[1] is fast); that is fixed, of course. Indexing an object array is half as fast as indexing a Python list, and I am not sure we can expect much more.
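For context, a one-element tuple index is by definition the same operation as the bare scalar index, so the two forms should perform identically; the bug was purely a slow path, not a semantic difference:

```python
import numpy as np

arr = np.arange(5)

# arr[(1,)] and arr[1] are the same index expression by definition.
assert arr[(1,)] == arr[1] == 1
assert arr[(1,)].shape == arr[1].shape == ()
```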

@pv
Member

pv commented Oct 25, 2013

gcov points out a couple of code paths never run (apart from memory allocation error handling), see gh-3979, but coverage looks quite OK.

@seberg
Member Author

seberg commented Oct 25, 2013

Thanks a lot, I will have a look at that and add tests for those things that are missing.

@seberg
Member Author

seberg commented Oct 26, 2013

Hmmm, @pv, maybe you know what's going on: your coverage commands don't give any output for me. Is gcc 4.7.3 just too old? I am on Ubuntu, so I somewhat expected it to just work. EDIT: What I mean is, of course, that the .gcda gcov files are not created.

@seberg
Member Author

seberg commented Oct 26, 2013

Nvm that. It ran through fine after I deleted the build directory; dunno, maybe the script just searched the wrong paths.

@seberg
Member Author

seberg commented Oct 28, 2013

Btw., since 1.8 is soon out ;)... This is pretty much done from my side, except some new deprecation tests. I am sure things will crop up, but it is ready to start review, in case someone wants to read 2500 or so lines of code :) (fortunately they are very linear, except MapIterNew, I think).

Member

@seberg can you try to keep the existing struct ABI? If possible, we'd like to keep ABI compatibility in the 1.x branch.

Member Author

I hope I did keep everything ABI compatible (well, everything that is reasonable to use). This is used in two different tests, one for ufunc.at and another more specific one, but as of yet it is only tested for API compatibility, not ABI. I am aware it still needs checking. I think Theano is probably the only project using this API (it was only exposed in 1.8, and that is not even officially released yet).

Contributor

This structure was already exposed in at least 1.7, possibly much earlier.

Member Author

Maybe the structure, but MapIterArray was not. So nobody could possibly have used it before 1.8.x (though I admit it has possibly been in master for a year).

Contributor

Through which functions is it used in 1.8? I could only find PyArray_MapIterNew, which is not exposed.

If it is private, we should move it out of the public header into a private one.

Contributor

For 1.8 we could at least deprecate PyArrayMapIterObject via comments in the code, to avoid new code using it.
What about these:
PyArrayIterObject
PyArrayMultiIterObject
PyArrayNeighborhoodIterObject
Can they be deprecated? The first two are probably more commonly used, so deprecation of direct access might be premature without existing get/set methods.

Member

I've already tagged 1.8 and am waiting for Ralf to do the binaries. If we get this figured out, we could maybe do a 1.8.1, but I'd rather figure out something for 1.9. This doesn't fall into the category of a bug fix, and it isn't clear to me that we yet have a consensus on what should be done, or for that matter, what has already been done. For instance, IIRC, PyArrayNeighborhoodIterObject goes back several releases. All that makes me loath to stop the 1.8 process at this point.

Contributor

Yes, let's figure something out for 1.9 and not delay 1.8 any longer.

Member Author

I know, nobody really has time for this, probably :). But I just tested multiarray_tests.test_inplace_add compiled with master and ran it with this branch, and it works. So I am pretty confident that it is binary compatible for the usage which makes sense, unless there are some subtleties on other architectures or compilers or so.

Member

@juliantaylor @seberg

PyArrayIterObject
PyArrayMultiIterObject
PyArrayNeighborhoodIterObject

I haven't checked the others yet, but it looks like PyArrayNeighborhoodIterObject already has accessor functions, so we could move it into a deprecated file, I think. That should not break anyone's code unless we start moving the internals around, at which point folks will probably need to recompile, as the functions are inline, i.e., the accesses to struct internals will still be there. So we can guarantee API but not ABI. That is already the case for PyArrayObject, but I don't know if that is widely publicised. It may be that we need to make that a policy, because otherwise we are frozen.

@pv
Member

pv commented Oct 28, 2013

@seberg gcov needs a clean rebuild; distutils can't figure out that it needs to rebuild itself.

@charris
Member

charris commented Dec 22, 2013

Time to get this moving again. I think a first step would be to deprecate direct access to the struct internals, but that would also imply having a compatible fallback, which would vitiate this work. Hmm, we really need to know whether people are already accessing the internals, and I don't know how to do that without a deprecation. Thoughts?

@seberg
Member Author

seberg commented Jan 25, 2014

I wouldn't mind resurrecting this... People most definitely are accessing the internals; however, I hope (and actually think it is most likely) that "people" is limited to Theano and possibly one more guy, and if nobody tries to be overly smart using it, then this does not break compatibility (I am sure Theano is not).

The struct should only be accessed for two reasons (which is how the examples you can find use it), I think: 1. to get the operand array (mostly for the dtype), and 2. to check whether transposing is necessary. Both of these could easily be moved into a function (or moved into the transpose function itself). However, for those things I did keep binary compatibility. I know it is a bit hairy, and we should deprecate the access in any case.

Btw. if someone else feels like starting to review, I could move the branch to the numpy repository so that you can just push some changes directly.

@juliantaylor
Contributor

Have you already tried running some reverse dependencies against the branch? E.g. pandas, pytables, scipy, h5py.

@seberg
Member Author

seberg commented Jan 25, 2014

I ran the scipy tests before; they were fine. I just ran pandas and it is fine, except that it found a pretty fat bug in this code, which is fixed now.

@charris
Member

charris commented Jan 30, 2014

@seberg Where is this now? You have quite a few PRs that I'd like to go through, but I'm not sure which ones you consider more or less ready.

@seberg
Member Author

seberg commented Jan 30, 2014

This one is good for review. (The doc one I didn't update; the npyiter one still needs action; the boolean subtract one mostly needs a ping of the mailing list again to make sure we really want it, I think; the dtype shape one also needs a decision on whether it is the right way or not.)

Member

Is this consistent with the rules?

In [5]: array(1)[(True,)]
Out[5]: array([1])

In [6]: array(1)[[True]]
---------------------------------------------------------------------------
IndexError                                Traceback (most recent call last)
<ipython-input-6-fc18d566cf7d> in <module>()
----> 1 array(1)[[True]]

IndexError: too many indices for array

Member

It is a bit curious that scalar booleans return 1D arrays

In [30]: a[True]
Out[30]: array([1])

Member Author

Hah, I first thought it looked like a bug too, but it is actually meant like that. I agree, it is non-obvious why this should be right, but of course I am convinced it is ;). Here is an example of why:

def filter_subarrays_with_large_sum(array, large):
    """
    arrays : ndarray {..., N}

    out : ndarray {1, N}
    """
    large_sum = array.sum(-1)
    return array[large_sum >= large]

With this change, the result consistently adds one dimension even if array is one-dimensional, and the 0-d change follows the same rule: no dimension is removed (the boolean index is 0-d), but, as always with a boolean index, one dimension is added. That said, it might be a large change.
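A small illustration of this rule as it behaves in current NumPy (array values made up):

```python
import numpy as np

# A boolean scalar index removes zero dimensions (it is 0-d) and, like
# any boolean index, prepends one dimension for the selected elements.
a = np.array([1, 2])
assert a[True].shape == (1, 2)    # "row" selected
assert a[False].shape == (0, 2)   # nothing selected
assert np.array_equal(a[True], a[None])
```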

@seberg
Member Author

seberg commented Feb 7, 2014

Hmmm, I didn't look too long, but I don't quite understand the segfault. I can reproduce it on 3.3 (though not with valgrind) and see the backtrace in gdb, but the PyUString_ConcatAndDel(&errmsg, tmp); call has NULL checks in place, unless something has a wrong refcount...

@seberg
Member Author

seberg commented Feb 7, 2014

Ok, fixed that bug... I forgot to fix the convert_shape_to_string name in common.h, and somehow that destroyed the return pointer.

@SylvainCorlay

0-d arrays are treated as a very special case:

exp(array(1.0)).__class__        # returns numpy.float64  instead of numpy.ndarray
logical_not(array(1)).__class__  # returns numpy.bool_    instead of numpy.ndarray
(array(1.0) > 0).__class__       # returns numpy.bool_    instead of numpy.ndarray

For the latter case, the fix seems to allow 0-d arrays to be indexed by booleans rather than fixing the previous behavior, making 0-d arrays even more special.

Would it be possible instead to make generalized functions of 0d arrays return 0d arrays? (See issue #1421)

@seberg
Member Author

seberg commented Feb 7, 2014

This does not add a special case for 0-d arrays being indexed; it changes the treatment of boolean scalars in indexing to be more sensible. Sure, changing the ufunc behaviour would remove the necessity for it, but even then it would make sense in my opinion (why should arr[True] and arr[np.array(True)] do different things?).

So in my opinion it is simply a different issue (and one that I still think might be quite hairy and complex to change).
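The consistency argument can be checked directly in current NumPy (array values illustrative): a Python bool and a 0-d boolean array behave identically as an index.

```python
import numpy as np

arr = np.arange(3)
# Both forms prepend one dimension and select everything for True.
assert np.array_equal(arr[True], arr[np.array(True)])
assert arr[True].shape == (1, 3)
```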

@charris
Member

charris commented Feb 9, 2014

If there are no more comments in the next day I'm going to put this in.

@charris
Member

charris commented Feb 9, 2014

OK, let's see how this plays. @seberg Adding functions to access the struct internals would be good so we can start down the long road to hiding them.
