Improve small reduction performance #4186

juliantaylor · 2014-01-11T01:19:23Z

See commit messages for some details.

intern the constant strings, was accidentally not done in the original PR.
avoid a __numpy_ufunc__ string attribute lookup on pure python types which are very expensive
don't use keyword arguments for the reduction wrappers np.sum etc. Non keyword arguments parse a lot faster with the regular python function. This is an alternative to ENH: speed up ufunc reduce argument parsing #4174 which improves keyword parsing in general but has higher complexity. The keyword arguments one of the major reasons np.sum is so much slower than np.add.reduce (4us vs 2.5 us).

The last point fails a __numpy_ufunc__ test which I think is a bug in the logic of the override.
The override assumes all non input and output arguments are passed in via keyword arguments and then uses the ufunc number of inputs to consider all following arguments as output.
The last commit breaks both assumptions, out is not provided via a keyword and the reduction only has one input and not two as the non reduction types.
The effect is that you get a tuple [dtype, out, keepdims] as the out argument of the ufunc (axis is missing as nin is not treated right for reduce).

This looks to me like a bug, but I'm not very familiar with the override, please advise.

juliantaylor · 2014-01-11T01:22:43Z

@pv @cowlicks comments on the override issue?

cowlicks · 2014-01-12T16:44:16Z

This does appear to be bug with __numpy_ufunc__. When reduce is called we check for __numpy_ufunc__ and pass the number of inputs to PyUfunc_CheckOverride as ufunc->nin, which is incorrect. See here we have:

static PyObject *
ufunc_reduce(PyUFuncObject *ufunc, PyObject *args, PyObject *kwds)
{
    int errval;
    PyObject *override = NULL;

    errval = PyUFunc_CheckOverride(ufunc, "reduce", args, kwds, &override, 
                                   ufunc->nin);
    if (errval) {
        return NULL;
    }
    else if (override) {
        return override;
    }
    return PyUFunc_GenericReduction(ufunc, args, kwds, UFUNC_REDUCE);
}

charris · 2014-01-15T03:43:21Z

Redo Travis, IIRC there have been some __numpy_ufunc__ fixes.

seberg · 2014-01-22T23:10:28Z

If there have been fixes, I think it may need to be rebased to run the new tests?

charris · 2014-01-23T01:02:31Z

Well, it still fails...

charris · 2014-01-30T02:54:10Z

Restarted Travis tests.

charris · 2014-01-30T02:54:30Z

Could use a rebase, Yeah.

charris · 2014-01-30T03:46:31Z

OK, checked a rebased version and

ERROR: test_multiarray.TestBinop.test_ufunc_override_rop_simple
----------------------------------------------------------------------
Traceback (most recent call last):
  File "/usr/lib/python2.7/site-packages/nose/case.py", line 197, in runTest
    self.test(*self.arg)
  File "/home/charris/Workspace/numpy.git/build/testenv/lib64/python2.7/site-packages/numpy/core/tests/test_multiarray.py", line 1746, in test_ufunc_override_rop_simple
    assert_equal(obj2.sum(), 42)
  File "/home/charris/Workspace/numpy.git/build/testenv/lib64/python2.7/site-packages/numpy/core/_methods.py", line 32, in _sum
    return umr_sum(a, axis, dtype, out, keepdims)
  File "/home/charris/Workspace/numpy.git/build/testenv/lib64/python2.7/site-packages/numpy/core/tests/test_multiarray.py", line 1715, in __numpy_ufunc__
    r = func(*inputs, **kw)
TypeError: output must be an array

Looks fixable.

cowlicks · 2014-04-01T04:09:47Z

What is the correct way to get the number of arguments for ufunc reductions? I'm looking for something like ufunc.nin but for the reduction methods.

If there isn't currently a way to do this. Is there a precise definition of nin which we could use to make a way? I'm wondering about a case like numpy.multiply.reduce(arr, ax), should the axis argument add to nin?

cowlicks · 2014-04-01T21:58:07Z

In PyUFunc_CheckOverride we assume that if there are more arguments than ufunc.nin then they must be out arguments, here. This doesn't work for reduction methods that take positional axis args since these are not counted in ufunc.nin.

So I think we'll have to add a special case for each reduction method so its arguments get parsed correctly.

njsmith · 2014-04-01T22:17:51Z

reduce and accumulate both exist only if nin==2, and they always take
exactly 1 array argument.

outer I guess acts like call

reduceat IIRC will always take 2 array arguments, and only exists for
nin==2, so I guess the current logic might work by accident but it would
probably be better to handle it explicitly anyway for clarity :-)

On Tue, Apr 1, 2014 at 10:58 PM, Blake Griffith [email protected]:

In PyUFunc_CheckOverride we assume that if there are more arguments than
ufunc.nin then they must be out arguments, herehttps://github.com/numpy/numpy/blob/master/numpy/core/src/private/ufunc_override.h#L98.
This doesn't work for reduction methods that take positional axis args
since these are not counted in ufunc.nin.

So I think we'll have to add a special case for each reduction method so
its arguments get parsed correctly.

—
Reply to this email directly or view it on GitHubhttps://github.com//pull/4186#issuecomment-39265463
.

Nathaniel J. Smith
Postdoctoral researcher - Informatics - University of Edinburgh
http://vorpus.org

PyArg_ParseTupleAndKeywords is pretty slow for keywords as it needs to create python strings first. Using positional arguments avoids this and gains 15-20% performance for small reductions.

juliantaylor · 2014-05-15T21:58:22Z

this can now go in, I think its still useful for 1.9 to fit the theme of faster small array operations

charris · 2014-05-15T23:41:54Z

LGTM, thanks Julian.

Improve small reduction performance

juliantaylor mentioned this pull request Feb 2, 2014

__numpy_ufunc__ check improvement #4255

Merged

juliantaylor added the Needs work label Feb 24, 2014

cowlicks mentioned this pull request Apr 19, 2014

Update ufunc override to work properly with ufunc methods. #4626

Merged

ENH: avoid keyword arguments for some ufunc.reduce wrappers

f0497ec

PyArg_ParseTupleAndKeywords is pretty slow for keywords as it needs to create python strings first. Using positional arguments avoids this and gains 15-20% performance for small reductions.

juliantaylor removed the Needs work label May 15, 2014

charris added a commit that referenced this pull request May 15, 2014

Merge pull request #4186 from juliantaylor/reduce-opt

e8e40dc

Improve small reduction performance

charris merged commit e8e40dc into numpy:master May 15, 2014

juliantaylor deleted the reduce-opt branch November 24, 2014 08:56

Uh oh!

Improve small reduction performance #4186

Improve small reduction performance #4186

Uh oh!

Conversation

juliantaylor commented Jan 11, 2014

Uh oh!

juliantaylor commented Jan 11, 2014

Uh oh!

cowlicks commented Jan 12, 2014

Uh oh!

charris commented Jan 15, 2014

Uh oh!

seberg commented Jan 22, 2014

Uh oh!

charris commented Jan 23, 2014

Uh oh!

charris commented Jan 30, 2014

Uh oh!

charris commented Jan 30, 2014

Uh oh!

charris commented Jan 30, 2014

Uh oh!

cowlicks commented Apr 1, 2014

Uh oh!

cowlicks commented Apr 1, 2014

Uh oh!

njsmith commented Apr 1, 2014

Uh oh!

juliantaylor commented May 15, 2014

Uh oh!

charris commented May 15, 2014

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants