FYI: ERFA wrapper factorization #3176

jwoillez · 2014-12-03T15:27:20Z

This is an attempt to factorize most of the code on the Python side of the ERFA wrapper, following what @mdboom did in #3164. It reduces the memory footprint, but may add a small performance penalty per ERFA call. I have not measured either effect precisely.

PS: I will be out of touch for the next four weeks. I thought I would push this out in case somebody wants to look into it. This is very low priority compared to all the other ERFA-related PRs...

mdboom · 2014-12-03T15:41:30Z

FWIW: On my machine from astropy import erfa uses about 300k less with this PR. There is a slight performance penalty, but maybe it doesn't matter given the improved readability of this approach.

jwoillez · 2014-12-03T16:18:12Z

Just some additional thoughts. It might be interesting to take the factorized part out of the template and keep it under version control, so that any additional fixes we implement appear in the git history. We are talking about only ~200 lines of code or so.

embray · 2014-12-03T20:56:01Z

Hah, I was right in the middle of working on something similar to this, though my approach doesn't use any templates at all. Still, this is helpful to look at.

embray · 2014-12-03T20:58:09Z

In general though I'm glad to see we're in agreement that most of the code for the wrapper functions could be generalized (as you did in setup_iter).

embray · 2014-12-03T21:04:09Z

astropy/erfa/erfa.py.templ

There's already a function like this here:
https://github.com/astropy/astropy/blob/master/astropy/modeling/utils.py#L25

Maybe it would be worth starting a module in astropy.utils for Numpy-related utilities and moving it there; then this could use it.

I'm a little confused why np.broadcast is not used (both here and in modeling.utils). As that is written in C, it should nominally be faster, and that seems to be confirmed by a simple test::

    In [1]: import numpy as np
    In [2]: from astropy.modeling.utils import check_broadcast
    In [3]: a = np.arange(125).reshape(5,5,5)
    In [4]: b = np.arange(5).reshape(1,5,1)
    In [5]: np.broadcast(a,b)
    Out[5]: <numpy.broadcast at 0x3651dc0>
    In [6]: %timeit np.broadcast(a, b).shape
    1000000 loops, best of 3: 481 ns per loop
    In [7]: %timeit check_broadcast(a.shape, b.shape)
    100000 loops, best of 3: 4.14 µs per loop

In the case of modeling I can tell you exactly: for whatever reason, np.broadcast only works on existing Numpy arrays. I needed a utility to calculate broadcast shapes without necessarily having created all the arrays involved in the operation yet. In other words, I needed a utility that could compute the shape of the resulting np.broadcast given just a list of array shapes, not the arrays themselves.

I think maybe the same thing was needed here.
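A minimal sketch of the shape-only calculation described above, assuming the standard NumPy broadcasting rules. The helper name `broadcast_shape` is hypothetical; it is not the `check_broadcast` from astropy.modeling.utils, whose implementation is not shown in this thread.

```python
from itertools import zip_longest


def broadcast_shape(*shapes):
    """Return the shape that arrays of the given shapes would broadcast to.

    Hypothetical helper for illustration only. It works on shape tuples,
    so none of the arrays need to exist yet. Raises ValueError if the
    shapes are incompatible.
    """
    result = []
    # Align the shapes on their trailing axes; a missing leading axis
    # behaves like an axis of length 1.
    reversed_shapes = [tuple(reversed(s)) for s in shapes]
    for dims in zip_longest(*reversed_shapes, fillvalue=1):
        sizes = {d for d in dims if d != 1}
        if len(sizes) > 1:
            raise ValueError(f"shapes {shapes} cannot be broadcast together")
        result.append(sizes.pop() if sizes else 1)
    return tuple(reversed(result))


print(broadcast_shape((5, 5, 5), (1, 5, 1)))  # (5, 5, 5)
print(broadcast_shape((3, 1), (4,)))          # (3, 4)
```

A pure-Python loop like this is expected to be slower than np.broadcast, which matches the direction of the timings quoted earlier; the trade-off is that it needs only the shapes.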

The inner dimensions needed by ERFA are removed before the call to broadcast_array (in setup_iter). If you want to use np.broadcast, you need to index the arrays passed in, in order to remove these consumed inner dimensions, using something like [(Ellipsis,)+(0,)*len(args_shape[i])].
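To illustrate the indexing described above, here is a sketch that assumes each argument's trailing axes are the ones consumed by ERFA and that `args_shape[i]` holds that consumed inner shape (for example `(3, 3)` for a rotation matrix). The arrays and their shapes are made up for the example.

```python
import numpy as np

# Hypothetical inputs: a stack of 3x3 matrices and a stack of scalars.
args = [np.zeros((4, 1, 3, 3)), np.zeros((5,))]
# Inner (trailing) shape consumed by the wrapped function, per argument.
args_shape = [(3, 3), ()]

# Indexing with (Ellipsis, 0, 0, ...) drops the trailing inner axes.
# This is basic indexing, so each result is a view and no data is copied.
outer_args = [
    arg[(Ellipsis,) + (0,) * len(inner)]
    for arg, inner in zip(args, args_shape)
]

print([a.shape for a in outer_args])    # [(4, 1), (5,)]
print(np.broadcast(*outer_args).shape)  # (4, 5)
```

Only the outer shapes take part in the broadcast, so the result is the shape the iteration over ERFA calls would need to cover.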

OK, sounds like both cases are indeed different, both from np.broadcast and each other. Given that np.broadcast is so much faster it may still be worth using, but I think that is much better left for later optimization.

make_output_scalar -> make_outputs_scalar

jwoillez · 2014-12-05T08:10:40Z

This should fix the numpy <= 1.7 issue. I also rebased to get one single wrapper update at the end.

embray · 2014-12-05T15:48:50Z

@jwoillez Thanks--that fixed it.

I'm almost finished with some changes on top of your changes in this PR that would remove the generation of Python code altogether, as has been discussed in other threads. I don't know if this will look good to anyone else or not but I'm almost ready to put it out there.

Since my tweaks are built on top of this PR would you consider a PR to your PR, or should I just make a separate one? (Or maybe I can just point you to the branch when it's ready and you can tell me what you think). Thanks!

mhvk · 2014-12-05T15:56:59Z

@embray - for me it would be easier to review if you submitted a separate PR that is based on the work here, i.e., includes its commits.

EDIT -- actually not so much for review as for testing: for a direct PR to astropy I know how to get the branch down to my repository, but if I have to grab different pieces from different places, I have a perhaps unfounded fear that it'll become a mess.

embray · 2014-12-05T16:02:29Z

Whether I submit a PR to @jwoillez's fork, or a separate PR, it would include all relevant commits and checking out the branch for testing is the same.

jwoillez · 2014-12-05T16:06:54Z

@embray - Please go for a separate PR, as I will not be able to do anything for the next few weeks.

mhvk · 2014-12-05T16:07:39Z

OK, unfounded fear... I guess the only disadvantage then is that any comments would be under @jwoillez's github account rather than @astropy's.
...
anyway, seems like a separate PR is better independently.

jwoillez · 2014-12-06T06:50:12Z

astropy/erfa/erfa.py.templ

Maybe replace the 4 lines above by:

    outer_args = [args[i][(Ellipsis,)+(0,)*len(args_shape[i])] for i in range(len(args))]
    iter_shape = numpy.broadcast(*outer_args).shape

and remove def calculate_broadcast() above.

@embray - To use in #3181?

I should really read all comments before replying -- looks like the optimization can be done already!

I just used my check_broadcast utility. The version in #3181 is here:

https://github.com/embray/astropy/blob/erfa-factorization/astropy/erfa/erfa.py#L174

This started from your code but, I think, made a few simplifications (it requires fewer loops over the arguments, mainly).

embray · 2015-01-13T22:17:15Z

No wonder I'm confused. I thought this PR was already merged...

jwoillez · 2015-03-12T06:57:40Z

Looks like this PR is superseded by #3181: closing.

embray · 2015-03-12T15:16:09Z

@jwoillez That's not really how I felt about it--there are still some performance issues I need to resolve in #3181 before it can be of use and I have higher priorities at the moment.

Factorization of python wrapper code

6a661a8

mhvk mentioned this pull request Dec 3, 2014

Generate ERFA wrappers on-the-fly #3170

Closed

embray reviewed Dec 3, 2014

jwoillez added 2 commits December 5, 2014 09:07

Fixed typo impacting NPY < 1.8

b825313

make_output_scalar -> make_outputs_scalar

Fix scalar handling logic

e1561dd

jwoillez force-pushed the erfa-factorization branch from d5f3641 to 21643dd Compare December 5, 2014 08:09

jwoillez force-pushed the erfa-factorization branch from 21643dd to 8152fd9 Compare December 5, 2014 09:29

jwoillez added 2 commits December 5, 2014 12:18

pep8 fix

1b30ea6

Wrapper update

7c29bcd

jwoillez force-pushed the erfa-factorization branch from 8152fd9 to 7c29bcd Compare December 5, 2014 11:19

embray mentioned this pull request Dec 5, 2014

liberfa Python further simplification #3181

Closed

jwoillez reviewed Dec 6, 2014

astrofrog added the erfa label Jan 13, 2015

jwoillez closed this Mar 12, 2015

jwoillez deleted the erfa-factorization branch August 20, 2015 07:06

embray mentioned this pull request Nov 23, 2018

Handling of invisible dtype fields in io.fits #8172

Open
