Split erfa wrapper between python and cython sections #3141

jwoillez · 2014-11-25T11:41:01Z

Reference: #3134

…itself importing the cython extension

astrofrog · 2014-11-25T12:07:13Z

Thanks!

If we do go down this road, it may be worth splitting the erfa.py file into multiple files simply to avoid having a 20000 line file, or we can also consider generating it/them on the fly. At this stage, I think that @eteq or @mdboom should comment on this though, as I'm not sure what the best approach is.

What is the performance difference introduced by this PR?

mdboom · 2014-11-25T15:56:53Z

This is an interesting idea, and certainly an improvement. I still think dropping Cython altogether is going to be the better long term plan. It's not significantly more complex than the Cython approach, but should be much more efficient on all dimensions. I should have a PR for that quite soon and we can compare approaches.

eteq · 2014-11-25T21:37:57Z

@jwoillez - can you clarify your comment from #3134 :

Now, it seems that compiling erfa itself takes more time than its wrapper.

Did you benchmark this versus the current in-master version and conclude that? Can you give us some specific numbers?

eteq · 2014-11-25T21:44:44Z

Also, if we do go this route, we really should get auto-generation at build time going, as discussed in #3134 ... These 20,000-line edits are quickly going to spiral out of control otherwise.

jwoillez · 2014-11-26T04:35:19Z

Sorry about the confusion. Compiling ERFA itself does not take much time. It seems comparable to the compilation time for WCS. The issue is with the 20000 lines of Cython-generated C code.

I can provide approximate numbers later today.

jwoillez · 2014-11-26T04:46:50Z

@mdboom - You might be right. If rewriting all of the cython wrapper with the C api is a bit too much (it would be for me), you could also do so for the cython part left in this PR.

jwoillez · 2014-11-27T12:12:09Z

Without pushing one way or another (cython vs C), this PR is ready for review.

jwoillez · 2014-11-27T12:20:10Z

For info, compilation times are:

this PR: 50 s
master: 175 s

astrofrog · 2014-11-27T12:21:06Z

@jwoillez - thanks!

@eteq @jwoillez - can the auto-generation of the pyx file be done in get_extensions inside the setup_package.py file for ERFA? Then we don't have to set up any new infrastructure?
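One way to picture the suggestion above is a small helper, called from `get_extensions`, that renders the wrapper source from a template only when the output is missing or stale. The sketch below is an illustration under assumptions, not the actual astropy build code: it uses the standard library's `string.Template` as a stand-in for a real template engine, and the helper name `render_if_stale`, the file names, and the template contents are all hypothetical.

```python
import os
import tempfile
from string import Template


def render_if_stale(template_path, output_path, context):
    """Render template_path to output_path if the output is missing or older.

    Returns True when the file was (re)generated, False when it was up to date.
    (Hypothetical helper, not part of astropy.)
    """
    if (os.path.exists(output_path)
            and os.path.getmtime(output_path) >= os.path.getmtime(template_path)):
        return False
    with open(template_path) as f:
        source = Template(f.read()).substitute(context)
    with open(output_path, "w") as f:
        f.write(source)
    return True


# Demonstration in a temporary directory with a made-up one-function template.
workdir = tempfile.mkdtemp()
templ = os.path.join(workdir, "wrapper.py.templ")
out = os.path.join(workdir, "wrapper.py")
with open(templ, "w") as f:
    f.write("def $name(x):\n    return _core.$name(x)\n")

first = render_if_stale(templ, out, {"name": "example"})   # generates the file
second = render_if_stale(templ, out, {"name": "example"})  # already up to date
with open(out) as f:
    generated = f.read()
```

Regenerating only when the template is newer keeps repeated builds cheap, at the cost of relying on file modification times.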

astrofrog · 2014-11-27T12:28:38Z

Just to be clear, the compilation of the actual wrapper file is now actually much faster (a few seconds versus over a minute). The 50s value is for the whole of the astropy build process, right?

jwoillez · 2014-11-27T13:19:27Z

@astrofrog Correct for the question above. The wrapper compiles in a few seconds max.

astrofrog · 2014-11-27T13:47:19Z

Hmm, this raises an interesting point - auto-generating the wrappers requires jinja2 to be installed, so it would be a dependency for developers. Also, I just realized that I'm not sure if this way of auto-generating the wrappers is going to be suitable for dealing with stable releases. I'll need to think about it more.

jwoillez · 2014-11-27T13:51:44Z

Indeed. I might just remove that last commit...

astrofrog · 2014-11-28T08:23:01Z

@jwoillez - huh, very nice!

jwoillez · 2014-11-28T08:41:38Z

I redid the test and find no significant difference anymore. In any case, someone should double check me.

astrofrog · 2014-11-28T09:34:52Z

Just for the record (in case anyone else is wondering) I originally was surprised that the calculation would take this long (0.1s for 1000 calls) but it turns out I get the same timings if I call the C routines in ERFA directly, so this example is completely limited by the execution time of the ERFA C code.

It might be interesting to try the timings above with one of the much simpler ERFA routines, where we might end up being more limited by the time of the wrapper.
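The wrapper-overhead question can be probed with a generic timing pattern. The sketch below is only an illustration: it uses `math.sqrt` as a stand-in for a cheap C routine (it is not an ERFA function), and `wrapped_sqrt` is a hypothetical Python-level wrapper. For a routine this cheap, the per-call cost of the extra Python layer becomes visible, whereas for an expensive routine it would be lost in the C execution time.

```python
import math
import timeit


def wrapped_sqrt(x):
    """A trivial Python-level wrapper around a C-implemented function."""
    return math.sqrt(x)


n_calls = 1000
# Time the C function called directly, then through the Python wrapper.
direct = timeit.timeit("f(2.0)", globals={"f": math.sqrt}, number=n_calls)
wrapped = timeit.timeit("f(2.0)", globals={"f": wrapped_sqrt}, number=n_calls)

print(f"direct : {direct / n_calls * 1e6:.3f} us per call")
print(f"wrapped: {wrapped / n_calls * 1e6:.3f} us per call")
```

Absolute numbers depend on the machine and Python version, so only the comparison between the two lines is meaningful.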

jwoillez · 2014-11-28T10:46:38Z

Same test with the very simple aper function.

astrofrog · 2014-11-28T11:52:04Z

@jwoillez - perfect, thanks! This all looks fine to me, so at this point I'll just leave it up to @eteq to review and merge it (but I suspect this will need to wait until after the US holiday).

@mdboom - if you then have ideas for getting rid of the Cython altogether later on, then we can do this in a separate pull request.

mhvk · 2014-11-28T14:00:17Z

Wonderful!

jwoillez · 2014-11-28T14:35:07Z

If you are going to merge this, you may also consider an additional commit (jwoillez@d41c149) that addresses #3137.

mhvk · 2014-11-28T17:08:41Z

I'll merge this now as I think it is clear that it greatly improves compilation speed, does not harm execution speed, and passes all tests.

Split erfa wrapper between python and cython sections

embray · 2014-11-28T19:18:24Z

Please don't merge issues without at least first applying a milestone--thanks!

mhvk · 2014-11-28T20:22:58Z

@embray - apologies, usually I do think about the labels; will try to be better!

embray · 2014-11-28T20:45:34Z

No problem--it just really helps me keep things straightened out, and all the better if I don't have to do it for every issue :)

eteq · 2014-11-29T19:57:05Z

@mhvk - As @astrofrog said, I wanted to review this (and also put the auto-generation back in - this is 1MB of source changes!), but it's a holiday here in the US so I hadn't gotten to it yet. I guess I'll just have to issue another PR with the changes I was going to suggest.

eteq · 2014-11-29T20:02:00Z

(But very nice work, @jwoillez ! I'm a bit surprised at the performance implications, but good news that we can keep/improve speed while still speeding up compilation!)

mhvk · 2014-11-29T20:28:47Z

@eteq - sorry, I clearly went a bit too fast on this one (perhaps being too keen to get the Time stuff to work!). I thought the auto-generation was better done as a separate PR, but hadn't quite realised the implication of adding the large change to the git history...

eteq · 2014-11-30T01:22:26Z

@mhvk - well, I don't want to curb your enthusiasm ;)

And after posting that I tried some git tricks to estimate how much it actually takes up in the repo history (which eventually gets compressed). Turns out this whole thing ends up 100k compressed, even though it's more like 1.5M of actual changes. I suppose it's because it's a lot of repetitive boilerplate, which means it compresses pretty efficiently...
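The effect of repetitive boilerplate on compressed size can be illustrated with `zlib`, the compression library git uses for its objects. This is a rough sketch under assumptions: the generated text is made-up wrapper-like boilerplate, not the real erfa.py, and it ignores git's delta compression in packfiles, so the ratio here is not an estimate for the actual repository.

```python
import zlib

# Made-up boilerplate resembling many near-identical generated wrappers.
template = (
    "def func_{i}(a, b):\n"
    "    a, b = broadcast(a, b)\n"
    "    out = empty_like(a)\n"
    "    _core.func_{i}(a, b, out)\n"
    "    return out\n\n"
)
source = "".join(template.format(i=i) for i in range(2000)).encode("ascii")
compressed = zlib.compress(source)

ratio = len(source) / len(compressed)
print(f"{len(source)} bytes -> {len(compressed)} bytes (ratio {ratio:.1f}x)")
```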

jwoillez added 11 commits November 25, 2014 12:12

Split template into python and python parts

a8ede4b

Add python template to setup_package.py

2a2c245

Add processing of python template to cython_generator

1b1adb8

Adjustment to import python wrapper...

f5fab81

…itself importing the cython extension

Update erfa.pyx

0a5796c

Commit erfa.py

0633913

Add "_" in front of erfa functions in cython wrapper

fd7767a

Properly deal with stat_ok

716a7fc

Fix __all__ in both templates

7a4af75

Update to erfa.py and erfa.pyx

cb59251

Fix import in erfa tests

47a3861

jwoillez added 3 commits November 25, 2014 13:31

pep8 fixes to erfa.py.templ

4156614

Update erfa.py, following pep8 fixes

d654bdc

One more pep8 fix

8d06bfe

One more pep8 tweak

fcc70db

astrofrog mentioned this pull request Nov 26, 2014

Compile time of erfa.pyx is very large #3134

Closed

Yet again, a pep8 fix

40f9c9e

mhvk added a commit that referenced this pull request Nov 28, 2014

Merge pull request #3141 from jwoillez/erfa-speedup

893ce59

Split erfa wrapper between python and cython sections

mhvk merged commit 893ce59 into astropy:master Nov 28, 2014

This was referenced Nov 28, 2014

ERFA not scalar-proof #3135

Closed

Time scalar & multi-dimensional, using new ERFA #3138

Merged

Expose erfa constants from erfam.h and use them in Time #3137

Merged

embray added the Affects-dev, Enhancement, erfa, and time labels Nov 28, 2014

embray added this to the v1.0.0 milestone Nov 28, 2014

eteq mentioned this pull request Nov 29, 2014

Auto-generate erfa.pyx and erfa.py #3159

Closed

jwoillez deleted the erfa-speedup branch December 1, 2014 08:32

mdboom mentioned this pull request Dec 1, 2014

Don't use Cython for ERFA wrappers #3164

Closed

eteq mentioned this pull request Dec 2, 2014

ERFA wrapper tweaks #3173

Merged

mhvk mentioned this pull request Dec 16, 2014

ENH: add np.broadcast_to and reimplement np.broadcast_arrays numpy/numpy#5371

Merged

6 participants