Speed up Column.__getitem__ in general #4075

embray · 2015-08-13T21:55:05Z

Alternative to #3930--speed up Column.__getitem__ using Cython base classes. This eliminates most of the overhead associated with calling pure-Python __getitem__, and speeds up the body of the method as well.

This is a more general solution than the one in #3930--I think this could be useful for #3915. Checking whether the indices need to be updated adds only a trivial amount of overhead, so we can perform that check in the actual __getitem__ without incurring additional overhead. So unless we still want to be able to use table[slice] without incurring index adjustment overhead, the get_item methods won't be necessary.
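To make the overhead claim concrete, here is a rough sketch (not astropy code) that times scalar item access on a subclass that inherits the C-level `__getitem__` against one that overrides it in pure Python. Using `list` instead of `ndarray`/`Column` is an assumption made so the example needs only the standard library, and the class and function names are hypothetical.

```python
import timeit


class PlainList(list):
    """Subclass that inherits the C-level __getitem__ unchanged."""


class PyGetitemList(list):
    """Hypothetical stand-in for a class with a pure-Python __getitem__."""

    def __getitem__(self, item):
        # Even a trivial pass-through forces a Python-level call per access.
        return list.__getitem__(self, item)


def time_getitem(seq, number=200_000):
    """Return the seconds spent doing `number` scalar item accesses on seq."""
    return timeit.timeit(lambda: seq[5], number=number)


plain = PlainList(range(10))
shimmed = PyGetitemList(range(10))

t_plain = time_getitem(plain)
t_shimmed = time_getitem(shimmed)
print(f"inherited __getitem__:   {t_plain:.4f} s")
print(f"pure-Python __getitem__: {t_shimmed:.4f} s")
```

The absolute numbers depend on the machine and interpreter. The point is only that the second class pays for an extra Python function call on every access, which is the cost a compiled base class is meant to avoid.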

embray · 2015-08-13T22:21:43Z

The one thing I don't like about this is the duplicate code in _ColumnGetitemShim and _MaskedColumnGetitemShim. I really tried to find a workaround to that, but anything I could come up with added noticeable overhead.

Column.__getitem__ speedup branch of astropy#4075, and made adjustments to take advantage of it in the indices code. This included getting rid of the `def get_item(...)` hacks, which I think are no longer necessary--the base Column.__getitem__ is now almost exactly as fast as the stock implementation in the basic case of single item scalar access. In the process I also refactored the Cython code a bit so that it could be less repetitive. For now, tests that failed due to the change in behavior (indices now *are* copied by default when slicing) are commented out, but all other tests pass. It should also be noted that the 'copy_on_getitem' index mode is now a no-op since that's the default behavior. We could still change this so that it's not the default behavior, if we prefer, or add a no_copy_on_getitem mode instead.

taldcroft · 2015-08-21T20:35:16Z

This looks good to me, and hurts my brain less, as long as it's fast!

embray · 2015-08-21T22:16:41Z

I'll try to get some performance results up on this PR. From my rough testing though this adds < 1% overhead compared to the base __getitem__.

embray · 2015-08-27T16:55:42Z

Added a regression test proving that this PR also fixes #4098--also rebased this on top of #4099 so that the test I added can pass.

embray · 2015-08-27T19:56:06Z

Would be good to merge this sooner rather than later since it fixes #4098. However, I'm not sure what impact that will have on #3915. I don't want to introduce too many merge conflicts (though I don't think any such conflict will be too difficult to overcome...)

…er we need access to in this code is 'nd')--this allows us to avoid a global variable lookup of 'ndarray' when calling 'ndarray.tp_as_mapping.mp_subscript'. Now, in the common case of ndim==1, Column (but not MaskedColumn) adds virtually no overhead compared to the built-in __getitem__.

embray · 2015-09-30T20:06:02Z

Updated to incorporate #3915 / #4202.

As I noted previously, this makes the 'copy_on_getitem' index mode a no-op, since that's now the default behavior. We could still change this so that it's not the default behavior, if we prefer, or add a no_copy_on_getitem mode instead. @taldcroft ? @mdmueller ?

embray · 2015-09-30T20:12:48Z

(To clarify, we could still make the default behavior be to not copy the indices on a slice. I wasn't sure which way is preferable, and if the current non-copy default was just chosen to not slow down __getitem__ of columns with no index.)

taldcroft · 2015-09-30T20:21:10Z

@embray - the current non-copy default was chosen more to avoid slowing down __getitem__ of columns with an index. The idea was that normally if you are slicing an individual column that happens to be an index in the table, you don't usually want the index to come along for the ride. This re-indexing can be pretty slow.

embray · 2015-09-30T20:23:01Z

Okay, I wondered about that. In that case I'll restore the original default. This PR should still speed things up in either case since it does away with the penalty of having a __getitem__ in Python.

taldcroft · 2015-09-30T20:24:06Z

Sounds good, thanks.

@mdmueller

…is *not* the default (and re-enabled the tests that I previously disabled before I was sure what we were going to do here). This required going back to using a __class__ switcheroo to enable the index copying inside the index mode context manager. Originally I just wanted to add a flag to enable or disable it, but since that flag would have to be accessible from Python that just added too much slowdown to the general case. So this ends up adopting a middle ground between my original version of this code and @mdmueller's version.
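The `__class__` switcheroo described above can be sketched roughly as follows. This is an illustration under stated assumptions, not the code from this PR: `Column` here is a hypothetical `list`-based stand-in, `_IndexCopyingColumn` and the `copy_on_getitem` function are made-up names (the latter borrows the index mode's name), and copying the index list is a placeholder for real re-indexing.

```python
from contextlib import contextmanager


class Column(list):
    """Hypothetical stand-in: by default, slices do NOT carry the index."""

    def __init__(self, data=(), index=None):
        super().__init__(data)
        self.index = index

    def __getitem__(self, item):
        result = list.__getitem__(self, item)
        if isinstance(item, slice):
            # Default mode: no flag to check, the slice simply has no index.
            return Column(result)
        return result


class _IndexCopyingColumn(Column):
    """Same layout as Column, but slices also receive a copy of the index."""

    def __getitem__(self, item):
        result = list.__getitem__(self, item)
        if isinstance(item, slice):
            # Placeholder: a real implementation would re-index for the slice.
            index = None if self.index is None else list(self.index)
            return Column(result, index=index)
        return result


@contextmanager
def copy_on_getitem(col):
    """Temporarily swap col's class so that slicing copies the index."""
    original_cls = col.__class__
    col.__class__ = _IndexCopyingColumn
    try:
        yield col
    finally:
        # Always restore the fast default class, even on error.
        col.__class__ = original_cls


col = Column([10, 20, 30, 40], index=[0, 1, 2, 3])
default_slice = col[1:3]
with copy_on_getitem(col):
    copied_slice = col[1:3]
```

The design trade-off is that the default `__getitem__` never has to test a mode flag. The alternate behavior lives in a separate class that is only swapped in for the duration of the context manager.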

embray · 2015-10-01T19:46:54Z

Okay, I restored the original default behavior. In order to get this working right without sacrificing performance I went back on some of @mdmueller's original implementation, or at least something that looks like it (but still without any need for the get_item methods). But this still keeps the performance benefits of my original code as well.

embray · 2015-10-02T16:31:56Z

Apparently I suck at testing on Python 2...

taldcroft · 2015-10-03T13:44:42Z

@embray - I don't have time this weekend for detailed review on this, but I'm trusting that the table testing is sufficiently complete. Have you done some simple %timeit performance testing comparing this to master?

mhvk · 2015-10-03T20:10:09Z

@embray - this looks good, and cleaner than before (though I cannot judge the cython very well). What seems to be missing, though, are some comments that explain why we are doing this at all. The original shim class had relevant comments; maybe put those either in the pyx file or below the Column class definition (to explain why it is subclassing the shim). Otherwise, I worry that a few years from now we have no clue anymore why this is being done...

mhvk · 2015-10-03T20:11:21Z

p.s. I even wonder if we should have a section in the developer docs with "tips & tricks", where we summarize these types of speed-ups and their rationale. Indeed, my shape-changing mixin would be another good addition...

embray · 2015-10-05T17:02:27Z

I'll add some better explanatory comments before merging.

@taldcroft Concerning timing: this adds a tiny slowdown compared to the stock ndarray.__getitem__, solely due to the dimension check, which adds only a handful of instructions. So we're dabbling in differences of a nanosecond or two, if that--really less than a timer can measure accurately anyway.

taldcroft · 2015-10-05T17:21:09Z

@embray - that's definitely in the noise, so no problem! Thanks for working on this.

astrofrog · 2015-10-05T20:00:29Z

@embray - could you add the comments in the next hour or so? It would be nice to get this in 1.1 if possible.

astrofrog · 2015-10-05T22:11:34Z

I'm going to branch v1.1, but we can consider merging this in after since it's just a performance fix.

embray · 2015-10-12T15:43:58Z

Oh, this is still open... let me add those aforementioned comments and then merge.

taldcroft · 2015-10-12T16:25:24Z

👍

embray added table Performance labels Aug 13, 2015

embray force-pushed the table/getitem branch from 19daba4 to 452c639 Compare August 13, 2015 22:13

embray mentioned this pull request Aug 14, 2015

Indexing system for Table #3915

Closed

embray added the Affects-dev PRs and issues that do not impact an existing Astropy release label Aug 14, 2015

embray mentioned this pull request Aug 27, 2015

astropy tables with mixin columns cannot be pickled #4098

Merged

embray force-pushed the table/getitem branch from 4c15031 to 9f7d6fb Compare August 27, 2015 16:54

embray added this to the v1.1.0 milestone Aug 27, 2015

embray added 4 commits September 30, 2015 15:24

Alternative to astropy#3930--speed up Column.__getitem__ using Cython…

9c304f8

… base classes. This eliminates most of the overhead associated with calling pure-Python __getitem__, and speeds up the body of the method as well.

Adds a regression test for astropy#4098 confirming that the changes o…

09f0672

…n this branch also fix that issue.

embray force-pushed the table/getitem branch from 9f7d6fb to 1d68540 Compare September 30, 2015 20:02

embray added 2 commits September 30, 2015 16:57

Silly whitespace adjustments--should be two newlines between module-l…

fc51cf2

…evel classes / functions. Put module docstring before imports.

Fix for Python 2

a3278a6

embray force-pushed the master branch from 63540e0 to 34f4c29 Compare October 5, 2015 21:18

mhvk mentioned this pull request Oct 5, 2015

Table indexing should not affect Mixin classes except through info #4222

Merged

Added some more explanatory documentation for the column __getitem__ …

5de23bb

…shims.

embray added a commit that referenced this pull request Oct 12, 2015

Merge pull request #4075 from embray/table/getitem

5a95d0d

Speed up Column.__getitem__ in general

embray merged commit 5a95d0d into astropy:master Oct 12, 2015

embray deleted the table/getitem branch October 12, 2015 21:41

embray added a commit that referenced this pull request Oct 15, 2015

Merge pull request #4075 from embray/table/getitem

3c9be2b

Speed up Column.__getitem__ in general

dhomeier mentioned this pull request Oct 30, 2023

Use Cython 3.0.x #15402

Merged
