
Relax tolerance of some tests for numpy 1.22#12682

Merged
pllim merged 3 commits into astropy:main from mhvk:numpy-precision-issues
Jan 5, 2022

Conversation

@mhvk (Contributor) commented Jan 1, 2022

Most likely as a consequence of numpy now supporting the faster but slightly less precise Intel Short Vector Math Library (SVML), some of our tests are failing, in ways that are not really interesting. So, up the tolerance slightly.

Note that I cannot reproduce the failures locally, so this may need some trial and error.
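As a sketch of the kind of change this implies (the numbers below are illustrative assumptions, not values from the actual diff):

```python
import numpy as np

# The failures look like SVML-level precision differences: the fast
# vectorized log/exp/sin paths can differ from the scalar implementations
# by a few ULP.
x = np.linspace(0.1, 10.0, 1000)
expected = np.log(x) + np.sin(x)

# Simulate a result computed via a slightly less precise code path.
result = expected * (1 + 3e-7)

# A very tight tolerance (say rtol=1e-8) would reject this even though
# nothing is meaningfully wrong; bumping rtol slightly keeps the test
# sensitive to real errors while tolerating SVML-level noise.
np.testing.assert_allclose(result, expected, rtol=1e-6)
```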

Checklist for package maintainer(s)

This checklist is meant to remind the package maintainer(s) who will review this pull request of some common things to look for. This list is not exhaustive.

  • Do the proposed changes actually accomplish desired goals?
  • Do the proposed changes follow the Astropy coding guidelines?
  • Are tests added/updated as required? If so, do they follow the Astropy testing guidelines?
  • Are docs added/updated as required? If so, do they follow the Astropy documentation guidelines?
  • Is rebase and/or squash necessary? If so, please provide the author with appropriate instructions. Also see "When to rebase and squash commits".
  • Did the CI pass? If no, are the failures related? If you need to run daily and weekly cron jobs as part of the PR, please apply the Extra CI label.
  • Is a change log needed? If yes, did the change log check pass? If no, add the no-changelog-entry-needed label. If this is a manual backport, use the skip-changelog-checks label unless special changelog handling is necessary.
  • Is a milestone set? Milestone must be set but astropy-bot check might be missing; do not let the green checkmark fool you.
  • At the time of adding the milestone, if the milestone set requires a backport to release branch(es), apply the appropriate backport-X.Y.x label(s) before merge.

@pllim (Member) left a comment

Seems uncontroversial enough. Thanks!


@mhvk (Contributor, Author) commented Jan 1, 2022

The Python 3.9 Windows failure also seems related to the new numpy - and may be a bug...

I'm also unsure about the matplotlib failure - that might well be numpy-related too, and thus need a new baseline image, but I don't know how to investigate that... Maybe, to get things going again, we could pin numpy for those jobs for now?

@mhvk (Contributor, Author) commented Jan 1, 2022

For the win32 failure, I think this really is a numpy problem, so I raised an issue at numpy/numpy#20699.

@pllim (Member) commented Jan 2, 2022

Thanks for looking into this!

Hmm. Should we wait for numpy/numpy#20699 to be resolved before proceeding?


@mhvk (Contributor, Author) commented Jan 2, 2022

On looking more closely, I now also wonder about the test case that failed: there is actually a huge difference in the parameters (EDIT: in the last one!):

E       AssertionError: 
E       Not equal to tolerance rtol=0.15, atol=0
E       
E       Mismatched elements: 1 / 6 (16.7%)
E       Max absolute difference: 1549800.82273902
E       Max relative difference: 0.99999981
E        x: array([9.999454, 4.999725, 5.000437, 4.000786, 3.999523, 0.30056 ])
E        y: array([9.999455e+00, 4.999673e+00, 5.000516e+00, 4.000724e+00,
E              3.999598e+00, 1.549801e+06])

@mhvk (Contributor, Author) commented Jan 2, 2022

Annoyingly, I cannot reproduce that result...

@mhvk force-pushed the numpy-precision-issues branch from 41ae13c to 4f33333 on January 2, 2022 19:23
@mhvk (Contributor, Author) commented Jan 2, 2022

Possibly, this is because Gaussian2D's theta was not actually initialized. 🤞


@pllim (Member) commented Jan 3, 2022

FYI: I rebased to see if some of the failures go away.

Update: I also updated the truths for mpl311 over at astropy-data instead of trying to pin here, which broke both CircleCI jobs, whereas only mpl311 is failing with those 2 images on main. I don't see anything in the diff, but whatever.

p.s. I think the ResourceWarning about test3.fits is because the other test failed and didn't clean up open file pointers properly. It will go away when we skip the affected test.
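The unclosed-file-pointer mechanism described above can be reproduced generically in plain Python (no FITS involved; the scratch file here is just a stand-in for test3.fits):

```python
import gc
import os
import tempfile
import warnings

# Create a scratch file standing in for the FITS file left open by the test.
fd, path = tempfile.mkstemp()
os.write(fd, b"data")
os.close(fd)

def leaky_read(path):
    f = open(path, "rb")  # no context manager: if an assertion fired before
    return f.read()       # an explicit close, f would be left to the GC

with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always", ResourceWarning)
    leaky_read(path)  # abandoned handle is finalized on return (CPython)...
    gc.collect()      # ...or here on implementations without refcounting
leaked = any(issubclass(w.category, ResourceWarning) for w in caught)

# Deterministic cleanup avoids the warning entirely:
with open(path, "rb") as f:
    data = f.read()
os.remove(path)
```

This is why the ResourceWarning should disappear once the failing test is skipped: nothing is left for the garbage collector to complain about.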

@pllim added the "Extra CI" (Run cron CI as part of PR) label on Jan 3, 2022
@pllim (Member) commented Jan 3, 2022

@mhvk , I took the liberty of also trying to fix all the other failures that cropped up over the holiday weekend here. Hope you don't mind!

@mhvk (Contributor, Author) commented Jan 3, 2022

@pllim - of course fine to push further attempts, thanks!

It does look like something in the modelling really changed, allowing theta to wander off (of course, it is modulo 2pi that counts, but it is strange that the angle goes so much further off than before!). Were you able to confirm any of these on your own machine?

Most likely as a consequence of numpy now supporting the faster but
slightly less precise Intel Short Vector Math Library (SVML), some of
our tests are failing, in ways that are not really interesting.  So,
up the tolerance slightly.

See numpy/numpy#19478

Temporarily pin numpy for matplotlib tests

Ensure theta of Gaussian2D is initialized
@pllim force-pushed the numpy-precision-issues branch from 3ce6f0f to 6eecd23 on January 3, 2022 21:39
@pllim (Member) commented Jan 3, 2022

Were you able to confirm any of these on your own machine?

No, I am doing other work while the CI churns away. The most curious thing is that not all jobs are failing, so something specific is triggering it. I wonder if @WilliamJamieson has any idea...

@pllim (Member) commented Jan 3, 2022

Really don't know what is going on with modeling. I don't see this failure on main, and I don't see how this patch could trigger it. I pushed a commit to just skip it on Windows, to see if other jobs even have the same failure or not.

@pllim force-pushed the numpy-precision-issues branch from bb9bc96 to 4210769 on January 3, 2022 22:30
@pllim (Member) commented Jan 4, 2022

Maybe we don't need this patch if #12684 works... 👀

@pllim (Member) commented Jan 4, 2022

@WilliamJamieson is investigating if he can rewrite the offending modeling test to be more robust.

@pllim (Member) commented Jan 4, 2022

I also see this warning in a local install when I do a pip install numpy -U to upgrade to 1.22. Should we be worried about this?

oldest-supported-numpy 0.15 requires numpy==1.17.3; python_version == "3.8"
and (platform_machine != "arm64" or platform_system != "Darwin")
and platform_machine != "aarch64" and platform_machine != "s390x"
and platform_python_implementation != "PyPy", but you have numpy 1.22.0 which is incompatible.
Successfully installed numpy-1.22.0

@pllim (Member) commented Jan 4, 2022

p.s. I am unable to reproduce the modeling failure on Windows 10 locally (after unskipping it, of course).

@mhvk (Contributor, Author) commented Jan 4, 2022

I tried to follow all the experiments, but I fear it just left me super-confused! In particular, not being able to actually reproduce the failures does not help...

@pllim (Member) commented Jan 4, 2022

Hmm. Looks like we don't have to touch most of the tests if we go with #12685 . What do you think?

@mhvk (Contributor, Author) commented Jan 5, 2022

Ideally, we run our tests as users would, so without setting environment variables. I don't mind relaxing the constraints, but the changes in modelling are really worrying.

So, perhaps the best course is to use #12685 for now, so that the tests pass again and we can continue to merge new stuff, but raise an issue to remind us to find out what is causing the modelling problems - we don't want users to get wrong results if they don't have that variable set!!

@WilliamJamieson (Contributor) commented

@mhvk I suspect that the "modeling problem" is just a case of equivalent parameter sets: the only parameter that is "wrong" is the rotation angle of the Gaussian2D model, and that parameter is technically periodic with period 2pi. My theory is that the scipy.optimize.leastsq method is just wandering to a different "equivalent" parameter. At some point we should consider switching over to scipy.optimize.least_squares, which is now the recommended method.

Note that the test changes I suggested in #12684 are passing for the Gaussian2D; they compare the outputs of the two models over many test points, demonstrating that the two models seem to arrive at the same results. This type of test is indifferent to "equivalent" parameter values. The concerning thing to me is that this seems to only happen in a specific environment.
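The "equivalent parameter sets" point can be illustrated without astropy: a rotated 2D Gaussian written out by hand (the parameterization below mirrors the usual Gaussian2D form, but is an independent sketch with made-up parameter values) produces identical outputs when theta shifts by a full period, so an output-based comparison is indifferent to which equivalent angle the fitter lands on.

```python
import numpy as np

def gaussian2d(x, y, amplitude, x0, y0, sx, sy, theta):
    # Standard rotated elliptical Gaussian; written out here so the
    # example does not depend on astropy's Gaussian2D itself.
    a = np.cos(theta)**2 / (2 * sx**2) + np.sin(theta)**2 / (2 * sy**2)
    b = -np.sin(2 * theta) / (4 * sx**2) + np.sin(2 * theta) / (4 * sy**2)
    c = np.sin(theta)**2 / (2 * sx**2) + np.cos(theta)**2 / (2 * sy**2)
    dx, dy = x - x0, y - y0
    return amplitude * np.exp(-(a * dx**2 + 2 * b * dx * dy + c * dy**2))

x, y = np.meshgrid(np.linspace(-5, 5, 50), np.linspace(-5, 5, 50))
params = dict(amplitude=10.0, x0=0.5, y0=-0.3, sx=1.0, sy=2.0)

z1 = gaussian2d(x, y, theta=0.3, **params)
# Same model with the angle shifted by a full period: the *parameters*
# look different, but the *outputs* agree to floating-point precision.
z2 = gaussian2d(x, y, theta=0.3 + 2 * np.pi, **params)
np.testing.assert_allclose(z1, z2)
```

A parameter-wise assertion would flag theta here while an output-wise one passes, which is exactly the failure mode seen in the CI log.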

@pllim (Member) commented Jan 5, 2022

we run our tests as users would, so without setting environment variables

That is a good point as well. Now I am torn... We have two options:

  1. If we decide we do not want to rely on a magic env var, we go with this PR, but first I need to cherry-pick @WilliamJamieson 's fixes from TST: Use NPY_DISABLE_CPU_FEATURES for numpy 1.22 #12684 .
  2. If we decide we do not want to adjust tests, we go with TST: Use NPY_DISABLE_CPU_FEATURES for numpy 1.22 (try 2) #12685 .
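For reference, the env-var route amounts to setting the variable before the test run; a rough CI-workflow sketch follows (this is not the actual diff from #12685, and the feature names shown are an assumption that must match features the runner's CPU actually has):

```yaml
# GitHub Actions step (illustrative): tell numpy 1.22+ to skip the
# SIMD-dispatched code paths, so tests see the older, more precise
# scalar implementations instead of SVML.
- name: Run tests without SVML-accelerated paths
  env:
    NPY_DISABLE_CPU_FEATURES: "AVX512F"
  run: python -m pytest
```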

@saimn (Contributor) commented Jan 5, 2022

Not using the env var seems better, since it's hard to ensure that it's always set, e.g. when running pytest directly. So, as it seems we now have a solution for the various failing tests - adjusting the constraints - I would go with that.

@mhvk (Contributor, Author) commented Jan 5, 2022

@WilliamJamieson - thanks, that makes sense. As you say, the weird thing is that it depends on environment. But as long as one gets a fit that actually works, it does not matter too much.

@mhvk (Contributor, Author) commented Jan 5, 2022

I also would tend to go with this PR, i.e., not using the environment variable. I have some hope that improvements will be made over at numpy so that the precision gets better again, but for our purposes the matches are still amply good enough.

pllim and others added 2 commits January 5, 2022 11:47
TST: Pin numpy in pyinstaller job
because I have no idea what is going on there. Undoing the removal of numpy.distutils gives me an error about _private.

Skip failing FITS test due to numpy/numpy#20699
but we need to open follow-up issue to unskip it.

DOC: Replace broken Simbad links.
Added back in original assert, ignoring only for Gaussian2D.
@pllim force-pushed the numpy-precision-issues branch from 4210769 to c33be6f on January 5, 2022 16:50
@pllim (Member) commented Jan 5, 2022

Okay, I cleaned up this PR. Hopefully tests will pass now. I still don't understand why pyinstaller fails with numpy 1.22, but I have pinned it to 1.21.* for now.

@pllim (Member) commented Jan 5, 2022

The "Allowed failure" failure is an unrelated socket timeout.

@pllim merged commit 3d522eb into astropy:main on Jan 5, 2022
@pllim (Member) commented Jan 5, 2022

Thank you, everyone!


pllim added a commit to pllim/astropy that referenced this pull request Jan 5, 2022
Co-authored-by: Marten van Kerkwijk <[email protected]>
Co-authored-by: William Jamieson <[email protected]>
@mhvk deleted the numpy-precision-issues branch on August 26, 2023 10:11