BLD: use smaller scipy-openblas builds by mattip · Pull Request #27147 · numpy/numpy

mattip · 2024-08-08T20:40:16Z

Builds on #27140 to use the same OpenBLAS build but with fewer kernels. Based on the analysis in MacPython/openblas-libs#144 there are now 5 kernels based on cpu core labels PRESCOTT NEHALEM SANDYBRIDGE HASWELL SKYLAKEX. Needs a release note about the possible performance implications, and will also add a note about the windows changes in #27140.

mattip · 2024-08-09T05:43:16Z

@charris This would be nice to backport since it shrinks the wheel sizes

charris · 2024-08-09T13:32:05Z

Thanks Matti.

charris · 2024-08-09T19:12:01Z

This would be nice to backport

Done for 2.1 because it will have an rc, but skipped for 2.0.

theAeon · 2024-09-08T18:19:54Z

Has anyone compared these benchmarks against the Zen kernels on AMD chips? The original post only tested Intel archs b/c its a mac-focused repo, but its entirely possible that there will be a not-insignificant performance difference.

mattip · 2024-09-08T20:10:16Z

We would need someone to rerun the benchmark scripts with an AMD processor that has AVX512 features.

theAeon · 2024-09-08T23:33:04Z

Only AVX2 over here, unfortunately.

theAeon · 2024-09-09T00:42:04Z

Looks like M7a instances should do the trick.

edit: i'm just going to do it

theAeon · 2024-09-09T01:43:45Z

Marginally worse than SKYLAKEX despite, according to OpenBLAS docs, being HASWELL with zen2/3 optimizations (i.e. no AVX512). Curious what this looks like on my local zen 3 machine.

arch	mean	spread	perf_ratios
SAPPHIRERAPIDS	2.57453	0.01505	1
CORE2	2.58216	0.01535	1.00296
COOPERLAKE	2.58281	0.01735	1.00321
SKYLAKEX	2.58809	0.0184	1.00527
ZEN	2.58923	0.01005	1.00571
PRESCOTT	2.59641	0.0072	1.0085
PENRYN	2.59959	0.0206	1.00973
HASWELL	2.60049	0.0165	1.01008
KATMAI	2.60156	0.0104	1.0105
ATOM	2.6024	0.02285	1.01082
COPPERMINE	2.60262	0.0155	1.01091
NORTHWOOD	2.60514	0.01525	1.01189
DUNNINGTON	2.60977	0.00905	1.01369
SANDYBRIDGE	2.61479	0.00485	1.01564
NEHALEM	2.6164	0.01415	1.01626
BANIAS	2.61833	0.00895	1.01701

theAeon · 2024-09-09T03:22:46Z

arch	mean	spread	perf_ratios
HASWELL	0.0699409	0.000441	1
ZEN	0.0714307	0.003309	1.0213
SANDYBRIDGE	0.0949565	0.0010665	1.35767
CORE2	0.167046	0.003455	2.38839
PENRYN	0.175879	0.00074	2.51468
DUNNINGTON	0.18015	0.00605	2.57574
NEHALEM	0.195515	0.002045	2.79543
COPPERMINE	0.251743	0.00092	3.59937
BANIAS	0.253619	0.00164	3.62619
PRESCOTT	0.253751	0.00249	3.62807
KATMAI	0.256017	0.003495	3.66047
NORTHWOOD	0.25638	0.00475	3.66567
ATOM	0.317082	0.00437	4.53357

resounding meh

mattip · 2024-09-09T05:22:25Z

I am not sure what I am seeing. What are the two results?

theAeon · 2024-09-09T07:23:44Z

First one is an AWS m7a-medium. One zen 4 core.

Second is my personal machine, which is zen 3.

Can't really see a reason to include the ZEN kernel based on either of those.

mattip · 2024-09-09T07:42:47Z

And that is using an openblas from before the shrink?

theAeon · 2024-09-09T11:02:51Z

Hm. If I followed the instructions from the script repository exactly, it would have pulled down latest scipy-openblas, wouldn't it.

BLD: use smaller scipy-openblas builds

9526007

github-actions bot added the 36 - Build Build related PR label Aug 8, 2024

add release note

75ffe91

charris added the 09 - Backport-Candidate PRs tagged should be backported label Aug 9, 2024

charris merged commit 807cd74 into numpy:main Aug 9, 2024

charris mentioned this pull request Aug 9, 2024

BLD: use smaller scipy-openblas builds #27162

Merged

charris removed the 09 - Backport-Candidate PRs tagged should be backported label Aug 9, 2024

mattip deleted the scipy-openblas-0.3.27.44.5 branch May 5, 2025 11:21

Uh oh!

Conversation

mattip commented Aug 8, 2024

Uh oh!

mattip commented Aug 9, 2024

Uh oh!

charris commented Aug 9, 2024

Uh oh!

charris commented Aug 9, 2024

Uh oh!

theAeon commented Sep 8, 2024

Uh oh!

mattip commented Sep 8, 2024

Uh oh!

theAeon commented Sep 8, 2024

Uh oh!

theAeon commented Sep 9, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

theAeon commented Sep 9, 2024

Uh oh!

theAeon commented Sep 9, 2024

Uh oh!

mattip commented Sep 9, 2024

Uh oh!

theAeon commented Sep 9, 2024

Uh oh!

mattip commented Sep 9, 2024

Uh oh!

theAeon commented Sep 9, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

theAeon commented Sep 9, 2024 •

edited

Loading