Replace hypot(x, y) with sqrt(x*x + y*y) for GPU compatibility by siddharthabishnu · Pull Request #48 · CliMA/CubedSphere.jl

siddharthabishnu · 2025-10-17T20:44:53Z

This change replaces calls to hypot(x, y) with sqrt(x*x + y*y) to improve GPU compatibility.

The hypot function was lowering to the CUDA symbol __nv_hypot, which caused JIT compilation failures in Oceananigans tests.

Using explicit multiplication avoids external libdevice dependencies and ensures portability across GPU backends.

Added the script 'elliptic_quasi_conformal_cubed_sphere_grid.jl' in the 'src' directory, which leverages elliptic grid generation techniques to enlarge corner cells of the conformal cubed sphere while preserving near-orthogonality. This potentially offers a solution to both computational intensity and grid alignment concerns.

Refined the quasi-conformal cubed sphere grid using the Ensemble Kalman Inversion (EKI) method. EKI, an advanced derivative-free data assimilation technique, iteratively adjusts model parameters to optimize performance and accuracy, based on the comparison of model outputs with observational data. In our application, EKI is employed to optimize the parameters of elliptic grid generation, aiming to enhance the grid's quality. This improvement is measured by steering three key grid diagnostics---orthogonality, isotropy, and uniformity of cell sizes---towards their ideal values representing the “observational data” in our optimization process.

…-mapping

Implement Haversine formula to compute spherical distance.

navidcy · 2025-10-17T23:24:13Z

Two questions:

Is x*x different from x^2?
I don't understand the issue here... I thought these methods are called on elements of the grid, thus x, y, z are numbers.

siddharthabishnu · 2025-10-17T23:44:09Z

Two questions:

Is x*x different from x^2?

I don't understand the issue here... I thought these methods are called on elements of the grid, thus x, y, z are numbers.

I believe x*x is faster than x^2 on the GPU, which is the main reason I chose to use it.
The issue itself isn’t here but in Oceananigans, which throws the following error:

JIT session error: Symbols not found: [ __nv_hypot ] lookupError Failed to materialize symbols: { (enzymejitdl_23, { entry }) } [2025/10/17 20:03:20.897] ERROR Compilation failed, MLIR module written to /tmp/reactant_cZpeeZ/module_000_I6HH_post_all_pm.mlir -@-> /var/lib/buildkite-agent/.julia-oceananigans/packages/Reactant/IgTfV/src/mlir/IR/Pass.jl:119 Reactanigans unit tests: Error During Test at /var/lib/buildkite-agent/Oceananigans.jl-26323/test/test_reactant.jl:64 Got exception outside of a @test "failed to run pass manager on module"

On the GPU, the hypot function lowers to the CUDA symbol __nv_hypot, which the Reactant/XLA toolchain isn’t finding. To resolve this, I replaced it with a GPU-safe equivalent that uses sqrt, which avoids the external CUDA dependency.

navidcy · 2025-10-18T04:02:59Z

are we missing an Adapt statement here?

cc @simone-silvestri, @glwagner

simone-silvestri · 2025-10-20T17:22:29Z

I don't think so, it looks like an issue with Reactant/XLA. I guess if we can use a simpler function that does the same thing that's ok.

siddharthabishnu added 30 commits February 14, 2024 18:33

Create function to make cubed sphere plots

04582f2

Non-uniform conformal map --> enlarge corner cells

0d35cda

Make separate cubed sphere plots in 2D and 3D

0db4e64

Update Project.toml

27ba1a8

Reorganize scripts and update Project.toml

8b9b72b

Merge main into non-uniform-conformal-mapping

58e84ec

Modify plot titles and output filenames

a42c8af

Adjust spacing

b31025d

Don't compute minimum Δx vs resolution by default

6b450df

Access arrays in memory-aligned column-major order

6744328

Reduce spacing

55eef6b

Take spherical surface into consideration

78076c7

Minor modification

5582502

Update Project.toml

149c5cd

Merge remote-tracking branch 'origin/main' into non-uniform-conformal…

82a5753

…-mapping

Improve conformal cubed sphere visualization

cd66886

Split code into multiple files and add docstrings

e339d63

Update .gitignore

b07f035

Remove unnecessary line breaks

e0f660f

Add functions to visualize cubed sphere grids

a1aa733

Incorporate Navid’s suggestion

b5e5e26

Implement Haversine formula to compute spherical distance.

Update .gitignore

4e8af28

Add spherical coordinate conversion functions

dcba0c0

Add Distances.jl as dependency

3d2405d

Compute signed turning angle on the sphere

ddf5765

Add StaticArrays as a project dependency

46ea72e

atan2 --> atan

ccfe722

Merge duplicate functions

54dd8df

navidcy and others added 19 commits October 9, 2025 08:38

some utils

138105c

it's ref

4885bdb

no need to export compute_deviation_from_isotropy; seems bit specialized

fb70137

first X, Y, Z, then x, y

57d5d9d

no need to distinguish r=1 and r>1; they are all distances

a043c94

another example for isotropy

f73b8ca

don't stress physical area; all areas are areas

6084798

cleaner info msgs

455354e

back to x, y, X, Y, Z

8d91b25

N_iterations

34934db

N_iterations

fb16e78

N_iterations

a6fcf40

updates

2e1b0d2

cleaner info msgs

194d033

cleaner info msgs

0180baa

cleaner info msgs

240401b

hypot(x, y) --> sqrt(x*x + y*y)

1a313fc

Merge main into non-uniform-conformal-mapping

e8b5527

Add latitude bounds check in lat_lon_to_cartesian

d697527

siddharthabishnu force-pushed the sb/non-uniform-conformal-mapping branch from f9ef5af to d697527 Compare October 17, 2025 21:40

Fix doctest output in cartesian_to_lat_lon

e6c49d3

siddharthabishnu requested a review from navidcy October 17, 2025 22:40

simone-silvestri approved these changes Oct 20, 2025

View reviewed changes

siddharthabishnu merged commit a3fea33 into main Oct 20, 2025
10 checks passed

siddharthabishnu mentioned this pull request Oct 20, 2025

Bump version from 0.3.3 to 0.3.4 #49

Merged

siddharthabishnu mentioned this pull request Jan 21, 2026

Remove duplicate spherical geometry functions CliMA/Oceananigans.jl#5183

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Replace hypot(x, y) with sqrt(xx + yy) for GPU compatibility#48

Replace hypot(x, y) with sqrt(xx + yy) for GPU compatibility#48
siddharthabishnu merged 78 commits intomainfrom
sb/non-uniform-conformal-mapping

siddharthabishnu commented Oct 17, 2025 •

edited

Loading

Uh oh!

navidcy commented Oct 17, 2025

Uh oh!

siddharthabishnu commented Oct 17, 2025 •

edited

Loading

Uh oh!

navidcy commented Oct 18, 2025

Uh oh!

simone-silvestri commented Oct 20, 2025 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

siddharthabishnu commented Oct 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

navidcy commented Oct 17, 2025

Uh oh!

siddharthabishnu commented Oct 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

navidcy commented Oct 18, 2025

Uh oh!

simone-silvestri commented Oct 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

siddharthabishnu commented Oct 17, 2025 •

edited

Loading

siddharthabishnu commented Oct 17, 2025 •

edited

Loading

simone-silvestri commented Oct 20, 2025 •

edited

Loading