Implement bytecode callbacks with static, never-modified bytecode #13553

xavierleroy · 2024-10-15T16:15:19Z

This is an alternative to #13549. See this PR for the detailed description of the problem that needs to be fixed.

Like #13549, the present PR avoids modifying bytecode dynamically, and making it thread-local.

Unlike #13549, it preserves the ability to perform a callback on N arguments in one go, instead of having to cut it in slices of 1 to 3 arguments.

ghost

This also makes the failures disappear and looks good to me.

stedolan · 2024-10-16T09:36:13Z

This looks good! This approach does seem cleaner than either modifying bytecode or slicing up callbacks into 3-argument chunks, so I think this is a better fix than #13549.

However, I think there's a bug in the current version: the old code, going through Instruct(APPLY), used to do a goto check_stacks before executing the code from the callback. I think this stack check is important, as I think it's possible to reach caml_callback without much available stack (there's no stack check before Instruct(C_CALLn), so you can enter C code having already consumed part of your stack).

I think there's a simple fix: the interpreter should do a goto check_stacks right before entering the main loop, so that a stack check occurs before any instructions are executed.

shindere · 2024-10-16T12:25:53Z

Stephen Dolan (2024/10/16 02:36 -0700):

I think there's a simple fix: the interpreter should do a `goto check_stacks` right before entering the main loop, so that a stack check occurs before any instructions are executed.

Won't that be too often then? I suppose many instructions do actually not change the stack and it then feels a pity to me to do a check before they are executed, leading to a performance loss. Or am I missing something maybe?

stedolan · 2024-10-16T12:54:45Z

I'm not suggesting a check inside the loop, before every instruction - that would indeed cause performance loss. I'm suggesting a single check, before the loop is entered, which would be once per invocation of caml_bytecode_interpreter.

shindere · 2024-10-16T13:43:24Z

Stephen Dolan (2024/10/16 05:55 -0700):

I'm not suggesting a check inside the loop, before every instruction - that would indeed cause performance loss. I'm suggesting a single check, _before_ the loop is entered, which would be once per invocation of `caml_bytecode_interpreter`.

Ah okay, sorry I misunderstood. And you think such a check would be enough, then?

NickBarnes · 2024-10-16T14:24:09Z

I think this stack check is important, as I think it's possible to reach caml_callback without much available stack (there's no stack check before Instruct(C_CALLn), so you can enter C code having already consumed part of your stack).

On this subject, shouldn't caml_callbackN_exn do a stack limit check before subtracting narg + 4 from sp? Maybe I'm missing something obvious about the bytecode stacks.

xavierleroy · 2024-10-16T14:34:03Z

the interpreter should do a goto check_stacks right before entering the main loop

Well spotted! See the latest commit on this PR.

shouldn't caml_callbackN_exn do a stack limit check before subtracting narg + 4 from sp

Right now we rely on the "red zone" at the bottom of the stack to absorb the arguments. There's an assertion but it's not up to date (it assumes 256 words of red zone while it's only 32 words nowadays). Maybe it would be better to check and resize the stack in caml_callbackN_exn. Will look into it in a couple of days.

NickBarnes · 2024-10-16T14:55:04Z

Ah, I see, Stack_threshold_words ?

xavierleroy · 2024-10-16T17:36:59Z

Yes, Stack_threshold_words.

I pushed an alternative to @stedolan's suggestion, based on @NickBarnes' remark, where caml_callbackN checks for stack space and reallocates the stack if needed. I doubt the test suite exercises this code, though. I'll try to come up with a test later this week.

xavierleroy · 2024-10-18T11:50:01Z

I added a test that triggers the stack resizing, and a Changes entry. This PR is ready for a final review.

ghost · 2024-10-18T12:01:32Z

runtime/callback.c

-  domain_state->current_stack->sp -= narg + 4;
+  /* Ensure there's enough stack space */
+  intnat req = narg + 3 + Stack_threshold_words;
+  if (domain_state->current_stack->sp - req <


This almost calls for an internal version of caml_ensure_stack_capacity, but that can be done later if worth doing.

jmid · 2024-10-22T14:50:55Z

After a good round of testing this afternoon I can confirm that this fixes both #13512 and #13402.
Thank you both!

Support passing the initial values of env and extra_args as parameters to the bytecode interpretation function.

- Use the new `caml_bytecode_interpreter` API to jump straight to the function's code. - Avoid modifying bytecode in place. - Avoid per-thread bytecode. - Register bytecode early and only once. Fixes: ocaml#13402 Fixes: ocaml#13512 Closes: ocaml#13549

... before copying the arguments to the stack and calling the bytecode interpreter.

For bytecode, this tests stack resizing in caml_callbackN.

xavierleroy · 2024-10-23T09:42:13Z

Cleaned up the history. Rebased to fix conflict. Knelt in front of check-typo. Time to merge!

Implement bytecode callbacks with static, never-modified bytecode (cherry picked from commit c3092e7)

Octachron · 2024-10-23T13:30:38Z

Cherry-picked to the 5.2 branch as 71d85e0 in prevision of the upcoming 5.2.1 release.

xavierleroy mentioned this pull request Oct 15, 2024

Fix a nasty bug in the bytecode VM initialisation #13549

Closed

ghost approved these changes Oct 16, 2024

View reviewed changes

dra27 linked an issue Oct 16, 2024 that may be closed by this pull request

GC crashes under bytecode #13512

Closed

xavierleroy added a commit to xavierleroy/ocaml that referenced this pull request Oct 18, 2024

Changes entry for ocaml#13553

4df9b57

xavierleroy added this to the 5.3 milestone Oct 18, 2024

ghost reviewed Oct 18, 2024

View reviewed changes

xavierleroy added 5 commits October 23, 2024 10:05

interp.c: more general API for the bytecode API

bfeceb9

Support passing the initial values of env and extra_args as parameters to the bytecode interpretation function.

Bytecode callbacks: make sure there is enough stack space

d38286f

... before copying the arguments to the stack and calling the bytecode interpreter.

Test callbacks with many arguments

af5e5aa

For bytecode, this tests stack resizing in caml_callbackN.

Changes entry for ocaml#13553

2a750b1

xavierleroy force-pushed the tweak-bytecode-callbacks branch from 4df9b57 to 2a750b1 Compare October 23, 2024 09:16

xavierleroy merged commit c3092e7 into ocaml:trunk Oct 23, 2024

xavierleroy added a commit that referenced this pull request Oct 23, 2024

Merge pull request #13553 from xavierleroy/tweak-bytecode-callbacks

98e8243

Implement bytecode callbacks with static, never-modified bytecode (cherry picked from commit c3092e7)

Octachron pushed a commit that referenced this pull request Oct 23, 2024

Merge pull request #13553 from xavierleroy/tweak-bytecode-callbacks

71d85e0

Implement bytecode callbacks with static, never-modified bytecode (cherry picked from commit c3092e7)

This was referenced Oct 23, 2024

[ocaml5-issue] Segfault in STM Domain.DLS test sequential on bytecode ocaml-multicore/multicoretests#446

Closed

[ocaml5-issue] segfault in 32-bit mode during neg_tests/lin_tests_domain.ml ocaml-multicore/multicoretests#440

Closed

jmid mentioned this pull request Feb 17, 2025

The GC stress test can trigger a fatal error in malloc ocaml-multicore/multicoretests#532

Open

Implement bytecode callbacks with static, never-modified bytecode #13553

Implement bytecode callbacks with static, never-modified bytecode #13553

Uh oh!

Conversation

xavierleroy commented Oct 15, 2024

Uh oh!

ghost left a comment

Choose a reason for hiding this comment

Uh oh!

stedolan commented Oct 16, 2024

Uh oh!

shindere commented Oct 16, 2024 via email

Uh oh!

stedolan commented Oct 16, 2024

Uh oh!

shindere commented Oct 16, 2024 via email

Uh oh!

NickBarnes commented Oct 16, 2024

Uh oh!

xavierleroy commented Oct 16, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

NickBarnes commented Oct 16, 2024

Uh oh!

xavierleroy commented Oct 16, 2024

Uh oh!

xavierleroy commented Oct 18, 2024

Uh oh!

ghost Oct 18, 2024

Choose a reason for hiding this comment

Uh oh!

jmid commented Oct 22, 2024

Uh oh!

xavierleroy commented Oct 23, 2024

Uh oh!

Octachron commented Oct 23, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

xavierleroy commented Oct 16, 2024 •

edited

Loading