ARM: issue with constant islands and large switch instructions#1022
ARM: issue with constant islands and large switch instructions#1022xavierleroy wants to merge 2 commits intotrunkfrom
Conversation
… a new constant island As reported in GPR#994, the previous heuristic for starting a new constant island could start it too late if the instruction just emitted produces a lot of asm code. This is typically the case for Lswitch instruction with a large jump table. This commit checks for new constant island before emitting the instruction, and takes into account (an upper bound for) the size of the instruction's code. Also: report a more accurate size for the encoding of Lswitch in Thumb2 mode. The one computed before overestimated the size most of the time, but could also underestimate in some cases.
|
We need to decide quicky whether this goes in 4.05 or is kept for future work. @mshinwell : an opinion? |
|
I'm planning to read this one soon, so we should be able to get this into 4.05. |
| emit_all (ninstr' + n) (has_fallthrough i.desc) i.next | ||
| end | ||
|
|
||
|
|
There was a problem hiding this comment.
Unrelated whitespace change
| end; | ||
| `{emit_label !tailrec_entry_point}:\n`; | ||
| emit_all 0 fundecl.fun_body; | ||
| emit_all 0 true fundecl.fun_body; |
There was a problem hiding this comment.
I think labelling ~fallthrough would be clearer.
asmcomp/arm/emit.mlp
Outdated
| emit_all 0 i.next | ||
| end else | ||
| emit_all ninstr' i.next | ||
| let limit = limit - !num_literals - max_instruction_size i in |
There was a problem hiding this comment.
The name num_literals is confusing since it sounds like, well, the number of literals---but actually it's the number of words they occupy. Could we rename this?
asmcomp/arm/emit.mlp
Outdated
| then 1 + (Array.length jumptbl + 1) / 2 + Array.length jumptbl | ||
| else 2 + Array.length jumptbl | ||
| | _ -> | ||
| 4 (* conservative upper bound *) |
There was a problem hiding this comment.
This number doesn't appear to be conservative, e.g. for Iextcall. Could you double-check all the cases?
Maybe there should be an assertion in emit_all that emit_instr doesn't return a number greater than what max_instruction_size returned.
(Aside: problems like this make me increasingly think that the right way to do code emitters is via a DSL, where absolutely everything including layout is described; then it's easier to do things such as measuring.)
…ze_literals Continuation of GPR#1022 and commit 19757e0.
|
I fixed |
|
The failing Travis check is unrelated (camlp4 issue). Is this good to go? |
|
I still reckon there should be an assertion that the max instruction size wasn't an underestimate, but if you don't want to do that, OK. |
|
Squashed and merged in trunk, commit 07b28ed . |
b11eea1 flambda-backend: Introduce Import_info (ocaml#1036) bc5b135 flambda-backend: Fix `ocamlobjinfo` on flambda2 .cmx files (ocaml#1029) c8babbd flambda-backend: Compilation_unit optimisations (ocaml#1035) e8d3e22 flambda-backend: Use 4.14.0 opam switch for building (includes upgrading ocamlformat to 0.24.1) (ocaml#1030) eb14a86 flambda-backend: Port PR81 from ocaml-jst (ocaml#1024) 131bc12 flambda-backend: Merge ocaml-jst 2022-12-13 (ocaml#1022) 06c189a flambda-backend: Make stack allocation the default (ocaml#1013) 98debd5 flambda-backend: Initial support for value slots not of value kind (ocaml#946) deb1714 flambda-backend: Add is_last flag to closinfo words (ocaml#938) d07fce1 flambda-backend: Disable poll insertion in Configure (ocaml#967) 0f1ce0e flambda-backend: Regenerate ocaml/configure autoconf 2.69 (instead of 2.71) (ocaml#1012) 27132d8 flambda-backend: Fix for spurious typing error related to expanding through functor arguments (ocaml#997) 724fb68 flambda-backend: Use `Compilation_unit.t` instead of `Ident.t` for globals (ocaml#871) 396d5b8 flambda-backend: Add a test for frametable setup in natdynlinked libraries (ocaml#983) b73ab12 flambda-backend: Fix invocation of `caml_shared_startup` in native dynlink (ocaml#980) 7c7d75a flambda-backend: Fix split_default_wrapper which did not trigger anymore with flambda2 (ocaml#970) 8fb75bd flambda-backend: Port ocaml#11727 and ocaml#11732 (ocaml#965) fdb7987 flambda-backend: Fix include functor issue after 4.14 merge. (ocaml#948) 9745cdb flambda-backend: Print -dprofile/-dtimings output to stdout like 4.12 (ocaml#943) 5f51f21 flambda-backend: Merge pull request ocaml#932 from mshinwell/4.14-upgrade 841687d flambda-backend: Run make alldepend in ocaml/ (ocaml#936) 72a7658 flambda-backend: Remove reformatting changes only in dynlink/dune (preserving PR889 and adjusting to minimise diff) 6d758cd flambda-backend: Revert whitespace changes in dune files, to match upstream c86bf6e flambda-backend: Remove duplicate tests for polling 971dbeb flambda-backend: Testsuite fixes 32f8356 flambda-backend: Topeval fix for symbols patch befea01 flambda-backend: Compilation fixes / rectify merge faults a84543f flambda-backend: Merge ocaml-jst 8e65056 flambda-backend: Merge ocaml-jst 4d70045 flambda-backend: Remove filename from system frametable (amd64) (ocaml#920) 5e57b7d flambda-backend: Bugfix for runtime frame_descr logic for C frames (ocaml#918) 6423d5e flambda-backend: Merge pull request ocaml#914 from mshinwell/merge-ocaml-jst-2022-10-24 ead605c flambda-backend: Add a missing Extract_exception (ocaml#916) c8f1481 flambda-backend: Resolve conflicts and add specialise/specialised attributes to Builtin_attributes cf4d0d3 flambda-backend: Merge fixes (ocaml#21) c2f742f flambda-backend: Re-enable some tests for Flambda2 (ocaml#881) 3d38d13 flambda-backend: Long frames in frametable (ocaml#797) 85aec7b flambda-backend: Add loop attribute to Builtin_attributes c0f16e3 flambda-backend: Compilation fixes 90dea23 flambda-backend: Merge flambda-backend/main 5acc6ea flambda-backend: Fixes after merge e501946 flambda-backend: Merge ocaml-jst 115083b flambda-backend: Merge ocaml-jst 9943b2e flambda-backend: Revert "Revert "Transform tail-recursive functions into recursive continuations (ocaml#893)"" (ocaml#909) ce339f1 flambda-backend: Fix alloc modes and call kinds for overapplications (ocaml#902) e6a317c flambda-backend: Revert "Transform tail-recursive functions into recursive continuations (ocaml#893)" 853c488 flambda-backend: Transform tail-recursive functions into recursive continuations (ocaml#893) 5a977e4 flambda-backend: Fix missing End_region primitives on switch arms (ocaml#898) 7fa7f9d flambda-backend: Add missing dependencies to Dune files (ocaml#889) 3cd36f0 flambda-backend: Have Lambda `Pgetglobal` and `Psetglobal` take `Compilation_unit.t` (ocaml#896) 7565915 flambda-backend: [@poll error] attribute (ocaml#745) 9eb9448 flambda-backend: Backport the main safepoints PRs (ocaml#740) 689bdda flambda-backend: Add strict mode for ocamldep (ocaml#892) git-subtree-dir: ocaml git-subtree-split: b11eea1
This is an alternate fix for the issue reported in #994 .
Additionally, it fixes another potential issue with switch instructions whereas the size of the instruction encoding could have been underestimated.