Fix float array specialisation in flambda by lpw25 · Pull Request #1169 · ocaml/ocaml

lpw25 · 2017-05-12T10:24:41Z

In 4.04 flambda tries to optimise array set operations based on the kind of value being put into the array. This differs from how these operations actually behave -- which is always to look at the kind of the array itself to decide whether the set operation is for a Double_array or not. I think this constitutes a bug as quite a bit of high performance code relies on it being safe to place a float into a normal array.

This PR adjusts the optimisation to only consider the kind of the array and not the value being stored into the array. I've made it against the 4.05 branch since this is a bug.

alainfrisch · 2017-05-12T10:29:01Z

I think this constitutes a bug as quite a bit of high performance code relies on it being safe to place a float into a normal array.

Can you elaborate on that part? Do you mean storing a float in boxed form in a regular array? How could this be done (without using unsafe tricks or the C API)?

lpw25 · 2017-05-12T10:38:26Z

Do you mean storing a float in boxed form in a regular array?

Yes

How could this be done (without using unsafe tricks or the C API)?

It can't, but in practice it is done using unsafe tricks and up until now that has been perfectly safe.

alainfrisch · 2017-05-12T11:57:15Z

in practice it is done using unsafe tricks and up until now that has been perfectly safe.

If you have an array whose static type is "float array", it is unsound for it to be represented as a regular array (since accessing the array will not perform a dynamic check for its representation). I guess you are referring to code that encapsulate some local data structure and is safe for the "client code".

Are you aware of such tricks used in publicly available libraries, or are you referring to some code internal at Jane Street?

lpw25 · 2017-05-12T13:06:03Z

Here's one from core_kernel:

https://github.com/janestreet/core_kernel/blob/master/src/obj_array.ml

There are others in core_kernel as well as in our internal code.

bluddy · 2017-05-12T15:26:14Z

Doesn't look like a bug to me. Looks like an unsafe trick that needs to be changed, since that's implicit in using unsafe tricks. Right?

stedolan · 2017-05-15T09:02:22Z

In general, it's not a bug if code using unsafe Obj tricks breaks in a new version - that's the price of using Obj.

In this particular case, though, I agree with @lpw25 that the behaviour should be changed: the runtime is careful to only do the float array optimisation based on the tag of the array, never on the tag of the element, and the optimiser should behave the same way.

mshinwell · 2017-05-15T10:21:22Z

Yeah, I think this was a mistake (partially on my part). Leo's patch appears to be ok.

chambart · 2017-05-16T16:51:51Z

I'm not completely certain than we want to remove this. It mainly depend on the incentive we want on the float array hack. I'm not certain whether I was responsible for this, but I knew the behavior and willingly kept it that way.

This is I think an optimization that triggers quite a lot. The idea is that there are seldom any information about the array kind because arrays are, in practice, always created with externals (this is the reason why the fatal error in the current code almost never trigger on incorrect cases). But in numerical code values stored in the array on the other hand can often be known as float. For instance after inlining Array.map (fun x -> x+.1.) can be optimized when we consider the element stored inside the array.

The kind of code you are suggesting is highly unsafe and already needs some deep understanding of the compiler to write properly. Assuming that this can be updated does not strike me as a major problem.
There is now a way to avoid propagating information and it should be used in this particular case like in https://github.com/chambart/ocaml-nullable-array/blob/master/lib/nullable_array.ml#L161. If we want to really provide the non-float array functionality, we should export the Psetfield_computed primitive.

In fact I would rather argue that this code is missing a case to prevent the use of the dynamic version when the argument is known to be a block.

lpw25 · 2017-05-16T17:29:24Z

It mainly depend on the incentive we want on the float array hack

Well the reason this unsafe code is needed in the first place is to avoid the cost of the float array hack by creating genuine uniform arrays. I'm not particularly keen on optimising generic code using float arrays at the cost of making it very difficult to work around them.

The kind of code you are suggesting is highly unsafe and already needs some deep understanding of the compiler to write properly.

I disagree that a deep understanding is required, this is a fairly simple property that has always previously been guaranteed by OCaml.

Assuming that this can be updated

I have my doubts about the ease of finding all these cases and updating them. It is certainly the case that most of Janestreet's libraries cannot safely be used with 4.04+flambda because of this change. I'll try and find all the cases in our code-base, possibly if I make the optimisation into an error I can reliably find them.

If we want to really provide the non-float array functionality, we should export the Psetfield_computed primitive

I think that would definitely be preferable to using opaque_identity as doesn't prohibit any other optimisations. Probably we should use Psetfield_computed for %obj_set_field and Pfield_computed for %obj_field so that they are used for Obj.field and Obj.set_field. They are already not supposed to be used with Doule_arrays -- that's what Obj.double_field and Obj.set_double_field are for. Whilst we are at it an proper builtin primitive for Obj.new_block would be useful so that we can create sized blocks efficiently without using the array primitives.

damiendoligez · 2017-05-29T13:56:39Z

@lpw25 Have you made progress on finding all cases in JS code?

@chambart Just because something is not documented doesn't mean that we go around and break existing code without carefully considering the matter. So it comes down to the usefulness of this particular optimization vs. the inconvenience of breaking lots of code. I don't have a firm opinion yet, but it needs to be argued either way.

chambart · 2017-05-29T15:33:01Z

@damiendoligez in that case, I think it is quite a useful one, especially if we consider propagating more type information to the backend, like #1029

Also potential breakage must be quite well contained. Abusing Obj for that needed to be done extremely carefully, even without potential additional float array specialization. For instance not being bitten by GC specialization problems requires careful type annotation on every use:

type elt = Int | Block of int
type 'a t = elt array
let set (t:'a t) field (value:'a) = Array.set t field (Obj.magic value:elt)

There is no way that could have been written in some non clearly separated and abstracted module, hence quite easy to track. From what I have seen, there is probably only that place (the whole file) in Core that needs fixing. And I didn't encounter any other library doing anything similar (but of course I don't know every library).

damiendoligez · 2017-06-14T12:45:55Z

@lpw25 @mshinwell Do you have any info on how hard it would be to find and fix all occurrences in JS code?

lpw25 · 2017-06-14T13:16:17Z

I've been working through them for the last couple of days. It's a pain but it's not too bad. If we were discussing adding this optimisation for 4.05 then I would suggest holding off, but since 4.04 already contains it we might as well leave it in 4.05.

…aml#1191) This reverts commit 5123d61.

* Apply ocamlformat to the playground code * Add a Makefile target instead * ocamlformat.0.25.1 * update make deps * Update ocamlformat version on CI

Fix float array specialisation in flambda

e1a515f

mshinwell added the bug label May 12, 2017

mshinwell assigned chambart and unassigned chambart May 12, 2017

mshinwell requested a review from chambart May 12, 2017 10:39

mshinwell added this to the 4.05.0 milestone May 15, 2017

lpw25 closed this Jun 14, 2017

damiendoligez mentioned this pull request Aug 18, 2017

Configure-time option for float array optimization. #1294

Merged

lpw25 mentioned this pull request Oct 10, 2017

Fix fatal error when compiling objects with Flambda #965

Merged

stedolan pushed a commit to stedolan/ocaml that referenced this pull request Mar 21, 2023

Revert "Add a proper top and bottom layout" (ocaml#1169)

5123d61

stedolan pushed a commit to stedolan/ocaml that referenced this pull request Mar 21, 2023

Revert "Revert "Add a proper top and bottom layout" (ocaml#1169)" (oc…

4e08295

…aml#1191) This reverts commit 5123d61.

Conversation

lpw25 commented May 12, 2017

Uh oh!

alainfrisch commented May 12, 2017

Uh oh!

lpw25 commented May 12, 2017

Uh oh!

alainfrisch commented May 12, 2017

Uh oh!

lpw25 commented May 12, 2017

Uh oh!

bluddy commented May 12, 2017

Uh oh!

stedolan commented May 15, 2017

Uh oh!

mshinwell commented May 15, 2017

Uh oh!

chambart commented May 16, 2017

Uh oh!

lpw25 commented May 16, 2017

Uh oh!

damiendoligez commented May 29, 2017

Uh oh!

chambart commented May 29, 2017

Uh oh!

damiendoligez commented Jun 14, 2017

Uh oh!

lpw25 commented Jun 14, 2017

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants