-
Notifications
You must be signed in to change notification settings - Fork 1.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Compiled-in page size makes jemalloc binaries unportable #467
Comments
You can use e.g. |
I think in this case, Rust would like to have binaries that run everywhere, not subject to the configuration whims of different distros. So Is it feasible to run jemalloc compiled with the 64k page size even on systems that are only 4k? I guess these I'm not surprised that dynamic page size reduces compiler optimizations. Perhaps some of that can be mitigated with different expressions, e.g. instead of page mask, use a shr+shl combo to strip bits. But I'm sure that's a lot of code to deal with. FWIW a single system could have different page sizes with different kernels -- e.g. a kernel developer who normally runs the distro kernel, but sometimes reboots into their own build. Obviously they would not want to swap out all userspace each time. But maybe mixed pages are impractical for other reasons. There may be other programs with static page sizes too, and that's normally just fine in a controlled distro. |
What about |
|
Sounds like jemalloc is broken on systems which differ in page size than the host it was compiled on (unless an option was passed). This unfortunately reduces the portability of binaries created and can often make Rust segfault by default. For now let's patch over this by disabling jemalloc until we can figure out a better solution. Closes rust-lang#36994 Closes rust-lang#37320 cc jemalloc/jemalloc#467
Disable jemalloc on aarch64/powerpc Sounds like jemalloc is broken on systems which differ in page size than the host it was compiled on (unless an option was passed). This unfortunately reduces the portability of binaries created and can often make Rust segfault by default. For now let's patch over this by disabling jemalloc until we can figure out a better solution. Closes #36994 Closes #37320 cc jemalloc/jemalloc#467
Disable jemalloc on aarch64/powerpc Sounds like jemalloc is broken on systems which differ in page size than the host it was compiled on (unless an option was passed). This unfortunately reduces the portability of binaries created and can often make Rust segfault by default. For now let's patch over this by disabling jemalloc until we can figure out a better solution. Closes #36994 Closes #37320 cc jemalloc/jemalloc#467
More than a cross-distro problem, it can cause problems in a single distro, where the user switches the page size it their kernel. Now, something that /could/ work, is if one configured --with-lg-pages for the biggest you can expect a kernel to use. The problem is that it then brakes |
How expensive would it be (in code bloat, etc) to have multiple copies of the code compiled for different constants (to still get the benefit of compiler optimizations which are aware of the page size)? So you basically compile three different versions of the code - the 4K page version, the 64K page version, and, if you're being really careful, the X page version (where X must be determined at runtime). Then, at init time when a process starts, you decide which one you're going to use. |
All mappings continue to be PAGE-aligned, even if the system page size is smaller. This change is primarily intended to provide a mechanism for supporting multiple page sizes with the same binary; smaller page sizes work better in conjunction with jemalloc's design. This resolves jemalloc#467.
All mappings continue to be PAGE-aligned, even if the system page size is smaller. This change is primarily intended to provide a mechanism for supporting multiple page sizes with the same binary; smaller page sizes work better in conjunction with jemalloc's design. This resolves jemalloc#467.
Page size is baked into various data structures at compile time, so we would have to make substantial changes to directly support multiple page sizes at run time. However, it is possible to support a system page size smaller than the page size that is baked in. #769 implements such support. |
Nice! |
@marcan IMO this comment is abusive. As much as I like your work (I even sponsor you), I don't think it's OK to insinuate malevolence. |
As far as I understand, on ARM, assuming the compile-time page size is equivalent to the runtime one is exactly as non-portable as doing That is to say, in any scenario where a non-64K page size is hard-coded, architecturally speaking, I might not quite call this active sabotage, but it's a pretty simple fact that essentially doing |
jemalloc may have a problem on ARM. See also: jemalloc/jemalloc#467
I can understand the compiler optimization. But the pagesize is always a pow2 so you just do the masking for div/mod manually as the compiler would. (Non pow2 numbers need an additional mad AFAIK and for more info on this see libdivide) I don’t think it’s malicious just a bit too much focus on performance got in the way of practicality on ARM. |
Linux has now boot time dynamic page size switch. That makes page size a runtime selection, not something that is fixed, even for one OS distribution. |
jemalloc's default configuration is broken on arm64 and causes runtime failures on machines with non-4k page sizes. See jemalloc/jemalloc#467 Change-Id: Ia243d6e43fcc9eaad893e51d6fa90febd7c0f344 Reviewed-on: https://gn-review.googlesource.com/c/gn/+/18300 Commit-Queue: Dirk Pranke <[email protected]> Commit-Queue: Takuto Ikuta <[email protected]> Reviewed-by: Takuto Ikuta <[email protected]> Reviewed-by: Dirk Pranke <[email protected]>
Hopefully makes the build deterministic regardless of the build host jemalloc/jemalloc#467 https://sources.debian.org/src/jemalloc/5.3.0-3/debian/rules/ NixOS#380852
Hopefully makes the build deterministic regardless of the build host jemalloc/jemalloc#467 https://sources.debian.org/src/jemalloc/5.3.0-3/debian/rules/ NixOS#380852
Hopefully makes the build deterministic regardless of the build host jemalloc/jemalloc#467 https://sources.debian.org/src/jemalloc/5.3.0-3/debian/rules/ NixOS#380852
Hopefully makes the build deterministic regardless of the build host jemalloc/jemalloc#467 https://sources.debian.org/src/jemalloc/5.3.0-3/debian/rules/ NixOS#380852
Hopefully makes the build deterministic regardless of the build host jemalloc/jemalloc#467 https://sources.debian.org/src/jemalloc/5.3.0-3/debian/rules/ NixOS#380852
Working towards making the build deterministic regardless of the build host jemalloc/jemalloc#467 https://sources.debian.org/src/jemalloc/5.3.0-3/debian/rules/ NixOS#380852
Working towards making the build deterministic regardless of the build host jemalloc/jemalloc#467 https://sources.debian.org/src/jemalloc/5.3.0-3/debian/rules/ NixOS#380852
Working towards making the build deterministic regardless of the build host jemalloc/jemalloc#467 https://sources.debian.org/src/jemalloc/5.3.0-3/debian/rules/ NixOS#380852
Working towards making the build deterministic regardless of the build host Page size: jemalloc/jemalloc#467 https://sources.debian.org/src/jemalloc/5.3.0-3/debian/rules/ NixOS#380852 Darwin: Similar to NixOS#394620 LG_VADDR: jemalloc/jemalloc#1997
Working towards making the build deterministic regardless of the build host Page size: jemalloc/jemalloc#467 https://sources.debian.org/src/jemalloc/5.3.0-3/debian/rules/ NixOS#380852 Darwin: Similar to NixOS#394620 LG_VADDR: jemalloc/jemalloc#1997
I see that configure uses
sysconf(_SC_PAGESIZE)
at compile time to determine the page size. However, this can vary depending on kernel configuration, so a jemalloc compiled on one machine may have issues running on some other machine. For instance, Debian aarch64 uses 4k pages, but Fedora uses 64k, so a Debian jemalloc effectively can't run correctly on Fedora.The concrete issue I encountered was in trying to bootstrap Rust aarch64 on Fedora. Because Rust is self-hosting, I have to use upstream binaries to get started. I was getting strange crashes, and when I tried strace I saw this:
So it seems clearly expecting to have 4k pages. Most of the
madvise
calls were not on the real 64k page boundaries, so they getEINVAL
. A few of them happen to align, so they succeed -- but the requested length was less than 64k and the kernel rounds up! Thus it will dump more memory than intended, and I suspect the SEGV addr=0x8 is a deref of a nulled-pointer from the cleared page.I have managed to get around this with:
export MALLOC_CONF=lg_dirty_mult:-1
. But I think it would be better if jemalloc read the page size at runtime, and FWIW Firefox's forked "mozjemalloc" does exactly that.The text was updated successfully, but these errors were encountered: