You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Auto merge of #116352 - Kobzol:rustc-driver-bolt, r=<try>
Optimize `librustc_driver.so` with BOLT
This PR optimizes `librustc_driver.so` on 64-bit Linux CI with BOLT.
### Code
One thing that's not clear yet to me how to resolve is how to best pass a linker flag that we need for BOLT (the second commit). It is currently passed unconditionally, which is not a good idea. We somehow have to:
1) Only pass it when we actually plan to use BOLT. How to best do that? `config.toml` entry? Environment variable? CLI flag for bootstrap? BOLT optimization is done by `opt-dist`, therefore bootstrap doesn't know about it by default.
2) Only pass it to `librustc_driver.so` (see performance below).
Some discussion of this flag already happened on [Zulip](https://rust-lang.zulipchat.com/#narrow/stream/326414-t-infra.2Fbootstrap/topic/Adding.20a.20one-off.20linker.20flag).
### Performance
Latest perf. results can be found [here](#102487 (comment)). Note that instruction counts are not very interesting here, there are only regressions on hello world programs. Probably caused by a larger C++ libstd (?).
Summary:
- ✔️ `-1.8%` mean improvement in cycle counts across many primary benchmarks.
- ✔️ `-1.8%` mean Max-RSS improvement.
- ✖️ 34 MiB (+48%) artifact size regression of `librustc_driver.so`.
- This is caused by building `librustc_driver.so` with relocations (which are required for BOLT). Hopefully, it will be [fixed](https://discourse.llvm.org/t/bolt-rfc-a-new-mode-to-rewrite-entire-binary/68674) in the future with BOLT improvements, but now trying to reduce this size increase is [tricky](#114649).
- Note that the size of this file was recently reduced in #115554 by pretty much the same amount (33 MiB). So the size after this PR is basically the same as it was for the last ~year.
- ✖️ 1.4 MiB (+53%) artifact size regression of `rustc`.
- This is annoying and pretty much unnecessary. It is caused by the way relocations are currently applied in this PR, because they are applied both to `librustc_driver.so` (where they are needed) and for `rustc` (where they aren't needed), since both are built with a single cargo invocation. We might need e.g. some tricks in the bootstrap `rustc` shim to only apply the relocation flag for the shared library and not for `rustc`.
### CI time
CI (try build) got slower by ~5 minutes, which is fine, IMO. It can be further reduced by running LLVM and `librustc_driver` BOLT profile gathering at the same time (now they are gathered separately for LLVM and `librustc_driver`).
r? `@Mark-Simulacrum`
Also CC `@onur-ozkan,` primarily for the bootstrap linker flag issue.
complete-c x.py -n"__fish_seen_subcommand_from doc"-l llvm-profile-generate -d'generate PGO profile with llvm built for rustc'
239
+
complete-c x.py -n"__fish_seen_subcommand_from doc"-l enable-bolt-settings -d'Enable BOLT link flags'
233
240
complete-c x.py -n"__fish_seen_subcommand_from doc"-s h -lhelp-d'Print help (see more with \'--help\')'
234
241
complete-c x.py -n"__fish_seen_subcommand_from test"-l skip -d'skips tests matching SUBSTRING, if supported by test tool. May be passed multiple times'-r-F
235
242
complete-c x.py -n"__fish_seen_subcommand_from test"-ltest-args -d'extra arguments to be passed for the test tool being used (e.g. libtest, compiletest or rustdoc)'-r
complete-c x.py -n"__fish_seen_subcommand_from bench"-l llvm-profile-generate -d'generate PGO profile with llvm built for rustc'
315
+
complete-c x.py -n"__fish_seen_subcommand_from bench"-l enable-bolt-settings -d'Enable BOLT link flags'
307
316
complete-c x.py -n"__fish_seen_subcommand_from bench"-s h -lhelp-d'Print help'
308
317
complete-c x.py -n"__fish_seen_subcommand_from clean"-l stage -d'Clean a specific stage without touching other artifacts. By default, every stage is cleaned if this option is not used'-r
309
318
complete-c x.py -n"__fish_seen_subcommand_from clean"-l config -d'TOML configuration file for build'-r-F
0 commit comments