Skip to content

Commit 739b7ef

Browse files
authored
Unrolled build for rust-lang#135481
Rollup merge of rust-lang#135481 - Zalathar:node-flow, r=oli-obk coverage: Completely overhaul counter assignment, using node-flow graphs The existing code for choosing where to put physical counter-increments gets the job done, but is very ad-hoc and hard to modify without introducing tricky regressions. This PR replaces all of that with a more principled approach, based on the algorithm described in "Optimal measurement points for program frequency counts" (Knuth & Stevenson, 1973). --- We start by ensuring that our graph has “balanced flow”, i.e. each node's flow (execution count) is equal to the sum of all its in-edge flows, and equal to the sum of all its out-edge flows. That isn't naturally true of control-flow graphs, so we introduce a wrapper type `BalancedFlowGraph` to fix that by introducing synthetic nodes and edges as needed. Once our graph has balanced flow, the next step is to create another view of that graph in which each node's successors have all been merged into one “supernode”. Consequently, each node's out-edges can be coalesced into a single out-edge to one of those supernodes. Because of the balanced-flow property, the flow of that coalesced edge is equal to the flow of the original node. Having expressed all of our node flows as edge flows, we can then analyze node flows using techniques for analyzing edge flows. We incrementally build a spanning tree over the merged supernodes, such that each new edge in the spanning tree represents a node whose flow can be computed from that of other nodes. When this is done, we end up with a list of “counter terms” for each node, describing which nodes need physical counters, and how the remaining nodes can have their flow calculated by adding and subtracting those physical counters. --- The re-blessed coverage tests show that this results in modest or major improvements for our test programs. Some tests need fewer physical counters, some tests need fewer expression nodes for the same number of physical counters, and some tests show striking reductions in both.
2 parents 99db273 + 6eabf03 commit 739b7ef

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

56 files changed

+2040
-1967
lines changed

compiler/rustc_data_structures/src/graph/iterate/mod.rs

+10
Original file line numberDiff line numberDiff line change
@@ -125,6 +125,16 @@ where
125125
pub fn visited(&self, node: G::Node) -> bool {
126126
self.visited.contains(node)
127127
}
128+
129+
/// Returns a reference to the set of nodes that have been visited, with
130+
/// the same caveats as [`Self::visited`].
131+
///
132+
/// When incorporating the visited nodes into another bitset, using bulk
133+
/// operations like `union` or `intersect` can be more efficient than
134+
/// processing each node individually.
135+
pub fn visited_set(&self) -> &DenseBitSet<G::Node> {
136+
&self.visited
137+
}
128138
}
129139

130140
impl<G> std::fmt::Debug for DepthFirstSearch<G>

compiler/rustc_data_structures/src/graph/mod.rs

+1
Original file line numberDiff line numberDiff line change
@@ -4,6 +4,7 @@ pub mod dominators;
44
pub mod implementation;
55
pub mod iterate;
66
mod reference;
7+
pub mod reversed;
78
pub mod scc;
89
pub mod vec_graph;
910

Original file line numberDiff line numberDiff line change
@@ -0,0 +1,42 @@
1+
use crate::graph::{DirectedGraph, Predecessors, Successors};
2+
3+
/// View that reverses the direction of edges in its underlying graph, so that
4+
/// successors become predecessors and vice-versa.
5+
///
6+
/// Because of `impl<G: Graph> Graph for &G`, the underlying graph can be
7+
/// wrapped by-reference instead of by-value if desired.
8+
#[derive(Clone, Copy, Debug)]
9+
pub struct ReversedGraph<G> {
10+
pub inner: G,
11+
}
12+
13+
impl<G> ReversedGraph<G> {
14+
pub fn new(inner: G) -> Self {
15+
Self { inner }
16+
}
17+
}
18+
19+
impl<G: DirectedGraph> DirectedGraph for ReversedGraph<G> {
20+
type Node = G::Node;
21+
22+
fn num_nodes(&self) -> usize {
23+
self.inner.num_nodes()
24+
}
25+
}
26+
27+
// Implementing `StartNode` is not possible in general, because the start node
28+
// of an underlying graph is instead an _end_ node in the reversed graph.
29+
// But would be possible to define another wrapper type that adds an explicit
30+
// start node to its underlying graph, if desired.
31+
32+
impl<G: Predecessors> Successors for ReversedGraph<G> {
33+
fn successors(&self, node: Self::Node) -> impl Iterator<Item = Self::Node> {
34+
self.inner.predecessors(node)
35+
}
36+
}
37+
38+
impl<G: Successors> Predecessors for ReversedGraph<G> {
39+
fn predecessors(&self, node: Self::Node) -> impl Iterator<Item = Self::Node> {
40+
self.inner.successors(node)
41+
}
42+
}

compiler/rustc_index/src/bit_set.rs

+30
Original file line numberDiff line numberDiff line change
@@ -281,6 +281,24 @@ impl<T: Idx> DenseBitSet<T> {
281281
}
282282

283283
bit_relations_inherent_impls! {}
284+
285+
/// Sets `self = self | !other`.
286+
///
287+
/// FIXME: Incorporate this into [`BitRelations`] and fill out
288+
/// implementations for other bitset types, if needed.
289+
pub fn union_not(&mut self, other: &DenseBitSet<T>) {
290+
assert_eq!(self.domain_size, other.domain_size);
291+
292+
// FIXME(Zalathar): If we were to forcibly _set_ all excess bits before
293+
// the bitwise update, and then clear them again afterwards, we could
294+
// quickly and accurately detect whether the update changed anything.
295+
// But that's only worth doing if there's an actual use-case.
296+
297+
bitwise(&mut self.words, &other.words, |a, b| a | !b);
298+
// The bitwise update `a | !b` can result in the last word containing
299+
// out-of-domain bits, so we need to clear them.
300+
self.clear_excess_bits();
301+
}
284302
}
285303

286304
// dense REL dense
@@ -1087,6 +1105,18 @@ impl<T: Idx> fmt::Debug for ChunkedBitSet<T> {
10871105
}
10881106
}
10891107

1108+
/// Sets `out_vec[i] = op(out_vec[i], in_vec[i])` for each index `i` in both
1109+
/// slices. The slices must have the same length.
1110+
///
1111+
/// Returns true if at least one bit in `out_vec` was changed.
1112+
///
1113+
/// ## Warning
1114+
/// Some bitwise operations (e.g. union-not, xor) can set output bits that were
1115+
/// unset in in both inputs. If this happens in the last word/chunk of a bitset,
1116+
/// it can cause the bitset to contain out-of-domain values, which need to
1117+
/// be cleared with `clear_excess_bits_in_final_word`. This also makes the
1118+
/// "changed" return value unreliable, because the change might have only
1119+
/// affected excess bits.
10901120
#[inline]
10911121
fn bitwise<Op>(out_vec: &mut [Word], in_vec: &[Word], op: Op) -> bool
10921122
where

compiler/rustc_index/src/bit_set/tests.rs

+26
Original file line numberDiff line numberDiff line change
@@ -75,6 +75,32 @@ fn union_two_sets() {
7575
assert!(set1.contains(64));
7676
}
7777

78+
#[test]
79+
fn union_not() {
80+
let mut a = DenseBitSet::<usize>::new_empty(100);
81+
let mut b = DenseBitSet::<usize>::new_empty(100);
82+
83+
a.insert(3);
84+
a.insert(5);
85+
a.insert(80);
86+
a.insert(81);
87+
88+
b.insert(5); // Already in `a`.
89+
b.insert(7);
90+
b.insert(63);
91+
b.insert(81); // Already in `a`.
92+
b.insert(90);
93+
94+
a.union_not(&b);
95+
96+
// After union-not, `a` should contain all values in the domain, except for
97+
// the ones that are in `b` and were _not_ already in `a`.
98+
assert_eq!(
99+
a.iter().collect::<Vec<_>>(),
100+
(0usize..100).filter(|&x| !matches!(x, 7 | 63 | 90)).collect::<Vec<_>>(),
101+
);
102+
}
103+
78104
#[test]
79105
fn chunked_bitset() {
80106
let mut b0 = ChunkedBitSet::<usize>::new_empty(0);

0 commit comments

Comments
 (0)