specs: use lazy lexicographic comparison instead of key_ordering by tgamblin · Pull Request #21618 · spack/spack

tgamblin · 2021-02-11T09:40:18Z

We have been using the @llnl.util.lang.key_ordering decorator for specs and most of their components. This leverages the fact that in Python, tuple comparison is lexicographic. It allows you to implement a _cmp_key method on your class, and have __eq__, __lt__, etc. implemented automatically using that key. For example, you might use tuple keys to implement comparison like this:

class Widget:
    # author implements this
    def _cmp_key(self):
        return (
            self.a,
            self.b,
            (self.c, self.d),
            self.e
        )

    # operators are generated by @key_ordering
    def __eq__(self, other):
        return self._cmp_key() == other._cmp_key()

    def __lt__(self, other):
        return self._cmp_key() < other._cmp_key()

    # etc.

The issue with this approach, even for simple comparators, is that we have to build the tuples and we have to generate all the values in them up front, whether or not the comparison ends up needing them. When implementing comparisons for large data structures, this can be costly.
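To make the cost concrete, here is a minimal sketch (not Spack code) of an eager key-based comparison. The `expensive` method and the `calls` log are hypothetical names used only for illustration. Even though the two widgets already differ in their first field, both keys are fully built before the comparison starts:

```python
calls = []  # records every time the costly part of a key is computed


class Widget:
    def __init__(self, name, payload):
        self.name = name
        self.payload = payload

    def expensive(self):
        # stand-in for traversing a large data structure
        calls.append(self.name)
        return tuple(sorted(self.payload))

    def _cmp_key(self):
        # the whole tuple is built up front, including the costly element
        return (self.name, self.expensive())

    def __lt__(self, other):
        return self._cmp_key() < other._cmp_key()


a = Widget("a", [3, 2, 1])
b = Widget("b", [1, 2, 3])
result = a < b  # decided by the names alone, yet both keys were fully built
```

After the comparison, `calls` contains an entry for each widget, which is the work a lazy scheme can skip.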

This PR replaces @key_ordering with a new decorator, @lazy_lexicographic_ordering. Lazy lexicographic comparison maps the tuple comparison shown above to generator functions. Instead of comparing based on pre-constructed tuple keys, users of this decorator can compare using elements from a generator. So, you'd write:

@lazy_lexicographic_ordering
class Widget:
    def _cmp_iter(self):
        yield self.a
        yield self.b
        def cd_fun():
            yield self.c
            yield self.d
        yield cd_fun
        yield self.e

    # operators are added by decorator (but are a bit more complex)

There are no tuples that have to be pre-constructed, and the generator does not have to complete. Instead of tuples, we simply make functions that lazily yield what would've been in the tuple. If a yielded value is a callable, the comparison functions will call it and recursively compare it. The comparator just walks the data structure like you'd expect it to.

The @lazy_lexicographic_ordering decorator handles the details of implementing comparison operators, and the Widget implementor only has to worry about writing _cmp_iter, and making sure the elements in it are also comparable.
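As a rough illustration of the idea, the following is a minimal sketch of a lazy lexicographic decorator. It is not the implementation in this PR: the names `lazy_cmp` and `lazy_ordering_sketch` are hypothetical, it uses a single three-way comparison helper, and it omits hashing and `None` handling.

```python
from itertools import zip_longest

_DONE = object()  # sentinel marking an exhausted generator


def lazy_cmp(lseq, rseq):
    """Return -1, 0, or 1, consuming the two generators only as far as needed."""
    for left, right in zip_longest(lseq(), rseq(), fillvalue=_DONE):
        if left is _DONE:
            return -1  # left is a strict prefix of right
        if right is _DONE:
            return 1
        if callable(left) and callable(right):
            nested = lazy_cmp(left, right)  # recurse into nested sequences
            if nested != 0:
                return nested
        elif left != right:
            return -1 if left < right else 1
    return 0


def lazy_ordering_sketch(cls):
    """Hypothetical decorator: derive operators from cls._cmp_iter."""
    def cmp(self, other):
        return lazy_cmp(self._cmp_iter, other._cmp_iter)

    cls.__eq__ = lambda self, other: cmp(self, other) == 0
    cls.__lt__ = lambda self, other: cmp(self, other) < 0
    cls.__le__ = lambda self, other: cmp(self, other) <= 0
    cls.__gt__ = lambda self, other: cmp(self, other) > 0
    cls.__ge__ = lambda self, other: cmp(self, other) >= 0
    return cls  # hashing is deliberately left out of this sketch


@lazy_ordering_sketch
class Widget:
    def __init__(self, a, b, c, d, e):
        self.a, self.b, self.c, self.d, self.e = a, b, c, d, e

    def _cmp_iter(self):
        yield self.a
        yield self.b

        def cd_fun():
            yield self.c
            yield self.d
        yield cd_fun
        yield self.e
```

With this sketch, `Widget(1, 2, 3, 4, 5) < Widget(1, 2, 3, 9, 0)` is decided inside the nested `cd_fun` generator, and `e` is never yielded for either widget.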

Using this PR shaves another 1.5 sec off the runtime of spack buildcache list, and it also speeds up Spec comparison by about 30%. The runtime improvement comes mostly from two things:

lazily stopping the comparison as soon as possible (e.g., many specs just have different names, which is the first thing the lazy generators return)
avoiding the use of hash() in _cmp_iter() (it was used in _cmp_key() before)
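The early-exit behavior can be observed with a small, self-contained sketch. The helpers `make_iter` and `first_difference` are hypothetical and only mimic the shape of a `_cmp_iter` generator. The log shows that once the names differ, the more expensive later element is never produced:

```python
def make_iter(name, deps, log):
    """Build a generator function that logs each element it produces."""
    def _cmp_iter():
        log.append("name")
        yield name
        log.append("deps")  # only reached if the comparison asks for more
        yield tuple(sorted(deps))
    return _cmp_iter


def first_difference(liter, riter):
    """Walk both generators in lockstep and stop at the first mismatch."""
    for left, right in zip(liter(), riter()):
        if left != right:
            return left, right
    return None


log = []
lhs = make_iter("libelf", ["zlib"], log)
rhs = make_iter("zlib", [], log)
difference = first_difference(lhs, rhs)
```

Here `difference` is the pair of differing names, and `log` records only the two `"name"` steps.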

tgamblin · 2021-02-11T16:11:18Z

Seems there's a bug in here somewhere -- I might not get to fixing that until later today.

alalazo

Minor comments from a first read of the PR. I'll test drive it asap.

alalazo · 2021-02-11T16:16:00Z

lib/spack/llnl/util/lang.py

-    def _cmp_key(self):
-        return tuple(sorted(self.values()))
+    def _cmp_iter(self):
+        for _, v in sorted(self.items()):


I assume the ordering is arbitrary for us, but want to note that this is different from the previous one: the ordering is done here on (key, value) pairs, while before it was just the values.
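A small sketch of the difference being pointed out here, using plain dicts. The body of the new loop is not shown in the diff above, so this assumes it yields each value `v` in turn; the dict contents are made up for illustration:

```python
x = {"a": 2, "b": 1}
y = {"a": 1, "b": 3}

# Old behavior: sort the values themselves.
old_x = tuple(sorted(x.values()))            # (1, 2)
old_y = tuple(sorted(y.values()))            # (1, 3)

# New behavior (assumed): values in order of their sorted keys.
new_x = tuple(v for _, v in sorted(x.items()))  # (2, 1)
new_y = tuple(v for _, v in sorted(y.items()))  # (1, 3)

old_says_x_first = old_x < old_y
new_says_x_first = new_x < new_y
```

For these two dicts the old scheme orders `x` before `y`, while the new one orders `y` before `x`, so the relative order of some objects can change even though each scheme is internally consistent.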

cosmicexplorer · 2021-02-11T16:53:30Z

lib/spack/llnl/util/lang.py

+    tuples. The issue there for simple comparators is that we have to
+    bulid the tuples *and* we have to generate all the values in them up
+    front. When implementing comparisons for large data structures, this
+    can be costly.


Should we be using this whenever possible, then? Is there a use case where this doesn't apply?

tgamblin · 2021-03-22T09:22:59Z

@alalazo @becker33 @eugeneswalker: this is ready for another look. With the latest E4S build cache (which has 37k specs), it significantly improves performance for spack buildcache list:

develop:

       42.68 real        35.00 user         0.75 sys

lazy-lexicographic-spec-comparison:

       24.71 real        23.80 user         0.46 sys

So that shaves > 40% of the time off.

The key is really two things. First, __lt__ is faster b/c of the laziness -- specs that are actually less than others bail as early as possible. For __eq__, or rather for specs that are equal, things are trickier, because we do not want __eq__ to have to traverse an entire spec. I'm leveraging the dag hash like this:

    def __hash__(self):
        # If the spec is concrete, we leverage the DAG hash and just use
        # a 64-bit prefix of it. The DAG hash has the advantage that it's
        # computed once per concrete spec, and it's saved -- so if we
        # read concrete specs we don't need to recompute the whole hash.
        # This is good for large, unchanging specs.
        if self.concrete:
            if not self._dunder_hash:
                self._dunder_hash = self.dag_hash_bit_prefix(64)
            return self._dunder_hash

        # This is the normal hash for lazy_lexicographic_ordering. It's
        # slow for large specs because it traverses the whole spec graph,
        # so we hope it only runs on abstract specs, which are small.
        return hash(lang.tuplify(self._cmp_iter))

So basically, I think this gets us the best of both worlds -- lazy comparison for abstract specs that are small, and leveraging precomputed hashes for specs that are large (which are pretty much all the concrete ones).
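The caching pattern can be sketched outside of Spack as follows. This is an illustration under assumptions, not the real `Spec` code: `ConcreteThing` and `bit_prefix_64` are hypothetical names, and the 64-bit prefix is taken here by base32-decoding the digest and reading its first 8 bytes, which may differ from how `dag_hash_bit_prefix` works internally.

```python
import base64
import hashlib


def bit_prefix_64(b32_hash):
    """Hypothetical helper: first 64 bits of a base32 digest, as an int."""
    raw = base64.b32decode(b32_hash, casefold=True)
    return int.from_bytes(raw[:8], "big")


class ConcreteThing:
    def __init__(self, contents):
        # computed once and stored, like a saved DAG hash
        digest = hashlib.sha1(contents.encode()).digest()
        self.dag_hash = base64.b32encode(digest).decode().lower()
        self._dunder_hash = None

    def __hash__(self):
        # reuse the stored digest instead of traversing the object again
        if self._dunder_hash is None:
            self._dunder_hash = bit_prefix_64(self.dag_hash)
        return self._dunder_hash


thing = ConcreteThing("zlib@1.2.11")
```

Note that Python's built-in `hash()` may reduce a large `__hash__` result to fit the platform's hash width, so `hash(thing)` is not guaranteed to equal the raw 64-bit prefix.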

eugeneswalker

This looks great! Thank you.

becker33

Only request is removing a vestigial comment. Otherwise LGTM

becker33 · 2021-03-25T14:48:09Z

lib/spack/spack/test/spec_syntax.py

        x1 = Spec('a')
        x1.concretize()
-        x1._hash = 'xy'
+        x1._hash = 'xyyyyyyyyyyyyyyyyyyyyyyyyyyyyyyy'


Why is this necessary?

The fact that Spec.__hash__ now uses self.dag_hash_bit_prefix(64) means that there need to be at least 64 bits of hash data to get the prefix. That's 13 base32 characters, so I just went ahead and made these hashes full length. Every real hash will be 32 characters long (160 bits for SHA1) so I made them proper stand-ins.
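The character counts follow from base32 encoding 5 bits per character. A quick check of the arithmetic:

```python
import math

BITS_PER_BASE32_CHAR = 5  # 2**5 == 32 symbols in the alphabet

# Smallest number of base32 characters that covers a 64-bit prefix.
min_chars_for_64_bits = math.ceil(64 / BITS_PER_BASE32_CHAR)

# Number of bits carried by a full-length, 32-character hash.
bits_in_full_hash = 32 * BITS_PER_BASE32_CHAR
```

So a two-character stand-in such as `'xy'` carries only 10 bits, which is too short to supply a 64-bit prefix.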

lib/spack/llnl/util/lang.py

Review outdated, comments have been addressed

Since `lazy_lexicographic_ordering` handles `None` comparison for us, we don't need to adjust the spec comparators to return empty strings or other type-specific empty types. We can just leverage the None-awareness of `lazy_lexicographic_ordering`.

- [x] remove "or ''" from `_cmp_iter` in `Spec`
- [x] remove setting of `self.namespace` to `''` in `MockPackage`
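For context, a `None`-aware element comparison might look like the sketch below. The helper name `none_aware_lt` is hypothetical, and the choice that `None` sorts before every other value is an assumption made for this example; the text above only states that the decorator handles `None`.

```python
def none_aware_lt(left, right):
    """Less-than that tolerates None (assumed to sort before everything)."""
    if left is None:
        return right is not None
    if right is None:
        return False
    return left < right


# Plain Python 3 comparison cannot order None against a string.
try:
    None < "builtin"
    plain_comparison_works = True
except TypeError:
    plain_comparison_works = False
```

With a rule like this in the comparator, a field such as a namespace can stay `None` rather than being replaced by `''` just to keep comparisons from raising.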

tgamblin · 2021-03-28T18:05:59Z

@becker33: comments addressed

tgamblin · 2021-03-31T22:16:29Z

@alalazo FYI

alalazo · 2021-04-01T03:48:12Z

Thanks, I'll update #21683

tgamblin added performance e4s labels Feb 11, 2021

tgamblin requested review from alalazo, cosmicexplorer and eugeneswalker February 11, 2021 09:40

tgamblin force-pushed the lazy-lexicographic-spec-comparison branch from 37d80d8 to ad129e6 Compare February 11, 2021 15:54

alalazo previously requested changes Feb 11, 2021

cosmicexplorer reviewed Feb 11, 2021

tgamblin requested a review from becker33 February 11, 2021 21:32

becker33 requested changes Feb 11, 2021

tgamblin force-pushed the lazy-lexicographic-spec-comparison branch 3 times, most recently from e649850 to 93231c0 Compare February 11, 2021 23:54

tgamblin force-pushed the lazy-lexicographic-spec-comparison branch from 93231c0 to ba31dd0 Compare March 13, 2021 08:52

tgamblin force-pushed the lazy-lexicographic-spec-comparison branch 5 times, most recently from f7cc208 to 670da01 Compare March 22, 2021 09:05

tgamblin force-pushed the lazy-lexicographic-spec-comparison branch 3 times, most recently from df3e379 to eef9b11 Compare March 22, 2021 20:12

eugeneswalker previously approved these changes Mar 24, 2021

becker33 requested changes Mar 25, 2021

tgamblin added 3 commits March 27, 2021 17:21

specs: speed up traversal by avoiding redundant canonicalization

4288faa

tgamblin dismissed eugeneswalker’s stale review via 420177f March 28, 2021 00:22

tgamblin force-pushed the lazy-lexicographic-spec-comparison branch from eef9b11 to 420177f Compare March 28, 2021 00:22

becker33 approved these changes Mar 31, 2021

becker33 merged commit a1d9a56 into develop Mar 31, 2021

becker33 deleted the lazy-lexicographic-spec-comparison branch March 31, 2021 21:39

alalazo mentioned this pull request Apr 1, 2021

Allow for multiple dependencies/dependents from the same package #21683

Closed

3 tasks

tgamblin mentioned this pull request Jul 31, 2021

spack diff: make output order deterministic #25169

Merged

3 tasks

tgamblin mentioned this pull request Oct 23, 2022

Consolidate DAG traversal in traverse.py, support DFS/BFS #33406

Merged
