WIP: Generate gitlab ci yaml #8718
Conversation
ping @opadron I've been running this branch against my own local gitlab/ci instance, and though it needs adjustment, it's a decent (working) start. Currently the jobs in the pipeline are discovering they need a rebuild, then failing because my test Docker image doesn't have some of the expected build toolchain.
ping @aashish24
Thanks @scottwittenburg. A few things I have mentioned that you probably know already, but I am documenting them here:
Great start, thanks for pinging me on this.
Force-pushed from fc1ba49 to 9dda56c
tgamblin left a comment
@scottwittenburg @opadron: This is starting to look good to me! See my review comments for details.
FYI: I found this promising GitLab issue on allowing seed jobs to generate gitlab-ci.yml dynamically. It seems like people like it. I put in a comment about our use case and a link to this PR, but I was struck by how similar the proposal is to what we ended up doing here. I hope it gets implemented.
```python
# full_hash doesn't match (or remote end doesn't know about
# the full_hash), then we trigger a rebuild.
remote_pkg_info = remote_pkg_index[pkg_short_hash]
if not 'full_hash' in remote_pkg_info or \
```
Use parens around the condition instead of `\`.
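A minimal sketch of the parenthesized form (the values here are hypothetical stand-ins for the real index lookup):

```python
# Hypothetical values standing in for the real remote index entry.
remote_pkg_info = {'full_hash': 'abc123'}
local_full_hash = 'abc123'

# Parentheses let the condition span lines without a trailing backslash,
# and 'not in' reads better than 'not ... in'.
rebuild = ('full_hash' not in remote_pkg_info
           or remote_pkg_info['full_hash'] != local_full_hash)
```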
```python
for release_spec in release_spec_set:
    pkg_name = release_spec.name
    pkg_version = release_spec.version
    pkg_short_hash = release_spec.dag_hash()
```
The name short_hash here is a little confusing, as Spack (like git) allows you to enter unique prefixes of hashes and have them work. So I'd say the hash in your "short spec" is really a "short hash" 😄.
I think I see why we call it a short hash -- the full_hash is a SHA-256, while the dag_hash is a SHA-1, so it's shorter. That actually took me by surprise -- the full hash is really long and has a bunch of ==== padding at the end of it because SHA-256 length isn't evenly divisible by 5.
Can we do two things here?
1. Since there's just going to be one hash in the end, just call this the hash.
2. Change the full hash to use SHA-1/base32 like the dag_hash, so at least they're the same length. I worry about transitioning full hash to be the main hash if they're different lengths. We can talk about making the hashes longer in Spack later, but I think things will be smoother if the full hash looks like the dag hash.
#8911 represents my attempt to fulfill request (2) above.
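For reference, the length difference described above is easy to check directly; this is just an illustration of the base32 behavior, not Spack code:

```python
import base64
import hashlib

payload = b'example-spec'
dag_style = base64.b32encode(hashlib.sha1(payload).digest())     # SHA-1, 160 bits
full_style = base64.b32encode(hashlib.sha256(payload).digest())  # SHA-256, 256 bits

# 160 bits divides evenly by 5, so base32 of a SHA-1 needs no padding...
print(len(dag_style), dag_style.endswith(b'='))   # 32 False
# ...while 256 bits does not, so base32 of a SHA-256 ends in '====' padding.
print(len(full_style), full_style[-4:])           # 56 b'===='
```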
```python
tty.msg('  %s -> %s' % (mirror, configured_mirrors[mirror]))
mirror_url = configured_mirrors[mirror]
mirror_rebuilds = get_mirror_rebuilds(mirror, mirror_url, release_spec_set)
if len(mirror_rebuilds) > 0:
```
more pythonic: if mirror_rebuilds:
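For illustration, the truthiness check is always equivalent to the explicit length comparison for lists:

```python
for mirror_rebuilds in ([], ['zlib', 'openmpi']):
    # An empty list is falsy, so the two tests always agree.
    assert bool(mirror_rebuilds) == (len(mirror_rebuilds) > 0)
```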
```python
        outf.write(json.dumps(rebuilds))


def check_single(args):
```
seems like most of this function can be replaced with a call to check_all, if check_all took a spec set and/or list of specs as a parameter instead of opening a specific file.
Right now it duplicates a fair amount of logic.
```python
    tty.msg(msg)


def check_binaries(parser, args):
```
Can this command be integrated with spack buildcache or with spack mirror? I think it makes more sense as a subcommand of one of those.
lib/spack/spack/cmd/release_jobs.py
Outdated
```python
share_path = os.path.join('.', 'share', 'spack')
common_scripts_dir = os.path.join(share_path, 'docker', 'build', 'common')

os_container_mapping = {
```
This probably belongs in a config file somewhere.
lib/spack/spack/cmd/release_jobs.py
Outdated
```python
release_spec_set = None

with open(release_specs_path, 'r') as fin:
```
this duplicates the logic in check_all -- why not a function like CombinatorialSpecSet.from_file(path)?
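A rough sketch of the suggested factory method. The class name comes from the comment above; everything else is an assumption, and the file format is simplified here to one spec string per line purely for illustration (the real code parses YAML):

```python
class CombinatorialSpecSet(object):
    """Simplified stand-in for Spack's spec-set class (hypothetical)."""

    def __init__(self, spec_strings):
        self.spec_strings = list(spec_strings)

    @classmethod
    def from_file(cls, path):
        # Centralizing the file reading here lets check_all and the
        # release_jobs command share one code path instead of each
        # opening and parsing the file themselves.
        with open(path) as fin:
            lines = [line.strip() for line in fin]
        return cls(line for line in lines if line)


# Tiny usage demo with a temporary file.
import os
import tempfile

with tempfile.NamedTemporaryFile('w', suffix='.txt', delete=False) as tmp:
    tmp.write('zlib\nopenmpi\n')
spec_set = CombinatorialSpecSet.from_file(tmp.name)
os.unlink(tmp.name)
```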
lib/spack/spack/cmd/release_jobs.py
Outdated
```python
with open(release_specs_path, 'r') as fin:
    release_specs_contents = fin.read()
    release_specs_yaml = yaml.load(release_specs_contents)
```
minor point: consider using spack_yaml instead of yaml. spack_yaml preserves file/line info from the file as attributes on the data structure it reads in (try typing spack config blame config if you are interested).
Force-pushed from 4a8dff1 to 3d130c7
I think this thing is ready for another review pass @tgamblin, at your convenience. Pinging @scheibelp for visibility as well.

Before I try pushing this branch to the gitlab instance set up by @opadron, I will need to regenerate the jobs specified in the auto-generated file.

One thing I don't think I have seen yet is how dependencies will get pulled as binaries from the mirror if they exist. Is that something that should already exist if we just have a mirror configured? Or should I be thinking about how that will work?

ping @aashish24

@scottwittenburg just as an FYI: the kubernetes cluster should be easier than ever to use! (#9018)

fyi @zackgalbreath
@scottwittenburg @opadron and @bryonbean: the S3 mirror should be set up in Kitware's AWS, maybe with Cloudflare, but we can see if we really need that once the traffic starts coming in. There is some info here on hosting static stuff like this via S3. It would be nice if the bucket had a spack-ish domain name.

It is worth considering that as the bucket gets more distributed -- i.e. if we enable CDN and start having files propagated out all over -- I think the likelihood that binaries will appear there quickly goes down. So passing artifacts another way may be the better way to go.
@scottwittenburg: Thanks! This mostly looks good -- just a few comments inline. See above on the S3 mirror.
lib/spack/docs/testing_guide.rst
Outdated
```
either of these options to ``spack install``:

* ``--log-format=cdash-simple``
* ``--log-format=cdash-complete``
```
There's actually just --log-format=cdash (and --log-format=junit) now. Sorry this is dated. Do you want me to fix it or maybe do you or @zackgalbreath want to take a stab?
lib/spack/docs/testing_guide.rst
Outdated
```
``report.configure.xml``, and ``report.test.xml``, for the build,
configure, and tests steps, respectively.

If you want to upload these files to a CDash instance, you can use ``curl``:
```
should probably document @zackgalbreath's --cdash-upload-url here, before the curl stuff. Also @zackgalbreath, does that support authenticated submissions? Should that be documented?
lib/spack/docs/testing_guide.rst
Outdated
```
.. _cmd-spack-test-suite:

---------------------
CDash test suites
```
This whole section can be ripped out and replaced with docs on @scottwittenburg's new infrastructure when it's ready.
```python
          'Use -t to install all downloaded keys')


def read_from_url(file_uri):
```
This probably belongs in spack.util.web, and at least the urlopen part should be replaced with spack.util.web._urlopen for backward-compatibility with Python 2.6. I think you can factor out the logic in spack.util.web._spider() (the parallel web spider used by spack checksum) and use it here -- that stuff already handles things like disabling SSL verification for spack -k/--insecure, and this should respect -k.
Possibly useful (but maybe annoying) side note: Spack is a bit confused about how to fetch from URLs at the moment. fetch_strategy.URLFetchStrategy is probably the main tool for downloading, and it uses curl. It seems like system curl is the most likely tool on HPC systems to have SSL set up properly. Many HPC Pythons don't have SSL cert settings configured right, so using urlopen in Python can fail. curl can be set up wrong too, though, so that might fail as well. At the moment, we don't really have a good fallback system, or a way to try to automatically find certs in the places you might expect them to be.
Probably best not to worry about this too much here and just use _urlopen and the stuff already in spack.util.web, but in case it helps, now you have some more details.
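For concreteness, a sketch of the shape such a helper could take. The function name comes from the diff; the `verify_ssl` flag is an assumption mirroring the behavior of `spack -k/--insecure`, and this is not the actual spack.util.web code:

```python
import os
import pathlib
import ssl
import tempfile
from urllib.request import urlopen


def read_from_url(file_uri, verify_ssl=True):
    # Hypothetical sketch: when run with -k/--insecure, skip certificate
    # checks, like spack.util.web does for its spider.
    context = ssl.create_default_context()
    if not verify_ssl:
        context.check_hostname = False
        context.verify_mode = ssl.CERT_NONE
    return urlopen(file_uri, context=context).read()


# Demo against a local file:// URL so no network access is needed.
with tempfile.NamedTemporaryFile('w', delete=False) as tmp:
    tmp.write('{"zlib": {"full_hash": "abc"}}')
contents = read_from_url(pathlib.Path(tmp.name).as_uri())
os.unlink(tmp.name)
```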
```python
# full_hash doesn't match (or remote end doesn't know about
# the full_hash), then we trigger a rebuild.
remote_pkg_info = buildcache_index[pkg_hash]
if ('full_hash' not in remote_pkg_info or
```
this logic LGTM, but can you add a note that eventually it'll be less complicated (when the dag_hash is the full_hash)?
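The rule under review boils down to a few lines (a paraphrase for illustration, not the PR's exact code; names follow the snippet above):

```python
def needs_rebuild(pkg_hash, local_full_hash, buildcache_index):
    # Rebuild if the mirror has never heard of the package, has no
    # full_hash recorded for it, or records a different full_hash.
    remote_pkg_info = buildcache_index.get(pkg_hash)
    if remote_pkg_info is None:
        return True
    return ('full_hash' not in remote_pkg_info
            or remote_pkg_info['full_hash'] != local_full_hash)


index = {'zlib-hash': {'full_hash': 'abc'}}
up_to_date = needs_rebuild('zlib-hash', 'abc', index)
stale = needs_rebuild('zlib-hash', 'xyz', index)
unknown = needs_rebuild('openmpi-hash', 'abc', index)
```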
```python
except URLError as url_err:
    msg = 'Unable to open url {0} due to {1}'.format(
        file_uri, url_err.message)
    raise spack.error.SpackError(msg)
```
probably ok not to wrap this here, as the function's only used by routines in this file -- the SpackError is immediately caught by needs_rebuild, anyway -- you could just let it catch the URLError.
```python
with open(output_file, 'w') as outf:
    outf.write(json.dumps(rebuilds))

return 1 if rebuilds else 0
```
Not sure if it's clearer or not but int(bool(rebuilds)) would work too :).
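Both spellings yield the same exit codes:

```python
# The conditional expression and the int(bool(...)) form agree for any
# truthy or falsy rebuilds value.
for rebuilds in ({}, {'mirror': ['zlib']}):
    assert (1 if rebuilds else 0) == int(bool(rebuilds))

exit_code = int(bool({'mirror': ['zlib']}))
```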
```python
    return json.loads(index_contents)


def check(mirrors, specs, no_index=False, output_file=None):
```
probably needs a more descriptive name -- I had to think about this a bit.
```python
remote_pkg_index = get_remote_index(mirror_url)

for spec in specs:
    rebuild_spec = needs_rebuild(spec, mirror_url, remote_pkg_index)
```
I wonder if this could be made faster with logic like spack.util.web._spider or just its own multithreading.Pool -- right now it does a bunch of potentially slow web requests sequentially.
My thought is that there are two potential use cases for this: 1) parallel jobs will run this with the --no-index flag, which means one job will fetch one remote .spec.yaml; or 2) a single process will run this without --no-index, which means it will fetch the remote package index from the mirror and then use that to look up the full hashes of all the specs it needs to check.
So given that we don't expect a single process to sequentially make a bunch of slow web requests (though it is possible through misuse/misunderstanding), do we still need to worry about making this faster?
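If parallelism ever does become necessary, a thread pool would be a lightweight option. This is only a sketch; `check_one` is a hypothetical stand-in for a per-spec call like `lambda s: needs_rebuild(s, mirror_url, index)`:

```python
from multiprocessing.pool import ThreadPool


def check_specs_parallel(specs, check_one, num_threads=8):
    # Threads (not processes) are enough here: the per-spec work is
    # dominated by waiting on web requests, not CPU.
    pool = ThreadPool(num_threads)
    try:
        return pool.map(check_one, specs)
    finally:
        pool.close()
        pool.join()


# Trivial demo with a CPU-only stand-in for the web check.
results = check_specs_parallel([1, 2, 3], lambda spec: spec * 2)
```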
lib/spack/spack/cmd/buildcache.py
Outdated
```python
    specs = [Spec(args.spec)]
else:
    release_specs_path = \
        os.path.join(etc_path, 'spack', 'defaults', 'release.yaml')
```
Just break at the first `(` instead of `\`.
@tgamblin would another spack subdomain that redirects to the actual S3 bucket be desirable?
Force-pushed from d02fb63 to 5bcdde7
Force-pushed from cc65500 to 44ef17c
Force-pushed from 003a079 to 02b0403
Force-pushed from 938da3f to e5a37c5
Force-pushed from 3c47de9 to ffc7325
Also provide the "all_deps" option when writing dependent spec to yaml
Since source package hashes are verified, this may be safe enough.
Heads up to reviewers (currently just @tgamblin or @opadron): The diff shown here on github does not match the diff I see locally between the branches.

At any rate, as an example, the diff shown here for at least one file does not match what I see locally.

This may indicate we should close this PR and start over. At which point it may make sense to re-factor into commits which more accurately reflect the logical changes. Let me know what you think.
Force-pushed from fa9d462 to bb7cdfd
Also use the presence of a cdash build id in the install output as the condition for whether the install succeeded or failed, as it seems a configure error may not result in a non-zero exit code from the install command.
This reverts commit 075f898.
Scott, have you had a chance to include my changes to the rebuild-package script?
Closing this in favor of #10274, which is mostly the same but cleaned up and rebased on the latest.
This presents another possible path to generating release binaries, similar to #8444, but with the jobs being created more statically (by a script run manually when the release spec-set changes). Then each job (represented in the rebuild_package.sh shell script) can check for itself whether to build the package or whether that work is not needed.