feat(ci): handle gradle build in Dockerfile by themightychris · Pull Request #1120 · MobilityData/gtfs-validator

themightychris · 2022-04-15T02:22:21Z

Summary:

As a new contributor wanting to try building/running PR branches locally, I was surprised that the Dockerfile didn't actually build the application but rather relied on it being built first by a toolchain on the host machine. The intent of Dockerfile is usually to capture and pin the build dependencies as well as runtime environment.

Expected behavior:

docker build . always goes from branch source to ready-to-run container image

Proposal:

Ideally Dockerfile handles a full build from source. I also configured the default entrypoint to handle running the CLI, as the instructions indicated running the Docker container would give you a shell you could run the jar on but you'd need to override the entrypoint to sh to get that rather than the jShell the JDK base image comes with.

Testing:

docker build . -t gtfs-validator:latest
docker run --rm gtfs-validator:latest --help

Potential improvements:

~~Inserting RUN apk add npm after the first FROM line is needed on branches that include the HTML UI~~
Update/extend the Docker instructions in the README if these changes are acceptable in spirit
Use openjdk:11-slim or openjdk:19-alpine as the runtime base image
- Alpine would be ideal as it offers the smallest base image, but -alpine variants are only available starting with the latest major version 19, so we would need to verify that everything functions correctly under JDK 19
- This might best be done in a separate PR since it introduces a higher risk of runtime behavior change and this PR as-is is fairly isolated from causing any runtime behavior changes (see feat(ci): use slim base for runtime container #1185)

Open questions:

Is this an acceptable change?
Are there any other processes besides the GitHub workflow I updated in this PR that script what happens before docker build that would need updating too?

CLAassistant · 2022-04-15T02:22:27Z

All committers have signed the CLA.

Dockerfile

nackko · 2022-04-15T18:26:39Z

Nice ! I definitely didn't know much about Docker when originally implementing so thank you for the new knowledge :)

themightychris · 2022-04-25T21:56:00Z

@f8full @barbeau I updated the build to use shadowJar and got my CLA signed, should be good to merge now! Let me know if there's anything else I can address

The only other thing I can think of, is that it might be good form to update the Docker section of the README to show building the container locally rather than pulling it from GitHub. As a new developer who was looking to build and run an unmerged branch with Docker, I was surprised to find that section saying to pull the already-built image instead of showing how to build it from source (Update: just pushed a commit adding notes to the README)

themightychris · 2022-06-07T16:50:36Z

Rebased on top of 3.1.0

maximearmstrong

Thank you @themightychris! Only one thing to change in README.md.

More importantly, I get this error when I run the command docker build . --build-arg=VERSION_TAG=v3 -t gtfs-validator:latest. Did you experience something similar?

README.md

themightychris · 2022-06-13T16:15:56Z

@maximearmstrong I suspect the error you see is related to existing build/cache assets in your working directory from doing non-dockerized builds getting pulled into the Docker environment. I did this work so I could skip ever installing/running gradle on my host machine so I won't be able to test this scenario

Could you try adding build/ and any other generated artifacts you see in your existing local working tree to the .dockerignore file and see if that does the trick? We want to filter those all out from getting pulled into the Docker "build context"

maximearmstrong · 2022-06-16T15:31:23Z

@themightychris Indeed, that was the problem. Adding the following lines to .dockerignore solved the problem.

build/
config/

maximearmstrong

@themightychris I think we just need to fix the .dockerignore and the documentation comment here, then we are ready to merge :)

.dockerignore

themightychris · 2022-06-17T21:45:49Z

@maximearmstrong I've:

incorporated your .dockerignore additions
rebased and updated Dockerfile to accommodate the removal of versionTag
updated the README per your recommendation and some other thoughts I had

I think we should be good to go now!

themightychris · 2022-06-18T12:53:07Z

@maximearmstrong can you approve the workflow runs?

maximearmstrong

Thanks for the changes @themightychris! That one thing aside, I think we are ready to merge. Also, could you sign the CLA agreement?

maximearmstrong · 2022-06-27T22:56:39Z

Dockerfile

+COPY --from=build /build/main/build/libs/*.jar /
 WORKDIR /
+
+ENTRYPOINT [ "java", "-jar", "gtfs-validator-0.1.0-SNAPSHOT-cli.jar" ]


It looks like we don't compute the project version using this line when building via Docker, so the validator version is always 0.1.0-SNAPSHOT.

This makes the ENTRYPOINT stable and works well for local testing, but creates a problem for the ones using the actual docker image. This implementation doesn't allow users to know the real validator version, which is usually displayed in the reports, which I think is very valuable. Any thoughts on this @isabelle-dr?

If the shadow jar was created using the correct project version, we could use the file name in the ENTRYPOINT. @bdferris Any thoughts on this? Do you have an idea why the axion-release plugin would not work here?

I believe if you update the actions/checkout@v2 action in the docker.yml file to include tag info then the version number will be computed correctly. See https://github.com/MobilityData/gtfs-validator/blob/master/.github/workflows/package_installers.yml#L33 for an example.

Thanks, that's helpful. Do you know if there is a way to do this locally as well? Currently, building with Docker does not calculate the version. We wouldn't need to hardcode the jar file name if it did.

Assuming I understand your question correctly, local builds should typically work by default, because you are typically working from a full Git repo where all tag info is already present.

That's what I thought too. Currently, when I run this command

docker build . -t gtfs-validator:latest

I end up building a shadow jar named gtfs-validator-0.1.0-SNAPSHOT-cli.jar, which seems also to be the case on Chris' end since the entry point was hardcoded as such. Though, by running ./gradlew shadowJar --no-daemon locally, the jar file is built with the correct tag version.

Ideally, we would create an entry point using the tag detected locally and on GitHub. So I'm looking to find out why the axion-release plugin is not detected in the docker build locally. So far, I have had no luck with the documentation of the axion-release plugin on what might be the issue.

@maximearmstrong my original version of this PR replicated what the GitHub workflow for building docker did, passing in $VERSION_TAG and using that to generate a version-infused entrypoint.

I removed that in 6aa301a because the jar build process stopped respecting the $versionTag input and started building gtfs-validator-0.1.0-SNAPSHOT-cli.jar regardless of input.

I found that b93dc1b, which got merged after I started my PR, removed that functionality so I updated this branch accordingly. We can just revert 6aa301a if the preference is to restore $versionTag

If the desire is to make the versions be driven by Git, I recommend chaining builds off of GitHub's release event and using the release metadata to generate and label the build (rather than driving it off raw Git tags or branch pushes)

Apologies, I'm new to Docker, took me a second to figure out what's going on.

Why wasn't the Docker build resolving the version number correctly in local builds? Recall that the axion-release plugin uses Git tags in the cloned repo to resolve the current version. Since .git was included in the .dockerignore file, none of the Git repro metadata was being copied over with the Docker build context (aka the COPY --chown=gradle:gradle . /build directive). By removing .git from .dockerignore, I was able to build local Docker images that correctly resolved the release version number. @themightychris did you add .git to .dockerignore for a specific reason? I couldn't think of any reasons, but it doesn't mean they don't exist :)

The upshot of (1) is that gradle is now building a jar like gtfs-validator-3.1.1-SNAPSHOT-cli.jar. How to specify the entry point now that the version number is no longer fixed? I'd use something like the following:

COPY --from=build /build/main/build/libs/gtfs-validator-*-cli.jar /gtfs-validator-cli.jar ... ENTRYPOINT [ "java", "-jar", "gtfs-validator-cli.jar" ]

Here, we are copying the gtfs-validator-VERSION-cli.jar (@themightychris note how I didn't copy *.jar, since that picks up main-VERSION.jar as well) and rename it to a file without a version number. That way we can get a consistent reference in the ENTRYPOINT specifier.

I've verified the combo of (1) and (2) works for me on local Docker builds. I think it should also work for the docker.yml release builds, assuming we make the action/checkout tweak I mentioned.

I'll admit that I'm not entirely sure what would happen if you attempted docker build https://github.com/MobilityData/gtfs-validator.git#some_tag. Namely, I'm not sure if Docker will preserve enough git metadata to let version resolution work. Worse-case scenario, it'll fail to resolve the version number, it will default back to 0.1.0-SNAPSHOT, but the ENTRYPOINT should still work correctly.

Thanks a lot @bdferris-v2! I tested it locally and it works for me too. The correct version of the validator is displayed in the report. @themightychris If you can do these changes, we will be ready to merge.

@bdferris-v2 It's a bit of a bad practice to pull .git/ into the Docker build context—it can lead to extra or even sensitive content accidentally getting built into the container, and it's a criss-crossing of concerns to have your Dockerfile build process using a git client and knowledge of your release process to figure things out about the release itself.

A preferred approach is to let the system invoking the build determine the designated version and pass that in via a --build-arg. We can see that workflow happening previously in 6aa301a when gradle shadowJar could accept versionTag as an environment variable.

If I understand correctly, a plugin being used within the gradle workflow is handling extracting the current version from a local .git/ repo? If that's the case, it would be ideal if that could be made to be overridden environmentally, such that a CI system can pass one in and deactivate the build process trying to inspect .git/—Would that be difficult? That's a less quick fix, but this sort of separation of build and release concerns is important for anyone who needs to run their own builds and keep their forks minimal. And it's fine if the gradle build layer has some smarts when its run directly as long as they can be overridden in CI via environment

High-level: I hear what you are saying. That said, I do wonder if we'll just be introducing complexity elsewhere for the sake of purity in the Docker file. But more on that later.

For context, the versioning changes were relatively recent in b93dc1b to better support X.Y.Z version numbers, as used in app packaging, version checks, etc. This moved us away from the previous scheme of extracting version info from $GITHUB_REF in GitHub workflows. One nice by-product of this change is that version info was now correctly computed for local builds as well.

So how would we make this work in a Docker build without access to .git?

First, it is possible to override the version from the Gradle command-line. Per the axion-release plugin docs, you can force the version on the command-line in the following way:

./gradlew -Prelease.forceVersion=3.0.0

Second, it's possible to output the current version as deduced from .git using the Gradle currentVersion command as well:

./gradlew cV -q -Prelease.quiet

So put that all together:

In the docker workflow, we'd compute the current version as described above.

We'd pass that as a --build-arg to the Docker build context.

We'd pass that further into the actual gradle build to override the version.

Kind of roundabout, but I think it technically works? My only question is if it's possible to gracefully fallback for local docker build such that the user doesn't have to remember to manually pass in the version argument.

themightychris · 2022-06-30T17:09:00Z

@maximearmstrong I definitely went through the CLA signature already when I first opened the PR...just did it again and the badge seems to be updated now 👍

bdferris-v2 · 2022-07-08T19:52:14Z

.github/workflows/docker.yml

          if [ "${{ github.event_name }}" = "push" ]; then
            TAGS="${DOCKER_IMAGE}:latest"
          fi
+          echo ::set-output name=gradleVersionTag::${VERSION}


The release version in Gradle is now expected to be the form X.Y.Z (with an optional -SNAPSHOT suffix). I think it would be best if you got rid of version resolution via $GITHUB_REF and use the ./gradle currentVersion mechanism I described in my previous comment.

I've seen approaches like this run into problems in the past: developers are free to push tags without any restrictions GitHub can implement, and it's nondeterministic which of multiple tags for the current HEAD such client-side tooling will select. When it comes to GitHub actions being triggered by the release events though, the maintainer creating the GitHub release selected a specific tag to use/create and GitHub is providing that to the workflow as a singular input via GITHUB_REF. The real-world failure case here is developers pushing pre-release tags for testing, and then the GitHub release flow arbitrarily picking one of those to tag a release with rather than what the maintainer creating the official release tag selected.

Besides, this was already how the current workflow was sourcing the Docker image tag to apply and these changes are just also routing that same designation into the container build. So now they're both chained off the same input and it would be a broader scope of change to source that version designation differently. @bdferris-v2 I'd be happy to take that change on within this PR though if that's still preferred given the gotcha I described above.

The one substantial change I DID make here was dropping the v prefix when reading the Git tag. While it is a git convention to prefix version number tags with v, the convention with Docker image tags is to drop it. The -Prelease.forceVersion= option was also expecting it without the v prefix, so I did it upstream here to drive both the Docker tag and the Gradle version. So as of right now if this got merged, future docker container image publishings would drop thev—which seems like a good move to me this early in publishing one to bring in line with convention.

I wouldn't describe it as non-deterministic. The axios-release-plugin has a defined strategy for resolving the current version from the Git commit history. It's smart enough to ignore arbitrary tags like i-love-tacos but it is true that it will complain if you create a version tag like v1.tacos.

So I'm agreed that this requires us to use consistent vX.Y.Z version tags when tagging releases, but that doesn't seem too terrible. More importantly, this wasn't introduced arbitrarily. The jpackage app packaging phase specifically fails if passed a non-X.Y.Z version number, which is what led to this upstream change in the first place. We are going to need some consistency in our version tags one way or another. If there are other approaches that play well with the free-wheeling nature of Git versioning, I'm interested in learning more about them.

But if we stick with what we've got, then I'd argue that you should drop the code to resolve the version from GITHUB_REF and instead use the currentVersion approach I discussed earlier so that we get consistent version numbers whether we are building from a release tag, a PR, etc. I'll admit it's a hack (thus my inclination to just include the .git metadata in the Docker build itself, but I digress).

Unfortunately, while playing around with forceVersion behavior, I discovered that if you pass in 1.2.3-SNAPSHOT as the forced version, the plugin will happily append an additional -SNAPSHOT to make the version 1.2.3-SNAPSHOT-SNAPSHOT. So there's probably an argument for making the version resolution command:

$(./gradlew cV -q -Prelease.quiet | sed 's/-SNAPSHOT//')

Dockerfile

Co-authored-by: Maxime Armstrong <[email protected]>

…ript

themightychris · 2022-07-13T22:23:54Z

@bdferris-v2 regarding #1120 (comment)

Someone testing release processes will often push semver-formatted prerelease tags like vX.Y.Z-alpha.1. Sounds like axios-release-plugin tries to pick the highest version and commits to semver rules, so someone properly picking pre-release versions shouldn't cause any trouble...

If there are other approaches that play well with the free-wheeling nature of Git versioning, I'm interested in learning more about them.

IME what works best is treating GitHub Releases as authoritative and the driver of published releases. That leaves behind a tag but the release drives publishing, not the tag. Axion's job seems to be to assign version numbers, but is the person creating releases in GitHub using that or handling picking version numbers manually in parallel anyway?

I'm going ahead and implementing your suggestion though. In doing so I've had trouble reconciling all the use cases:

The GitHub workflow has code to handle generating version strings for PRs and arbitrary branches, but the workflow is only hooked up to releases and pushes to master—so that code is unreachable and therefor any changes wouldn't be testable
You mention stripping -SNAPSHOT, but this workflow is set to run either on releases or any push to master, so every push to master between releases would generate a -SNAPSHOT release and by stripping it we'd be publishing a falsely-labeled next minor release to ghcr.io on each push to master (if I'm thinking this through correctly)
The general convention with Docker images is to push trunk builds to :latest

So I took a pass at trying to reconcile this all and narrowed the code to specifically deal with the two cases we have GitHub actions triggers for right now:

Releases publish container builds with a :X.Y.Z version tag generated by gradlew cV (see test run and published image)
Pushes to master publish container builds with a :latest tag (see test run and published image)
All published containers have the gradlew cV-generated version passed into their build via forceVersion and applied to the container labels (which may include X.Y.{Z+1}-SNAPSHOT versions)

Take another look and see if this looks good to merge to you now

./gradlew currentVersion needs full history in order to search parent commits for closest tag

themightychris · 2022-07-14T12:58:41Z

The ./gradlew currentVersion functionality of digging up past tags in order to generate a X.Y.{Z+1}-SNAPSHOT version for non-tagged HEADs wasn't working correctly in my last push because the GitHub actions/checkout@v2 action was only shallow cloning by default. I fixed that in d2551c2

I also:

did some misc cleanup of the action in d2551c2...f6a6a89
found where suppressing an extra -SNAPSHOT needed to happen in d008ce8
folded in feat(ci): use slim base for runtime container #1185 in 4685a37

maximearmstrong

LGTM! Thank you @themightychris. @bdferris-v2, if you are okay with the last changes, I think we are ready to merge.

.github/workflows/docker.yml

Dockerfile

bdferris-v2 · 2022-07-14T18:05:30Z

LGTM. I only had a comment nit + a question, but I don't think they are overall blocking.

barbeau reviewed Apr 15, 2022

View reviewed changes

Dockerfile Outdated Show resolved Hide resolved

themightychris marked this pull request as ready for review April 25, 2022 22:03

themightychris force-pushed the fixes/complete-docker-build branch from 550591a to 35cf956 Compare April 25, 2022 22:05

themightychris requested a review from barbeau April 25, 2022 22:05

isabelle-dr requested a review from maximearmstrong April 26, 2022 18:36

This was referenced Jun 7, 2022

Bug: Running using documented Docker documentation fails #1111

Open

Proposal: enable direct deployment of validator as a web service #1184

Closed

themightychris force-pushed the fixes/complete-docker-build branch from 35cf956 to dd7dbb4 Compare June 7, 2022 16:50

maximearmstrong suggested changes Jun 7, 2022

View reviewed changes

README.md Outdated Show resolved Hide resolved

maximearmstrong reviewed Jun 16, 2022

View reviewed changes

.dockerignore Show resolved Hide resolved

themightychris force-pushed the fixes/complete-docker-build branch from ef039b2 to 84fe4b2 Compare June 18, 2022 12:33

themightychris requested a review from maximearmstrong June 18, 2022 12:52

maximearmstrong reviewed Jun 27, 2022

View reviewed changes

themightychris force-pushed the fixes/complete-docker-build branch from 84fe4b2 to b6508e0 Compare July 8, 2022 18:01

bdferris-v2 reviewed Jul 8, 2022

View reviewed changes

Dockerfile Outdated Show resolved Hide resolved

bdferris-v2 reviewed Jul 8, 2022

View reviewed changes

Dockerfile Outdated Show resolved Hide resolved

bdferris-v2 reviewed Jul 8, 2022

View reviewed changes

Dockerfile Show resolved Hide resolved

themightychris force-pushed the fixes/complete-docker-build branch from 7927d80 to 038a5f3 Compare July 11, 2022 22:08

themightychris requested review from bdferris-v2 and maximearmstrong July 11, 2022 22:25

themightychris and others added 9 commits July 13, 2022 18:16

fix(ci): add build artifacts to .dockerignore

f1434f0

Co-authored-by: Maxime Armstrong <[email protected]>

fix(ci): remove use of versionTag per changes in b93dc1b

a01f444

docs: update README Docker instructions

9221d0c

feat(ci): drive release build version from git

296133d

fix(ci): copy only CLI jar into Docker image and eliminate wrapper sc…

57fe4cc

…ript

fix(ci): only output gradleVersionTag when build is triggered by a tag

20c1534

fix(ci): force Docker repository name to lowercase

9f269e2

feat(ci): drive versions via axion-release-plugin

e2e3ef9

fix(ci): use dynamic repository path for Docker labels

f2227a0

themightychris force-pushed the fixes/complete-docker-build branch from e9fa071 to f2227a0 Compare July 13, 2022 22:16

themightychris added 7 commits July 14, 2022 08:53

chore(ci): add debug output

505ff56

fix(ci): fetch full git history for generating versions

d2551c2

./gradlew currentVersion needs full history in order to search parent commits for closest tag

fix(ci): move ./gradlew call after setup+cache steps

43b079d

chore(ci): update checkout action

d6f2155

chore(ci): delete outdated comment

475b23a

chore(ci): bump docker/build-push-action to v3

85e5b5e

chore(ci): bump docker/setup-buildx-action to v2

f6a6a89

themightychris added 2 commits July 14, 2022 09:14

fix(ci): prevent duplicate -SNAPSHOT

d008ce8

feat(ci): use slim base for runtime container

4685a37

themightychris mentioned this pull request Jul 14, 2022

feat(ci): use slim base for runtime container #1185

Closed

maximearmstrong approved these changes Jul 14, 2022

View reviewed changes

bdferris-v2 reviewed Jul 14, 2022

View reviewed changes

.github/workflows/docker.yml Outdated Show resolved Hide resolved

bdferris-v2 reviewed Jul 14, 2022

View reviewed changes

Dockerfile Show resolved Hide resolved

chore(ci): fix incomplete comment

09df11b

bdferris-v2 approved these changes Jul 14, 2022

View reviewed changes

maximearmstrong merged commit fbae61d into MobilityData:master Jul 14, 2022

themightychris deleted the fixes/complete-docker-build branch July 14, 2022 21:32

Conversation

themightychris commented Apr 15, 2022 • edited by maximearmstrong Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

CLAassistant commented Apr 15, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

nackko commented Apr 15, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

themightychris commented Apr 25, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

themightychris commented Jun 7, 2022

Uh oh!

maximearmstrong left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

themightychris commented Jun 13, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

maximearmstrong commented Jun 16, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

maximearmstrong left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

themightychris commented Jun 17, 2022

Uh oh!

themightychris commented Jun 18, 2022

Uh oh!

maximearmstrong left a comment

Choose a reason for hiding this comment

Uh oh!

maximearmstrong Jun 27, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

themightychris Jun 30, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

themightychris commented Jun 30, 2022

Uh oh!

Choose a reason for hiding this comment

Uh oh!

themightychris Jul 11, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

themightychris commented Jul 13, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

themightychris commented Jul 14, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

themightychris commented Apr 15, 2022 •

edited by maximearmstrong

Loading

CLAassistant commented Apr 15, 2022 •

edited

Loading

nackko commented Apr 15, 2022 •

edited

Loading

themightychris commented Apr 25, 2022 •

edited

Loading

themightychris commented Jun 13, 2022 •

edited

Loading

maximearmstrong commented Jun 16, 2022 •

edited

Loading

maximearmstrong Jun 27, 2022 •

edited

Loading

themightychris Jun 30, 2022 •

edited

Loading

themightychris Jul 11, 2022 •

edited

Loading

themightychris commented Jul 13, 2022 •

edited

Loading

themightychris commented Jul 14, 2022 •

edited

Loading