Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fix: make IMAGE_TAG available in buildArgs when used in docker FROM #9664

Merged
merged 46 commits into from
Jan 23, 2025

Conversation

alphanota
Copy link
Contributor

@alphanota alphanota commented Jan 14, 2025

Note: This PR has the cherry-picked changes from @abe-winter 's original PR #9450.

Before parsing the Dockerfile to extract dependencies, Skaffold generates a list of build arguments for the Dockerfile. These arguments did not previously include [IMAGE_REPO,IMAGE_NAME,IMAGE_TAG] which the document states should be available. This PR makes changes so that these build args are available when extracting dependencies from the Dockerfile.

Overview:

The build args are evaluated by the SourceDependenciesCache.SingleArtifactDependencies interface method. This method is in turn called by CheckArtifacts in the skaffold diagnose command and within the depLister function

The depLister is a function used to get dependencies of a docker image. It implements the DependencyLister func interface.
Its overwritten by a stub implementation in some unit tests.

Changes:

The IMAGE_REPO,IMAGE_NAME_IMAGE_TAG variables are derived from the fully-qualified docker tag for the image.
Previously, the dependencies for the dockerfile were fetched before the tag was generated. The logic used to generate the image tag was moved out of its pkg/skaffold/runner/build.go within the Builder struct and turned into a callable utility method. This is now located in the util package pkg/skaffold/util/util.go.

The depLister function signature was changed to also accept a tag argument to propagate it to the SingleArtifactDependencies struct. The methods that implement depLister were changes to match the new signature. For ex. files in pkg/skaffold/build/cache, and pkg/skaffold/tag.

Diagnose: The diagnose implementation now generate the tags for the images to be built and passes the appropriate tag to SingleArtifactDependencies when its called.

Unit Tests:

Added unit test in dependencies_test.go to directly the new functionality in SingleArtifactDependencies.

Other tests don't test SingleArtifactDepencies directly but use a fake implementation of the depLister function. So the only thing I did here was to change the function signature of the stub to add the tag parameter.

Integration Tests:

In integration/build_test.go:

  • Added integration test to check that the [IMAGE_REPO,IMAGE_NAME,IMAGE_TAG] build args are properly evaluated and passed to the docker image.

  • Added second integration test to ensure that IMAGE_TAG can be used as part of a file to be copied from the host to the docker image.

Miscellaneous Bug Fixes:

Original PR Description

Related:

Description
Apologies if I am simply misunderstanding something and this can be fixed w/out a code change. I am certainly not fully wrapping my head around this codebase.

This PR adds the ability to use IMAGE_TAG (IMAGE_REPO etc) in the docker buildArgs section, specifically when the build arg is then consumed in the Dockerfile FROM. On current stable and main, this case fails in the depList / SingleArtifactDependencies phase, with logs like:

Checking cache...

  • base: Found Remotely
  • child: Error checking cache.

getting hash for artifact "child": getting dependencies for "child": parsing ONBUILD instructions: retrieving image "gcr.io/headsdown/base:": parsing reference "gcr.io/headsdown/base:": could not parse reference: gcr.io/headsdown/base:

Note that when used with dependencies (as below), this seems to only work reliably if I run skaffold build && skaffold run. skaffold run by itself will sometimes fail with 'image not found remotely'.

Repro

Stripped-down skaffold.yaml:

build:
  local:
    useBuildkit: true
  artifacts:
    - image: base
      docker:
        dockerfile: Dockerfile
    - image: child
      requires:
      - image: base
      docker:
        dockerfile: child/Dockerfile
        buildArgs:
          IMAGE_TAG: '{{.IMAGE_TAG}}'

Top of child/Dockerfile:

# syntax=docker/dockerfile:1
ARG IMAGE_TAG
FROM gcr.io/PROJECTNAME/base:$IMAGE_TAG

tldr the second artifact depends on the first and then uses it as a base image

User facing changes
I think the docs already imply this is possible

Follow-up Work
No idea if I'm conforming to house style -- if accepted, guessing someone on the team will need to clean up or at least assign me cleanup. I would normally have submitted a bug rather than a PR for something like this, but I didn't understand why my template param was missing until I'd already solved the problem.

If this is something useful, happy to add tests etc.

@alphanota : We might want to add integration tests to check that the build args are resolved correctly.

@alphanota alphanota requested a review from a team as a code owner January 14, 2025 21:24
@alphanota alphanota requested a review from plumpy January 14, 2025 21:24
Copy link

@ghost ghost left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hi team,

I'm currently reviewing this pull request and will have a full review in a few minutes. In the meantime, here's a summary to help you and other reviewers quickly get up to speed:

This pull request addresses an issue where the IMAGE_TAG variable wasn't correctly available in buildArgs when used within a Docker FROM instruction. The problem manifested as errors during the dependency resolution phase of the build process, resulting in messages like could not parse reference: gcr.io/headsdown/base:<no value>.

Intent: The primary goal is to enable the use of IMAGE_TAG (and related variables like IMAGE_REPO) within buildArgs when those args are used to define the base image in a Dockerfile. This allows for more dynamic and flexible image building.

Changes:

The changes involve modifying several functions related to dependency listing and cache handling within the Skaffold build process. Specifically:

  • The DependencyLister function signature in pkg/skaffold/build/cache/cache.go was updated to include a tags map, allowing for the propagation of image tags.
  • The hash function in pkg/skaffold/build/cache/hash.go was modified to incorporate this tags map, ensuring that the cache correctly accounts for the image tag values.
  • Several other functions related to cache lookup and dependency resolution were updated to handle the tags map consistently, including pkg/skaffold/build/cache/lookup.go, pkg/skaffold/build/gcb/cloud_build.go, pkg/skaffold/diagnose/diagnose.go, and pkg/skaffold/runner/new.go.
  • A new helper function quickMakeEnvTags was added to pkg/skaffold/graph/dependencies.go to parse and create the tags map from an image reference.
  • The SingleArtifactDependencies function in pkg/skaffold/graph/dependencies.go was modified to accept and use the tags map.
  • The EvalBuildArgs function in pkg/skaffold/graph/dependencies.go was updated to EvalBuildArgsWithEnv, which takes the environment tags into account.

Activity: The pull request author, alphanota, provided a detailed description of the problem, including a reproduction scenario and a proposed solution. They also acknowledged potential style issues and expressed willingness to make further improvements.

I'll provide a more detailed review shortly, but this summary should give you a good overview of the changes and their purpose.

And now, a little haiku to brighten your day:

Tags flow, code compiles,
Building images, smooth and fast,
Skaffold's power shines.

Copy link

@ghost ghost left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This PR addresses a critical issue where IMAGE_TAG (and related variables) were not available in buildArgs when used within the FROM instruction of a Dockerfile. The solution correctly propagates the tag information to the dependency resolution stage. However, the introduction of quickMakeEnvTags introduces a potential point of failure and adds complexity. I recommend exploring alternative approaches that leverage existing Skaffold mechanisms for resolving tags.

@alphanota alphanota closed this Jan 14, 2025
@alphanota alphanota deleted the abe-winter-changes branch January 14, 2025 21:28
@alphanota alphanota restored the abe-winter-changes branch January 14, 2025 21:29
@alphanota alphanota reopened this Jan 14, 2025
@pull-request-size pull-request-size bot added size/L and removed size/M labels Jan 16, 2025
Copy link
Collaborator

@plumpy plumpy left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Ideally I'd love to see an integration test that uses these new args. But at the very least, there should be some unit tests.

@alphanota alphanota requested a review from plumpy January 23, 2025 00:08
@plumpy plumpy merged commit 93e325b into GoogleContainerTools:main Jan 23, 2025
12 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants