Fix dockerfile parser with empty line after escape #24725

yongtang · 2016-07-16T19:45:35Z

- What I did

This fix tries to fix the bug reported by #24693 where an empty line after escape will not be stopped by the parser.

- How I did it

This fix addresses this issue by stop the parser from continue with an empty line after escape.

- How to verify it

An additional integration test has been added.

- Description for the changelog

- A picture of a cute animal (not mandatory but encouraged)

This fix fixes #24693.

Signed-off-by: Yong Tang [email protected]

thaJeztah · 2016-07-16T19:48:51Z

~~LGTM~~

actually, looks like tests are failing, need to look more into this

ping @duglin

thaJeztah · 2016-07-16T20:22:10Z

Looks like tests are failing, think there may be more to look into

duglin · 2016-07-16T20:29:00Z

Don't have time right now to full review but I do recall that we had some really funky rules w.r.t. auto-continuing lines when they're blank. Wouldn't surprise me if the failures are related to that. And the last time we thought about "fixing" this we didn't because we didn't want to break existing Dockerfiles that might count on this odd behavior.

thaJeztah · 2016-07-16T20:32:17Z

Yes, I was wondering, perhaps we strip the continuation character before we arrive here, in case this may see \ as an empty line (but I should look as well)

yongtang · 2016-07-16T21:23:27Z

Thanks @duglin @thaJeztah. The failed test has the following file:

RUN apt-get \update && \
  apt-get \"install znc -y
ADD \conf\\" /.znc

RUN foo \

bar \

baz

CMD [ "\/usr\\\"/bin/znc", "-f", "-r" ]

and expect the result to be:

RUN foo bar baz

In other words, the previous behavior of docker considers empty lines (and comments) as continuation. And you could place many empty lines (or comment) in between:

FROM busybox
RUN echo foo \




RUN bar


CMD [ "\/usr\\\"/bin/znc", "-f", "-r" ]

The result of the above would be one command RUN:

RUN echo foo RUN bar
CMD [ "\/usr\\\"/bin/znc", "-f", "-r" ]

I think it makes sense to at least not allow empty lines to continue as it is quite misleading. People may place an escape by mistake, then it will continue forever, no matter how many empty lines have passed.

We could keep the comment for continuation to not break many existing behaviors, i.e.,

FROM busybox
RUN  \
# my comment
# some other comment 
# and some script, etc
echo bar

will be

RUN echo bar

and the old behavior is maintained in this case.

I updated pull request for the test cases to allow the test pass. But let me know the best way to handle them in these situations, or we could leave the old behavior alone if needed.

thaJeztah · 2016-07-16T21:28:24Z

Sounds good, but we may want to look at the history of that test; if it was testing a specific case, or just confirming "broken" behavior

cyphar · 2016-07-17T00:26:29Z

builder/dockerfile/parser/testfiles/continueIndent/Dockerfile

This comment makes me believe this was intentional functionality.

yes this was intentionally weird and while I'm 100% in favor of "fixing" this, it needs to be discussed by the maintainers because this was purposely kept weird all this time.

Let's remove, or edit, this comment to be more accurate

cyphar · 2016-07-17T00:29:47Z

I think that the bug is not with our treatment of multi-line commands, but with comments:

RUN cd /go/src/github.com/grafana && \
    git clone https://github.com/raintank/grafana-api-golang-client.git && \
    cd grafana-api-golang-client && \
    git checkout raintank #&& \
    #go get github.com/raintank/env-load

RUN go get github.com/tsenart/vegeta
RUN go get github.com/raintank/inspect/inspect-es

Currently the #go get github.com/raintank/env-load line seems to be stripped but the trailing #&& \ is not on the previous line. We should fix how we strip comments, not remove a (questionable) feature of the parser as a hotfix. For example, will the changed code fix this case (from reading it, I don't think it will):

RUN cd /go/src/github.com/grafana && \
    git clone https://github.com/raintank/grafana-api-golang-client.git && \
    cd grafana-api-golang-client && \
    git checkout raintank #&& \
    #go get github.com/raintank/env-load
RUN go get github.com/tsenart/vegeta

thaJeztah · 2016-07-17T00:36:54Z

@cyphar yes, it surprised be that the #go get line is stripped, because the Dockerfile syntax says that comments must begin at position 1, and cannot have spaces before them.

If we stick to that then, in this case, the #go get line would be treated as part of the RUN, and just concatenation with the previous line.

I'm not sure why we ever wanted to support the blank lines though, or if that was a bug in the past, that we decided to keep supporting

cyphar · 2016-07-17T00:42:22Z

TBH, I do actually prefer more lax commenting. It's weird to me (coming from shell scripts) that #&& \ isn't stripped. But definitely there's an inconsistency here that we should deal with.

thaJeztah · 2016-07-17T00:42:46Z

Looks like the blank line behavior was introduced here; #8580

thaJeztah · 2016-07-17T00:48:35Z

@cyphar it's because everything inside the RUN should be passed as-is to the shell inside the comtainer, basically no processing should take place, especially since we now have a SHELL directive, and # may not even be a comment in the shell that is selected

yongtang · 2016-07-17T05:29:03Z

Thanks @cyphar @thaJeztah. Not knowing enough history of docker, I can only guess the reason comment line is continued might be related to the ambiguity when # and \ exist at the same line.

For example, in case of

\ # Should the comment be removed first so that `\` is line escape,
\ # or should we consider this is the escape of space ` ` followed by `#` and comment?

and

# Should the end of line `\` be considered as part of the comment , \
# or the end of line `\` be considered as an escape and not part of the comment ? \

The outputs of the above likely depends on if we parse from the left, or parse from the right.

So continuation with comment lines avoids the issue mentioned above.

Again, I don't know enough history of docker so I could be totally wrong here.

On the other hand, I feel like the continuation on empty line is quite misleading. The issue is that, if the user adds an escape \ at one line, then the user will be forced to adds something other than comment lines or empty lines to stop the escape. And this could easily getting into the next RUN XXX if not careful.

cyphar · 2016-07-17T05:33:46Z

I personally don't like the fact that we are acting differently to shells with \ continuation. But to be clear, my point was that I don't think this patch fixes the underlying problem (if you remove the newline between the two RUNs I think it will still act incorrectly). We need to fix how comments are dealt with -- and fixing up line continuation is a separate thing that we should probably also do.

Also, please note that we don't actually support comments at the end of lines. We only support comments if the # is the first character in the line. Is it a bit of a weird restriction? Sure, but then again we aren't sure what the shell is during build time.

cpuguy83 · 2016-09-08T13:41:01Z

What are we doing with this?

duglin · 2016-09-08T14:06:49Z

my vote is to "fix" this but since it has been this way for ages (intentionally) we'd need agreement to break backwards compatibility.

duglin · 2016-09-08T14:12:06Z

to be clear, I'd like to see two fixes:

make blank lines end a continuation sequence
allow for blanks in front of a comment (#) - remove requirement that "#" be only in the first column

While I think supporting comments at the end of lines would be good, that's a bigger change that would be trickier since we don't have a formal "end of statement" in Dockerfiles so knowing when "#" isn't part of the RUN command itself could be tricky. So for now I think doing the two things above would be a good first step.

duglin · 2016-10-11T17:48:57Z

@thaJeztah that's fine - its just that this PR migrated a bit into fixing some of the oddities in our parser so I tossed in one that was related. Either way is fine with me.

thaJeztah · 2016-10-11T18:33:07Z

@duglin thanks - I wasn't sure if everyone would be on board with changing that, and potentially block this PR (which is already quite old) from being merged.

@yongtang could you keep the change for comments out of this PR, that way I hope we'd be able to get this moving forward quicker. I'm open to doing it in a separate PR so that it can be discussed separately

yongtang · 2016-10-11T22:55:22Z

Thanks @duglin @thaJeztah. The PR has been updated so that now only ^# is considered as the comment. Please take a look and let me know if there are any issues.

mlaventure · 2016-10-21T20:44:20Z

The current version seems to correctly break the continuation sequence.

Provided that we're still ok to have a subsequent PR to fix the rest, LGTM

ping @duglin @thaJeztah

This fix tries to fix the bug reported by moby#24693 where an empty line after escape will not be stopped by the parser. This fix addresses this issue by stop the parser from continue with an empty line after escape. An additional integration test has been added. This fix fixes moby#24693. Signed-off-by: Yong Tang <[email protected]>

yongtang · 2016-10-27T20:29:06Z

The PR has been rebased to fix the merge conflict. All tests passed. Ping @duglin @thaJeztah

duglin · 2016-10-28T15:26:05Z

LGTM

thaJeztah

LGTM

tianon · 2016-11-30T23:23:45Z

This is a pretty huge breaking change for Dockerfile parsing -- are we sure we're committed to making this change?

The failure mode is pretty awful too; here's an example error from trying to build one of the php official image variants:

$ docker version
Client:
 Version:      1.13.0-rc2
 API version:  1.25
 Go version:   go1.7.3
 Git commit:   1f9b3ef
 Built:        Wed Nov 23 17:40:58 2016
 OS/Arch:      linux/amd64

Server:
 Version:             1.13.0-rc2
 API version:         1.25
 Minimum API version: 1.12
 Go version:          go1.7.3
 Git commit:          1f9b3ef
 Built:               Wed Nov 23 17:40:58 2016
 OS/Arch:             linux/amd64
 Experimental:        false
$ docker build 7.0/alpine
Sending build context to Docker daemon 13.82 kB
Error response from daemon: Unknown instruction: --ENABLE-FTP

tianon · 2016-11-30T23:24:26Z

For reference, here's the exact line that's failing on: https://github.com/docker-library/php/blob/7d058f79e6fdf7b8e97d5004d1098c436d348a47/7.0/alpine/Dockerfile#L97

tianon · 2016-11-30T23:54:40Z

builder/dockerfile/parser/testfiles/continueIndent/Dockerfile

-
-this is some more useful stuff
+# this is a comment that breaks escape continuation
+RUN echo this is some more useful stuff


This is the offending test change -- a comment by itself should still be valid in the middle of a continuation (very, very common, useful pattern, supported by Docker for ages now).

yongtang · 2016-12-01T00:10:48Z

@tianon @duglin
Before this PR, the behavior of parser was (See #24725 (comment)):

Space is continued as long as it begins with \:

FROM busybox
RUN echo foo \




RUN bar

CMD [ "\/usr\\\"/bin/znc", "-f", "-r" ]

will result in (many empty lines will continue):

RUN echo foo RUN bar
CMD [ "\/usr\\\"/bin/znc", "-f", "-r" ]

Comment is continued as long as it begins with \:

FROM busybox
RUN  \
# my comment
# some other comment 
# and some script, etc
echo bar

will result in (may comment lines will continue):

RUN echo bar

This PR changed both based on discussions.

If there are concerns with "no-continuation-for-comments", maybe we could revert back?

We probably could keep the other changes "no-continuation-for-empty-lines" alone as it is confusing. The reason is that, in case an user have placed a \ at the end of one line, then this \ will continue even if many empty lines has been added.

Alternatively, I am wondering if it makes sense to add a VERSION for Dockerfile? This should allows for backward-compatibility in the future.

thaJeztah · 2016-12-01T00:17:56Z

@yongtang was discussing with Tianon; problem is here; https://github.com/docker/docker/pull/24725/files#diff-a70e690b6dc1bcdbae36a5f54c868bbeR183

It breaks on a comment line, but should probably detect if it's a comment-only line, and if so, continue

Due to the previous behavior, inside RUN, that means also if there's whitespace before the #

yongtang · 2016-12-01T00:36:00Z

Thanks @thaJeztah. Let me take a look and see if I could provide a quick fix for it.

This fix is to address moby#29005 where continuation after comments are not allowed after moby#24725. This fix made the change so that moby#29005 could be addressed. An integration test has been added. Other related tests have been udpated to conform to the behavior changes. This fix fixes moby#29005, but please feel free for any additional suggestions. Signed-off-by: Yong Tang <[email protected]>

kinghuang · 2016-12-01T18:06:36Z

The behaviour in 1.13-RC2 has broken all of my Dockerfiles as I depend on both spaces and comments being allowed after continuations in the RUN and ENV instructions. As a user, this is a huge and unwelcome breaking change for 1.13 if it stays.

thaJeztah · 2016-12-01T18:35:15Z

@kinghuang we are aware of this change, and will address before 1.13 GA. The discussion is taking place here; #29005, and is currently focussed on the "blank lines", perhaps you can add a minimal example of this from your Dockerfile.

I'll lock the discussion here, so that the discussion does not take place on multiple places, but you're welcome to join in the discussion on #29005

GordonTheTurtle added area/distribution Image Distribution status/0-triage labels Jul 16, 2016

yongtang mentioned this pull request Jul 16, 2016

bug: comment on preceding line in Dockerfile causes subsequent run comments to be ignored #24693

Open

thaJeztah added status/2-code-review status/failing-ci Indicates that the PR in its current state fails the test suite and removed status/0-triage labels Jul 16, 2016

cyphar reviewed Jul 17, 2016
View reviewed changes

yongtang force-pushed the 24693-dockerfile-empty-line-after-escape branch 4 times, most recently from 5789bfa to d442830 Compare September 14, 2016 05:35

duglin mentioned this pull request Sep 14, 2016

Allow comments in && \ # ... moby/buildkit#4240

Closed

yongtang force-pushed the 24693-dockerfile-empty-line-after-escape branch 2 times, most recently from 543d1e1 to 759f845 Compare September 15, 2016 01:08

yongtang force-pushed the 24693-dockerfile-empty-line-after-escape branch from 4512ca7 to 01421e9 Compare October 11, 2016 21:11

yongtang force-pushed the 24693-dockerfile-empty-line-after-escape branch from 01421e9 to 3e1b539 Compare October 27, 2016 17:48

thaJeztah approved these changes Oct 28, 2016

View reviewed changes

thaJeztah added the impact/dockerfile label Oct 28, 2016

thaJeztah added this to the 1.13.0 milestone Oct 28, 2016

thaJeztah merged commit c5adc27 into moby:master Oct 28, 2016

thaJeztah added the impact/changelog label Oct 28, 2016

yongtang deleted the 24693-dockerfile-empty-line-after-escape branch October 28, 2016 20:56

duglin mentioned this pull request Nov 18, 2016

[1.13] docker build failed if there is a comment line in the multi-line RUN statements #28587

Closed

tianon reviewed Nov 30, 2016

View reviewed changes

yosifkit mentioned this pull request Dec 1, 2016

Parser in docker 1.13 breaks backwards compatibility for comments in middle of RUN statement #29005

Closed

yongtang mentioned this pull request Dec 1, 2016

Allow continuation after comments #29006

Closed

moby locked and limited conversation to collaborators Dec 1, 2016

Fix dockerfile parser with empty line after escape #24725

Fix dockerfile parser with empty line after escape #24725

Uh oh!

Conversation

yongtang commented Jul 16, 2016

Uh oh!

thaJeztah commented Jul 16, 2016 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

thaJeztah commented Jul 16, 2016

Uh oh!

duglin commented Jul 16, 2016

Uh oh!

thaJeztah commented Jul 16, 2016

Uh oh!

yongtang commented Jul 16, 2016

Uh oh!

thaJeztah commented Jul 16, 2016

Uh oh!

cyphar Jul 17, 2016

Choose a reason for hiding this comment

Uh oh!

duglin Sep 8, 2016

Choose a reason for hiding this comment

Uh oh!

duglin Sep 20, 2016 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

cyphar commented Jul 17, 2016 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

thaJeztah commented Jul 17, 2016

Uh oh!

cyphar commented Jul 17, 2016 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

thaJeztah commented Jul 17, 2016

Uh oh!

thaJeztah commented Jul 17, 2016

Uh oh!

yongtang commented Jul 17, 2016

Uh oh!

cyphar commented Jul 17, 2016 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

cpuguy83 commented Sep 8, 2016

Uh oh!

duglin commented Sep 8, 2016

Uh oh!

duglin commented Sep 8, 2016

Uh oh!

duglin commented Oct 11, 2016

Uh oh!

thaJeztah commented Oct 11, 2016

Uh oh!

yongtang commented Oct 11, 2016

Uh oh!

mlaventure commented Oct 21, 2016

Uh oh!

yongtang commented Oct 27, 2016

Uh oh!

duglin commented Oct 28, 2016

Uh oh!

thaJeztah left a comment

Choose a reason for hiding this comment

Uh oh!

tianon commented Nov 30, 2016

Uh oh!

tianon commented Nov 30, 2016

Uh oh!

tianon Nov 30, 2016

Choose a reason for hiding this comment

Uh oh!

yongtang commented Dec 1, 2016 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

thaJeztah commented Dec 1, 2016

Uh oh!

yongtang commented Dec 1, 2016

thaJeztah commented Jul 16, 2016 •

edited

Loading

duglin Sep 20, 2016 •

edited

Loading

cyphar commented Jul 17, 2016 •

edited

Loading

cyphar commented Jul 17, 2016 •

edited

Loading

cyphar commented Jul 17, 2016 •

edited

Loading

yongtang commented Dec 1, 2016 •

edited

Loading