Test and in SVG by zcorpan · Pull Request #135 · html5lib/html5lib-tests

zcorpan · 2021-06-04T12:45:00Z

See whatwg/html#6736

zcorpan · 2021-06-04T12:52:53Z

Also need to test this in regular parsing mode (without #document-fragment)

zcorpan · 2021-06-07T21:29:50Z

Also need to test this in regular parsing mode (without #document-fragment)

Done.

stevecheckoway · 2021-06-17T22:38:20Z

+#data
+<math></p><foo>
+#errors
+10: HTML end tag “p” in a foreign namespace context.
+#document
+| <html>
+|   <head>
+|   <body>
+|     <math math>
+|     <p>
+|     <foo>


This test agrees with Firefox but I'm not entirely sure I see how the spec gives this parsing. (I assume the others are similar, this is just the one I happened to look into.)

As I understand it, the <math> should go through a few insertion modes, inserting html, head, and body elements before being processed in the "in body" insertion mode. The rules say to insert a math element in the MathML namespace.

The  is handled by the rules for parsing in a foreign context under the "any other end tag." Step 2 gives a parse error. In step 6, node (the body element) is an element in the HTML namespace so step 7 says to process according to the rules in the current insertion mode which is still "in body."

The rules for  in the "in body" insertion mode say that if there isn't a p element in button scope (and there is not), then this is another parse error (which is missing from this test) and to insert a p element.

Inserting the p should happen at the appropriate place for inserting a node. No override target is specified so target is the current node (the math element). Foster parenting isn't enabled so the "adjusted insertion location [is] inside target, after its last child (if any)."

As far as I can tell, this should cause the following errors.

expected doctype token

unexpected end tag in foreign content (from the parsing in foreign context)

unexpected end tag (from in body)

unexpected EOF (because foo is not closed)

And the DOM should have the p and foo elements as children of math.

There must be something wrong with my analysis, but I can't figure out what it is. Any help is appreciated.

After reading your analysis, I'm no longer convinced the test is correct either, but maybe you just managed to confuse me :)

(I didn't check the number of parse errors before.)

The test agrees with Firefox, Safari, and Chrome. I must be missing something. The Gumbo parser (at least the updated version that appears in Nokogumbo and now in Nokogiri) parses based on my understanding and puts the p and foo elements as children of math.

I just checked html5ever. It doesn't have an up-to-date html5lib-tests submodule so I added the test and it produces the same output as Gumbo (including putting the foo element in the MathML namespace):

input: <math><foo> got: | <html> | <head> | <body> | <math math> | | <math foo> expected: | <html> | <head> | <body> | <math math> | | <foo>

Could this be a spec bug?

I filed an issue in whatwg/html. I suspect I'm just misreading the spec, but I'm not sure where.

See also whatwg/html#5113 and whatwg/html#6736

I believe your reading is correct - and that it was pointed out that this created a dangerous loophole for exploiting sanitizer round tripping. I'm not certain but I think maybe due to the nature of the problem vendors and some libraries may have addressed it before the spec was actually updated to reflect the necessary change.

Ms2ger · 2021-06-21T12:35:07Z

Oh, I merged this before the corresponding spec PR; apologies for the resulting confusion.

This is because of a change in the HTML5 parsing spec to special case closing and tags in foreign content (svg, mathml). See whatwg/html#5113 and html5lib/html5lib-tests#135

Test and in SVG

c7fc3f6

See whatwg/html#6736

zcorpan mentioned this pull request Jun 4, 2021

HTML parser: handle and in SVG whatwg/html#6736

Merged

3 tasks

Test and in SVG and MathML in non-fragment case

373295a

Ms2ger approved these changes Jun 11, 2021

View reviewed changes

Ms2ger merged commit 9b4a29c into master Jun 11, 2021

Ms2ger deleted the bocoup/svg-p-br-end-tag branch June 11, 2021 11:23

zcorpan mentioned this pull request Jun 14, 2021

Update tests/testdata (Test and in SVG) html5lib/html5lib-python#534

Merged

stevecheckoway reviewed Jun 17, 2021

View reviewed changes

This was referenced Jun 17, 2021

Epic: merge Nokogumbo into Nokogiri sparklemotion/nokogiri#2204

Closed

"Any other end tag" whatwg/html#6788

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Test </p> and </br> in SVG#135

Test </p> and </br> in SVG#135
Ms2ger merged 2 commits intomasterfrom
bocoup/svg-p-br-end-tag

zcorpan commented Jun 4, 2021

Uh oh!

zcorpan commented Jun 4, 2021

Uh oh!

zcorpan commented Jun 7, 2021

Uh oh!

stevecheckoway Jun 17, 2021

Uh oh!

Ms2ger Jun 18, 2021

Uh oh!

stevecheckoway Jun 19, 2021

Uh oh!

stevecheckoway Jun 19, 2021

Uh oh!

bathos Jun 19, 2021

Uh oh!

Ms2ger commented Jun 21, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

zcorpan commented Jun 4, 2021

Uh oh!

zcorpan commented Jun 4, 2021

Uh oh!

zcorpan commented Jun 7, 2021

Uh oh!

stevecheckoway Jun 17, 2021

Choose a reason for hiding this comment

Uh oh!

Ms2ger Jun 18, 2021

Choose a reason for hiding this comment

Uh oh!

stevecheckoway Jun 19, 2021

Choose a reason for hiding this comment

Uh oh!

stevecheckoway Jun 19, 2021

Choose a reason for hiding this comment

Uh oh!

bathos Jun 19, 2021

Choose a reason for hiding this comment

Uh oh!

Ms2ger commented Jun 21, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants