Surprising parsing behavior with active formatting elements and PLAINTEXT #8009

The HTML spec has the following note [in the tree construction rules for the `plaintext` start tag](https://html.spec.whatwg.org/multipage/parsing.html#markup-declaration-open-state:~:text=Once%20a%20start%20tag%20with%20the%20tag%20name%20%22plaintext%22):

> Once a start tag with the tag name "plaintext" has been seen, that will be the last token ever seen other than character tokens (and the end-of-file token), because there is no way to switch out of the [PLAINTEXT state](https://html.spec.whatwg.org/multipage/parsing.html#plaintext-state).

This is not true: a spec-compliant parser can still create an element inside `<plaintext>`. Consider the following HTML:

```html
<p><a><plaintext>x
```

It will create the following DOM tree:

```
└─ #document
   └─ html
      ├─ head
      └─ body
         ├─ p
         │  └─ a
         └─ plaintext
            └─ a
               └─ #text: x
```

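I find this quite unexpected, and it doesn't happen with raw text elements such as `<xmp>` or `<style>`. In the example, the `<plaintext>` start tag closes the `<p>`, which pops `<a>` off the stack of open elements but leaves it in the list of active formatting elements. The difference between `<xmp>` and `<plaintext>` is that the start tag handling for the former runs [Reconstruct the active formatting elements](https://html.spec.whatwg.org/multipage/parsing.html#reconstruct-the-active-formatting-elements) before inserting the element, while for the latter the active formatting elements are only reconstructed on the first character token, when `<plaintext>` is already the current node.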

I was wondering about potential security implications of this behavior but couldn't find any. Still, I find this behavior surprising and believe it should be fixed by adding "Reconstruct the active formatting elements" to the start tag handling for `<plaintext>` as well.
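
To make the difference concrete, here is a toy Python model of only the steps involved. It is a rough sketch, not a conforming parser: it handles just the `p`, `a`, `xmp` and `plaintext` start tags followed by one run of text, it skips scoping rules, markers and the adoption agency algorithm, and the `reconstruct_on_plaintext` flag is a hypothetical switch I made up to show what the proposed change would do.

```python
class Node:
    def __init__(self, name, text=None):
        self.name = name
        self.text = text
        self.children = []

    def append(self, child):
        self.children.append(child)
        return child


def dump(node, depth=0):
    """Return the subtree as a list of indented lines."""
    label = f"#text: {node.text}" if node.name == "#text" else node.name
    lines = ["  " * depth + label]
    for child in node.children:
        lines.extend(dump(child, depth + 1))
    return lines


def parse(start_tags, text, reconstruct_on_plaintext=False):
    """Toy "in body" tree construction for a few start tags plus one text run."""
    body = Node("body")
    open_elements = [body]   # stack of open elements
    active_formatting = []   # list of active formatting elements

    def insert(name):
        node = open_elements[-1].append(Node(name))
        open_elements.append(node)
        return node

    def reconstruct():
        # Simplified: re-create every listed formatting element
        # that is no longer on the stack of open elements.
        for i, entry in enumerate(active_formatting):
            if entry not in open_elements:
                active_formatting[i] = insert(entry.name)

    def close_p():
        # Pop up to and including the nearest open <p>, if there is one.
        if any(node.name == "p" for node in open_elements):
            while open_elements.pop().name != "p":
                pass

    for tag in start_tags:
        if tag == "p":
            close_p()
            insert("p")
        elif tag == "a":
            reconstruct()
            active_formatting.append(insert("a"))
        elif tag == "xmp":
            close_p()
            reconstruct()  # the <xmp> rules reconstruct before inserting
            insert("xmp")
        elif tag == "plaintext":
            close_p()
            if reconstruct_on_plaintext:  # hypothetical: the proposed change
                reconstruct()
            insert("plaintext")
        else:
            raise ValueError(f"unsupported tag in this sketch: {tag}")

    # Character tokens: <xmp> switched to the "text" insertion mode, which
    # does not reconstruct; <plaintext> stays "in body", which does.
    if open_elements[-1].name != "xmp":
        reconstruct()
    open_elements[-1].append(Node("#text", text))
    return body


for line in dump(parse(["p", "a", "plaintext"], "x")):
    print(line)
```

With the defaults, this prints the same `body` subtree as above, with `a` nested inside `plaintext`. Calling `parse(["p", "a", "xmp"], "x")`, or the `plaintext` case with `reconstruct_on_plaintext=True`, puts the reconstructed `a` around the element instead of inside it.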
