Proposal: Replace the token queue with an event-handler system

### Why

The token queue adds a level of indirection that makes it harder to fix some issues. Eg. #292 is easy to fix once the token queue is gone. Also, debugging is currently complicated, as stack traces end at the token queue.

With the queue gone, stack traces will point at the corresponding line in the tokenizer. V8 will be able to optimise more aggressively; in my branch combining all of the changes, I see a ~15% performance increase using `htmlparser-benchmark`.

### Game plan

1. Update the tokenizer to produce events. There will be a `QueuedTokenizer` class that wraps around the tokenizer, which provides an interface for the parser. Opened as #404
2. Invert event processing in the parser. The parser currently first checks the insertion mode, and then the token type. By inverting this (checking first the token type, then the insertion mode), we prepare the parser to accept the events from (1). Opened as #405
3. Tie everything together. Have the updated parser from (2) consume the tokenizer events from (1). Opened as #419

(1) and (2) do not depend on one-another and can be merged independently.

cc @wooorm @43081j 

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Proposal: Replace the token queue with an event-handler system #403

Why

Game plan

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Uh oh!

Proposal: Replace the token queue with an event-handler system #403

Description

Why

Game plan

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions