[web] Line break algorithm using unicode properties #18795

mdebbar · 2020-06-03T20:48:19Z

Switch to using the unicode data instead of checking manually for new lines and whitespace.

Note: this PR uses minimal parts of the unicode data just to cover the main things (e.g. whitespace, NL, BK, LF, CR, etc). There's another PR coming that will implement the full algorithm that'll take advantage of the entire unicode data.

Didn't add any tests because I'm relying on existing tests to verify that the switch to unicode data doesn't break existing behavior.

TODO:

Check if there are any performance implications.

yjbanov · 2020-06-05T20:34:00Z

lib/web_ui/lib/src/engine/text/line_breaker.dart

 }

+/// Normalizes properties that behave the same way into one common property.
+LineCharProperty _normalizeLineProperty(LineCharProperty prop) {


Can this substitution be made in the _packedLineBreakProperties so we never need to convert at run time?

This normalization is part of the algorithm. I want to keep codegen independent.

The other reason is that in some cases we still need to know the original property (pre-normalization). e.g. rawCurr == LineCharProperty.LF.

There are a few optimizations that I can think of. I expect them to have marginal perf improvements but nothing drastic.

Some ideas for my future reference:

Normalize properties during codegen.

Reduce the amount of binary searches by either caching at runtime, or codegen'ing a hash map for the most common characters, or both.

For paragraphs that can fit in a single line, we can avoid a whole bunch of computations.

How about inside UnicodePropertyLookup.fromPackedData?

I'm just throwing ideas out. The PR LGTM. This comment is non-blocking.

UnicodePropertyLookup.fromPackedData is used by both line breaker and word breaker. They have different properties and different rules for normalization.

I see your general point though. And I agree we could do some optimizations there.

yjbanov · 2020-06-05T20:38:16Z

lib/web_ui/lib/src/engine/text/line_breaker.dart

+    // TODO: Use the 2d table now. See https://www.unicode.org/reports/tr14/tr14-22.html#ExampleTable
+
+    // TODO: After using the 2d table, do:
+    // hasSpaces = false;


Should these todos be github issues?

They are coming in the next PR.

fluttergithubbot · 2020-06-09T19:55:39Z

It looks like this pull request may not have tests. Please make sure to add tests before merging. If you need an exemption to this rule, contact Hixie on the #hackers channel in Chat.

Reviewers: Read the Tree Hygiene page and make sure this patch meets those guidelines before LGTMing.

…engine#18795)

mdebbar added the platform-web Code specifically for the web engine label Jun 3, 2020

mdebbar requested a review from yjbanov June 3, 2020 20:48

mdebbar self-assigned this Jun 3, 2020

googlebot added the cla: yes label Jun 3, 2020

yjbanov approved these changes Jun 5, 2020

View reviewed changes

mdebbar force-pushed the line_break_algorithm1 branch from 3b5dcb7 to b2b172c Compare June 8, 2020 22:25

[web] Line break algorithm using unicode properties

db7f8d5

mdebbar force-pushed the line_break_algorithm1 branch from b2b172c to db7f8d5 Compare June 9, 2020 16:26

mdebbar merged commit 3e9f8f3 into flutter:master Jun 9, 2020

engine-flutter-autoroll mentioned this pull request Jun 10, 2020

Roll Engine from e8c13aa012c9 to 7e6c856ea0bf (30 revisions) flutter/flutter#59173

Closed

engine-flutter-autoroll added a commit to engine-flutter-autoroll/flutter that referenced this pull request Jun 10, 2020

3e9f8f3 [web] Line break algorithm using unicode properties (flutter/…

dbb13e8

…engine#18795)

engine-flutter-autoroll mentioned this pull request Jun 10, 2020

Roll Engine from e8c13aa012c9 to eaa2f7f90f6c (33 revisions) flutter/flutter#59183

Closed

engine-flutter-autoroll added a commit to engine-flutter-autoroll/flutter that referenced this pull request Jun 10, 2020

3e9f8f3 [web] Line break algorithm using unicode properties (flutter/…

3fc6f17

…engine#18795)

engine-flutter-autoroll added a commit to engine-flutter-autoroll/flutter that referenced this pull request Jun 10, 2020

3e9f8f3 [web] Line break algorithm using unicode properties (flutter/…

e990815

…engine#18795)

engine-flutter-autoroll mentioned this pull request Jun 10, 2020

Roll Engine from e8c13aa012c9 to b1a08f2abd40 (34 revisions) flutter/flutter#59192

Closed

engine-flutter-autoroll added a commit to engine-flutter-autoroll/flutter that referenced this pull request Jun 10, 2020

3e9f8f3 [web] Line break algorithm using unicode properties (flutter/…

195a640

…engine#18795)

This was referenced Jun 10, 2020

Roll Engine from e8c13aa012c9 to 0e8f89cd71b5 (35 revisions) flutter/flutter#59198

Closed

Roll Engine from e8c13aa012c9 to a960e72656a9 (38 revisions) flutter/flutter#59206

Closed

engine-flutter-autoroll added a commit to engine-flutter-autoroll/flutter that referenced this pull request Jun 10, 2020

3e9f8f3 [web] Line break algorithm using unicode properties (flutter/…

5aa7d09

…engine#18795)

engine-flutter-autoroll mentioned this pull request Jun 10, 2020

Roll Engine from e8c13aa012c9 to a960e72656a9 (38 revisions) flutter/flutter#59211

Closed

engine-flutter-autoroll added a commit to engine-flutter-autoroll/flutter that referenced this pull request Jun 10, 2020

3e9f8f3 [web] Line break algorithm using unicode properties (flutter/…

f9066e0

…engine#18795)

engine-flutter-autoroll added a commit to engine-flutter-autoroll/flutter that referenced this pull request Jun 11, 2020

3e9f8f3 [web] Line break algorithm using unicode properties (flutter/…

25b0e3a

…engine#18795)

engine-flutter-autoroll mentioned this pull request Jun 11, 2020

Roll Engine from e8c13aa012c9 to 14c78ff3aa6c (41 revisions) flutter/flutter#59218

Closed

engine-flutter-autoroll added a commit to engine-flutter-autoroll/flutter that referenced this pull request Jun 11, 2020

3e9f8f3 [web] Line break algorithm using unicode properties (flutter/…

0497968

…engine#18795)

engine-flutter-autoroll mentioned this pull request Jun 11, 2020

Roll Engine from e8c13aa012c9 to 965fbbed1776 (42 revisions) flutter/flutter#59222

Merged

mdebbar mentioned this pull request Jun 12, 2020

web_benchmarks_html text_canvas_cached_layout.html.layout.average and text_canvas_color_grid.html.text_layout.average regression... flutter/flutter#59339

Closed

mdebbar deleted the line_break_algorithm1 branch April 15, 2021 17:40

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[web] Line break algorithm using unicode properties #18795

[web] Line break algorithm using unicode properties #18795

Uh oh!

mdebbar commented Jun 3, 2020 •

edited

Loading

Uh oh!

yjbanov Jun 5, 2020

Uh oh!

mdebbar Jun 8, 2020

Uh oh!

yjbanov Jun 8, 2020

Uh oh!

mdebbar Jun 8, 2020

Uh oh!

mdebbar Jun 8, 2020

Uh oh!

yjbanov Jun 5, 2020

Uh oh!

mdebbar Jun 5, 2020

Uh oh!

fluttergithubbot commented Jun 9, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

[web] Line break algorithm using unicode properties #18795

[web] Line break algorithm using unicode properties #18795

Uh oh!

Conversation

mdebbar commented Jun 3, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

yjbanov Jun 5, 2020

Choose a reason for hiding this comment

Uh oh!

mdebbar Jun 8, 2020

Choose a reason for hiding this comment

Uh oh!

yjbanov Jun 8, 2020

Choose a reason for hiding this comment

Uh oh!

mdebbar Jun 8, 2020

Choose a reason for hiding this comment

Uh oh!

mdebbar Jun 8, 2020

Choose a reason for hiding this comment

Uh oh!

yjbanov Jun 5, 2020

Choose a reason for hiding this comment

Uh oh!

mdebbar Jun 5, 2020

Choose a reason for hiding this comment

Uh oh!

fluttergithubbot commented Jun 9, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

mdebbar commented Jun 3, 2020 •

edited

Loading