-
Notifications
You must be signed in to change notification settings - Fork 789
[css-text-3] Should enclosed ideographic blocks be space-discarding? #4992
Copy link
Copy link
Closed
Labels
Closed Accepted by Editor DiscretionTesting UnnecessaryMemory aid - issue doesn't require testsMemory aid - issue doesn't require testsTracked in DoCcss-text-3Current WorkCurrent Worki18n-clreqChinese language enablementChinese language enablementi18n-jlreqJapanese language enablementJapanese language enablementi18n-klreqKorean language enablementKorean language enablementi18n-trackerGroup bringing to attention of Internationalization, or tracked by i18n but not needing response.Group bringing to attention of Internationalization, or tracked by i18n but not needing response.
Metadata
Metadata
Assignees
Labels
Closed Accepted by Editor DiscretionTesting UnnecessaryMemory aid - issue doesn't require testsMemory aid - issue doesn't require testsTracked in DoCcss-text-3Current WorkCurrent Worki18n-clreqChinese language enablementChinese language enablementi18n-jlreqJapanese language enablementJapanese language enablementi18n-klreqKorean language enablementKorean language enablementi18n-trackerGroup bringing to attention of Internationalization, or tracked by i18n but not needing response.Group bringing to attention of Internationalization, or tracked by i18n but not needing response.
In #337 we decided to key line-break transformation behavior by Unicode Block. Most of the blocks are pretty straightforward: Han, Kana, Yi, and CJK punctuation blokcs discard, and everything else converts to a space. But there are a few interesting cases...
One interesting case is the enclosed ideographic blocks:
https://en.wikipedia.org/wiki/Enclosed_CJK_Letters_and_Months
https://en.wikipedia.org/wiki/Enclosed_Ideographic_Supplement
The numerics in the Letters and months block seem likely to be used outside of CJK context, also there are quite a few Hangul, and I wouldn't be surprised if at least some of the other characters are also used in Korean sometimes.
Note, however, that we only discard if both sides (before and after) the line break are part of the space-discarding character set.