[css-text-3] Clarify Segment Break Transformation Rules when mutiple segment breaks involve #836

upsuper · 2016-12-26T12:07:42Z

The first rule for collapsing segment breaks is:

If the character immediately before or immediately after the segment break is the zero-width space character (U+200B), then the break is removed, leaving behind the zero-width space.

It is not clear to me what should happen if there are multiple segment breaks involve here. For example, if I have ZWSP LF LF LF x, would this rule produce:

ZWSP LF LF x (with only the first LF removed), or
ZWSP x (with all LF removed because of recursively applying this rule)?

(In the first case, the remaining LFs would be converted to whitespaces by the last rule there, and the second whitespace would be removed by step 4 of Phase I, so the final result would be ZWSP WS x.)

This may also affect the second rule:

Otherwise, if the East Asian Width property of both the character before and after the line feed is F, W, or H (not A), and neither side is Hangul, then the segment break is removed.

If I have W LF LF W, should the two LFs be removed by this rule?

It seems to me that removing all segment breaks together would be easier for implementation, so I would propose making the rules that way if there are no other concerns.

The text was updated successfully, but these errors were encountered:

upsuper · 2016-12-26T12:08:27Z

cc @chenpighead

fantasai · 2016-12-26T13:14:43Z

Amending this to be consecutive segment breaks makes sense to me. Seems like limiting it to only one is an error. The only thing that shouldn't change is that, if there's a space or tab somewhere in that sequence, the sequence becomes a space and not nothing.

upsuper · 2016-12-26T13:19:34Z

For those rules, I don't think there can be any whitespace or tab in their input, because whitespaces should have been removed by step 1 of Phase I if there is any segment break.

fantasai · 2016-12-26T13:37:07Z

Oh, right. Yeah, that's probably more correct. :)

fantasai · 2016-12-27T06:53:08Z

Agenda+ to confirm the fix.

upsuper added the css-text-3 Current Work label Dec 26, 2016

upsuper assigned fantasai and kojiishi Dec 26, 2016

fantasai closed this as completed in 82deba7 Dec 26, 2016

fantasai added Agenda+ Closed Accepted as Obvious Bugfix labels Dec 27, 2016

astearns removed the Agenda+ label Mar 8, 2017

fantasai added the Tracked in DoC label Mar 6, 2018

frivoal added the Tested Memory aid - issue has WPT tests label Apr 25, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[css-text-3] Clarify Segment Break Transformation Rules when mutiple segment breaks involve #836

[css-text-3] Clarify Segment Break Transformation Rules when mutiple segment breaks involve #836

upsuper commented Dec 26, 2016

upsuper commented Dec 26, 2016

fantasai commented Dec 26, 2016

upsuper commented Dec 26, 2016

fantasai commented Dec 26, 2016

fantasai commented Dec 27, 2016

[css-text-3] Clarify Segment Break Transformation Rules when mutiple segment breaks involve #836

[css-text-3] Clarify Segment Break Transformation Rules when mutiple segment breaks involve #836

Comments

upsuper commented Dec 26, 2016

upsuper commented Dec 26, 2016

fantasai commented Dec 26, 2016

upsuper commented Dec 26, 2016

fantasai commented Dec 26, 2016

fantasai commented Dec 27, 2016