Wrong sequence for Arabic ligature marks(FC5E-FC62, FCF2-FCF4)

Richard Wordingham richard.wordingham at ntlworld.com
Fri Feb 18 13:48:28 CST 2022


On Fri, 18 Feb 2022 04:44:17 +0000
Saeed Hubaishan via Unicode <unicode at corp.unicode.org> wrote:

> Hi,
> "The Decomposition Type Mapping"  of these ligature marks are worng:
> FC5E     ‎ﱞ‎    Arabic Ligature Shadda With Dammatan Isolated Form
>                 ≈       <isolated> 0020 ␣ 064C ◌ٌ 0651 ◌ّ
> FC5F     ‎ﱟ‎    Arabic Ligature Shadda With Kasratan Isolated Form
>                 ≈       <isolated> 0020 ␣ 064D ◌ٍ 0651 ◌ّ
> FC60     ‎ﱠ‎    Arabic Ligature Shadda With Fatha Isolated Form
>                 ≈       <isolated> 0020 ␣ 064E ◌َ 0651 ◌ّ
> FC61     ‎ﱡ‎    Arabic Ligature Shadda With Damma Isolated Form
>                 ≈       <isolated> 0020 ␣ 064F ◌ُ 0651 ◌ّ
> FC62     ‎ﱢ‎    Arabic Ligature Shadda With Kasra Isolated Form
>                 ≈       <isolated> 0020 ␣ 0650 ◌ِ 0651 ◌ّ
> 
> FCF2     ‎ﳲ‎    Arabic Ligature Shadda With Fatha Medial Form
>                 ≈       <medial> 0640 ‎ـ‎ 064E ◌َ 0651 ◌ّ
> FCF3     ‎ﳳ‎    Arabic Ligature Shadda With Damma Medial Form
>                 ≈       <medial> 0640 ‎ـ‎ 064F ◌ُ 0651 ◌ّ
> FCF4     ‎ﳴ‎    Arabic Ligature Shadda With Kasra Medial Form
>                 ≈       <medial> 0640 ‎ـ‎ 0650 ◌ِ 0651 ◌ّ
> Arabic Shadda must be before the marks (064C ◌ٌ ,064D ◌ٍ , 064E ◌َ ,
> 064F ◌ُ ,  0650 ◌ِ)

But they and shadda have different non-zero canonical combining classes
(ccc), so their order shall intend no difference.  Shadda has the higher
ccc, so it comes last.  Putting it last makes the decomposition table
easier to use for conversion to form NFKD.

Richard.




More information about the Unicode mailing list