<html xmlns:v="urn:schemas-microsoft-com:vml" xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:w="urn:schemas-microsoft-com:office:word" xmlns:m="http://schemas.microsoft.com/office/2004/12/omml" xmlns="http://www.w3.org/TR/REC-html40">
<head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
<meta name="Generator" content="Microsoft Word 15 (filtered medium)">
<style><!--
/* Font Definitions */
@font-face
{font-family:"Cambria Math";
panose-1:2 4 5 3 5 4 6 3 2 4;}
@font-face
{font-family:Aptos;}
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
{margin:0cm;
font-size:12.0pt;
font-family:"Aptos",serif;
mso-ligatures:standardcontextual;
mso-fareast-language:EN-US;}
span.E-postmall17
{mso-style-type:personal-compose;
font-family:"Aptos",serif;
color:windowtext;}
.MsoChpDefault
{mso-style-type:export-only;
mso-fareast-language:EN-US;}
@page WordSection1
{size:612.0pt 792.0pt;
margin:70.85pt 70.85pt 70.85pt 70.85pt;}
div.WordSection1
{page:WordSection1;}
--></style><!--[if gte mso 9]><xml>
<o:shapedefaults v:ext="edit" spidmax="1026" />
</xml><![endif]--><!--[if gte mso 9]><xml>
<o:shapelayout v:ext="edit">
<o:idmap v:ext="edit" data="1" />
</o:shapelayout></xml><![endif]-->
</head>
<body lang="SV" link="#467886" vlink="#96607D" style="word-wrap:break-word">
<div class="WordSection1">
<p class="MsoNormal"><span lang="EN-GB">Hi!<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-GB"><o:p> </o:p></span></p>
<p class="MsoNormal"><span lang="EN-GB">I have finally had the opportunity review the new 3GPP 23.038 "code pages",<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-GB">mostly for Indic scripts. (Started five years ago, but been busy with other<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-GB">stuff; sorry for the delay.) Note that these "code pages" are for SMS/CBS<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-GB">use only. They are suitable ONLY for that realm of use, and inappropriate<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-GB">everywhere else.<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-GB"><o:p> </o:p></span></p>
<p class="MsoNormal"><span lang="EN-GB">Unfortunately, the "code pages" in current 23.038 are not well constructed,<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-GB">nor does it seem that they have even been independently reviewed. So...<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-GB">I made new ones to replace them (technically with other reference numbers<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-GB">since changing an existing "code page", using the same refence number would<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-GB">be inappropriate). I also added "code pages" for several scripts not currently<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-GB">covered by 7-bit code pages (thus having to fall back to using "UCS2" (actually<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-GB">UTF-16(BE) currently, likely incurring a “size penalty”…; the SMS protocol has<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-GB">strict size restrictions, it is not called SHORT message service for nothing).<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-GB"><o:p> </o:p></span></p>
<p class="MsoNormal"><span lang="EN-GB">I have no "new" "code pages" for Spanish, Portuguese or Turkish (which have<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-GB">separate "code pages" in 23.038), since these languages are covered better<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-GB">by the new(!!) "default" (actually not default but Latin script) "code page";<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-GB">intending to deprecate the special code pages for Spanish, Portuguese and
<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-GB">Turkish. (Though I call it "new default" it actually has to be set explicitly.)<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-GB"><o:p> </o:p></span></p>
<p class="MsoNormal"><span lang="EN-GB">SMS and CBS are still “a thing” for 5G, 6G and very likely beyond, despite<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-GB">the numerous chat apps and other apps.<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-GB"><o:p> </o:p></span></p>
<p class="MsoNormal"><span lang="EN-GB">You can find (draft!) mapping tables (.TXT) and charts (.docx) in
<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-GB">https://github.com/kent-karlsson/3gpp-propositions. The text files have<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-GB">in the file name the language code for the principal language for which it is<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-GB">intended (except the "default" code page). The charts have the (SMS/CBS)<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-GB">protocol code page number (in hexadecimal) in the file name and section name.<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-GB"><o:p> </o:p></span></p>
<p class="MsoNormal"><span lang="EN-GB">Note that this is work in progress, not yet put forward for standardisation.<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-GB">If you want to comment on these draft proposals, you can do so via github.<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-GB"><o:p> </o:p></span></p>
<p class="MsoNormal"><span lang="EN-GB">/Kent Karlsson<o:p></o:p></span></p>
<p class="MsoNormal"><o:p> </o:p></p>
</div>
</body>
</html>