get the sourcecode [of UTF-8]

Jim Breen jimbreen at gmail.com
Thu Nov 7 17:00:17 CST 2024


On Fri, 8 Nov 2024 at 08:48, Jim DeLaHunt via Unicode
<unicode at corp.unicode.org> wrote:
> If you want to understand how Linux behaves, ask
> Linux and the Linux source code.

Indeed. I've been a long-term user of Unicode with Linux (mostly
Debian). I do quite a lot of conversion between legacy codes, such as
JIS, and Unicode (usually in UTF-8 format). Virtually all my code
conversion is done using  "iconv" which is available as a library
routine and a command-line utility. The source code is available, see:
https://www.gnu.org/software/libiconv/

On rare occasions, I need to dig into UTF-8 at the bit level. I have a
note pinned near my desk as an aide memoire. It has 3 lines:

UTF-8
zzzzyyyyyxxxxx
1110zzzz 10yyyyyy 10xxxxxx

Cheers

Jim
-- 
Jim Breen
Adjunct Snr Research Fellow, Japanese Studies Centre, Monash University
http://www.jimbreen.org/
http://nihongo.monash.edu/


More information about the Unicode mailing list