get the sourcecode [of UTF-8]
Jim Breen
jimbreen at gmail.com
Thu Nov 7 17:00:17 CST 2024
On Fri, 8 Nov 2024 at 08:48, Jim DeLaHunt via Unicode
<unicode at corp.unicode.org> wrote:
> If you want to understand how Linux behaves, ask
> Linux and the Linux source code.
Indeed. I've been a long-term user of Unicode with Linux (mostly
Debian). I do quite a lot of conversion between legacy codes, such as
JIS, and Unicode (usually in UTF-8 format). Virtually all my code
conversion is done using "iconv" which is available as a library
routine and a command-line utility. The source code is available, see:
https://www.gnu.org/software/libiconv/
On rare occasions, I need to dig into UTF-8 at the bit level. I have a
note pinned near my desk as an aide memoire. It has 3 lines:
UTF-8
zzzzyyyyyxxxxx
1110zzzz 10yyyyyy 10xxxxxx
Cheers
Jim
--
Jim Breen
Adjunct Snr Research Fellow, Japanese Studies Centre, Monash University
http://www.jimbreen.org/
http://nihongo.monash.edu/
More information about the Unicode
mailing list