"A Programmer's Introduction to Unicode"

Richard Wordingham richard.wordingham at ntlworld.com
Mon Mar 13 18:48:37 CDT 2017

On Mon, 13 Mar 2017 15:26:00 -0700
Manish Goregaokar <manish at mozilla.com> wrote:

> Do you have examples of AA being split that way (and further reading)?
> I think I'm aware of what you're talking about, but would love to read
> more about it.

Just googling for the three words 'Sanskrit', 'sandhi' and 'resolution'
brings up plenty of papers and discussion, e.g. Hellwig's at
http://ltc.amu.edu.pl/book/papers/LRL-1.pdf and a multi-author paper at

There are even technical terms for before and after.  Unsplit text is
'samhita text', and text split into words is 'pada text'.


More information about the Unicode mailing list