"A Programmer's Introduction to Unicode"
richard.wordingham at ntlworld.com
Mon Mar 13 18:48:37 CDT 2017
On Mon, 13 Mar 2017 15:26:00 -0700
Manish Goregaokar <manish at mozilla.com> wrote:
> Do you have examples of AA being split that way (and further reading)?
> I think I'm aware of what you're talking about, but would love to read
> more about it.
Just googling for the three words 'Sanskrit', 'sandhi' and 'resolution'
brings up plenty of papers and discussion, e.g. Hellwig's at
http://ltc.amu.edu.pl/book/papers/LRL-1.pdf and a multi-author paper at
There are even technical terms for before and after. Unsplit text is
'samhita text', and text split into words is 'pada text'.
More information about the Unicode