[liblouis-liblouisxml] Re: Issue 15 in liblouis: Python wchar_t and UTF-16 mismatch

  • From: Neil Soiffer <NeilS@xxxxxxxxxx>
  • To: liblouis-liblouisxml@xxxxxxxxxxxxx
  • Date: Wed, 11 May 2011 22:07:06 -0700

On Wed, May 11, 2011 at 3:35 PM, Michael Whapples <mwhapples@xxxxxxx> wrote:

> I believe there is a function call now available for getting the size of
> widechar, however that will not work if there is an old version of liblouis
> installed.
>
> Regarding converting from 16-bit and 32-bit unicode, in jlouis I used the
> utf-16 and utf-32 encodings. However I do caution the use of that system as
> liblouis uses ucs-2 not utf-16 (ucs-2 is a fixed 2 byte encoding where as
> utf-16 is variable length so can accept characters which require 4 bytes to
> represent, I don't know what would happen if such a character were passed to
> liblouis).
>

FYI: in MathML, some of the "Math Alphanumerics" are defined in plane 1 and
thus require more than two bytes to encode. liblouis does need to handle
these characters.

<snip>

   Neil Soiffer

Other related posts: