ref: aa18ef32a67e03aea52890628ce530f73fe0564c
parent: 45208d7331a8df67970dbf796bc74e30128d1d29
author: Tor Andersson <tor.andersson@gmail.com>
date: Tue Apr 4 11:57:38 EDT 2017
Update documentation to note our use of CESU-8 encoded strings.
--- a/docs/reference.html
+++ b/docs/reference.html
@@ -47,7 +47,11 @@
Numbers are represented using double precision floating point values.
<p>
-Strings in the C interface are zero-terminated byte arrays in UTF-8 encoding.
+Strings in the C interface are zero-terminated byte arrays in CESU-8 encoding.
+CESU-8 is a variant of UTF-8 which encodes supplementary unicode characters as
+surrogate pairs. This maintains compatibility with the UTF-16 nature of
+JavaScript, but requires attention when passing strings using supplementary
+unicode characters to and from the MuJS library.
<h3>Environments</h3>