The API uses UTF-8 encoding. The begin highlighting character corresponds to Unicode code 0xE000 and the end highlighting character corresponds to Unicode code 0xE001.
Please refer to the code samples on how to handle these characters on our MSDN documentation. For the web source type for example the code sample is at http://msdn.microsoft.com/en-us/library/dd250950.aspx.
Thanks!