Utf8 fixes
The UTF-16 surrogate halves might need to still be valid for compatibility. Also, should we start rejecting RFC-invalid codepoint encodings?
The UTF-16 surrogate halves might need to still be valid for compatibility. Also, should we start rejecting RFC-invalid codepoint encodings?