Contains an irregular UTF-8 sequence (i.e. a surrogate pair)
<!DOCTYPE foo [ <!ELEMENT foo ANY> ]> <foo>𐀀</foo>
Expected result | Actual result for org.apache.crimson.parser.XMLReaderImpl |
---|---|
<?xml version="1.0" encoding="UTF-8"?> <ConformanceResults> <startDocument/> <startElement> <namespaceURI/> <localName>foo</localName> <qualifiedName>foo</qualifiedName> <attributes/> </startElement> <fatalError/> <endDocument/> </ConformanceResults> | <?xml version="1.0" encoding="UTF-8"?> <ConformanceResults> <startDocument/> <startElement> <namespaceURI/> <localName>foo</localName> <qualifiedName>foo</qualifiedName> <attributes/> </startElement> <char>\uD800</char> <char>\uDC00</char> <endElement> <namespaceURI/> <localName>foo</localName> <qualifiedName>foo</qualifiedName> </endElement> <endDocument/> </ConformanceResults> |