All line-ends are normalized, even those not passed to the application. NB this can only be tested effectively in XML 1.1, since CR is in the S production; in 1.1 we can use NEL which isn't.
<?xml version="1.1" encoding="iso-8859-1"?> <!DOCTYPE foo [ <!ELEMENT foo ANY> <!ATTLIST foo bar CDATA #IMPLIED> ]> <foo bar="hello"/>
Expected result | Actual result for org.apache.crimson.parser.XMLReaderImpl |
---|---|
<?xml version="1.0" encoding="UTF-8"?> <ConformanceResults> <startDocument/> <startElement> <namespaceURI/> <localName>foo</localName> <qualifiedName>foo</qualifiedName> <attributes> <attribute> <namespaceURI/> <localName>bar</localName> <qualifiedName>bar</qualifiedName> <value>hello</value> <type>CDATA</type> </attribute> </attributes> </startElement> <endElement> <namespaceURI/> <localName>foo</localName> <qualifiedName>foo</qualifiedName> </endElement> <endDocument/> </ConformanceResults> | <?xml version="1.0" encoding="UTF-8"?> <ConformanceResults> <startDocument/> <fatalError/> </ConformanceResults> |