I'm trying to parse XML in Java:
DocumentBuilderFactory dbf = DocumentBuilderFactory.newInstance();
DocumentBuilder db = dbf.newDocumentBuilder();
Document doc = db.parse(new ByteArrayInputStream(msg.getBytes("UTF-8")));
Everything seems to work fine:
<data>äöü</data>
is parsed correctly (in particular the German umlauts).
But when I try to parse
<däta>xxx</däta>
the parser throws an exception; element names containing umlauts do not seem to work:
org.w3c.dom.DOMException: WFä
    at org.apache.harmony.xml.dom.NodeImpl.setName(NodeImpl.java:286)
    at org.apache.harmony.xml.dom.AttrImpl.<init>(AttrImpl.java:55)
    at org.apache.harmony.xml.dom.DocumentImpl.createAttribute(DocumentImpl.java:324)
    at org.apache.harmony.xml.parsers.DocumentBuilderImpl.parse(DocumentBuilderImpl.java:314)
    at org.apache.harmony.xml.parsers.DocumentBuilderImpl.parse(DocumentBuilderImpl.java:321)
    at org.apache.harmony.xml.parsers.DocumentBuilderImpl.parse(DocumentBuilderImpl.java:128)
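
For reference, here is a minimal self-contained sketch of the two cases above. The class name UmlautParseCheck and the helper rootName are hypothetical names chosen for this example, and the umlauts are written as Unicode escapes so the result does not depend on the source file encoding. As far as I can tell, XML 1.0 allows ä (U+00E4) in element names, so a conforming parser such as the one bundled with a desktop JDK should accept both inputs; that would suggest the problem lies in the Harmony-based implementation shown in the stack trace rather than in the document itself.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;

class UmlautParseCheck {

    // Parses the given XML string as UTF-8 and returns the root element's name.
    // Any parser failure is rethrown unchecked so the caller sees the cause.
    static String rootName(String msg) {
        try {
            DocumentBuilderFactory dbf = DocumentBuilderFactory.newInstance();
            DocumentBuilder db = dbf.newDocumentBuilder();
            Document doc = db.parse(
                    new ByteArrayInputStream(msg.getBytes(StandardCharsets.UTF_8)));
            return doc.getDocumentElement().getNodeName();
        } catch (Exception e) {
            throw new IllegalStateException("parse failed: " + msg, e);
        }
    }

    public static void main(String[] args) {
        // Umlauts in the text content only: <data>äöü</data>
        System.out.println(rootName("<data>\u00e4\u00f6\u00fc</data>"));
        // Umlaut in the element name: <däta>xxx</däta>
        System.out.println(rootName("<d\u00e4ta>xxx</d\u00e4ta>"));
    }
}
```

Running the same two inputs on the device and on a desktop JVM makes it easy to see whether the failure is specific to one parser implementation.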