您现在的位置： Linux教程網 >> UnixLinux > >> Linux編程 >> Linux編程

Java 使用dom讀取XML文件及對中文字符的支持

我本機的開發環境編碼是UTF-8;

以下這個方法正常讀取不含中文的XML文件是沒問題的

public static Element returnRootElement(String fileName) {
String deviceInformation = "";
Document document = null;
Element root = null;
try {
   deviceInformation = FileResource.readFile(fileName,UiUtilPlugin.PLUGIN_ID);
   DocumentBuilder builder;
   builder = DocumentBuilderFactory.newInstance().newDocumentBuilder();
   InputStream is = new ByteArrayInputStream(
     deviceInformation.getBytes());
   document = builder.parse(is);
   root = document.getDocumentElement();
   return root;
} catch (Exception e) {
   e.printStackTrace();
}
return null;
}

但是遇到含有中文的字符呢就會出現這個錯誤

org.apache.xerces.internal.impl.io.MalformedByteSequenceException: Invalid byte 1 of 1-byte UTF-8 sequence

這是因為SAX’s parser沒有設置正確的編碼格式，導致讀取文件出錯。

那原因知道了

我下面貼一下我修改的代碼：

很簡單，加入這幾句就可以了

   InputStream iStream= new ByteArrayInputStream(xmlInformation.getBytes());
   Reader reader = new InputStreamReader(iStream,"GB2312");
   InputSource iSource = new InputSource(reader);
   iSource.setEncoding("GB2312");
   document = builder.parse(iSource);

共同學習。

上一篇文章： iPhone中frame與bounds的區別
下一篇文章： Java加解密藝術之AES對稱加密算法

Linux編程

Java獲取XML節點總結之讀取XML文檔節點

Android中讀取中文字符的文件與文件讀取相關

Java解析XML文件的DOM和SAX方式