您现在的位置： Linux教程網 >> UnixLinux > >> Linux編程 >> Linux編程

Python解析xml文件操作的例子

Python解析xml文件操作實例,操作XML文件的常見技巧。

xml文件內容：

<?xml version="1.0" ?>

<book>
<title>
sample xml thing
</title>
<author>
<name>
<first>
ma
</first>
<last>
xiaoju
</last>
</name>
<affiliation>
Springs Widgets, Inc.
</affiliation>
</author>
<chapter number="1">
<title>
First
</title>
<para>
I think widgets are greate.You should buy lots of them forom
<company>
Spirngy Widgts, Inc
</company>
</para>
</chapter>
</book>

python代碼：

from xml.dom import minidom, Node
import re, textwrap ## www.jbxue.com

class SampleScanner:
""""""

def __init__(self, doc):
"""Constructor"""
assert(isinstance(doc, minidom.Document))
for child in doc.childNodes:
if child.nodeType == Node.ELEMENT_NODE and \
child.tagName == "book":
self.handle_book(child)

def handle_book(self, node):

for child in node.childNodes:
if child.nodeType != Node.ELEMENT_NODE:
continue
if child.tagName == "title":
print "Book titile is:", self.gettext(child.childNodes)
if child.tagName == "author":
self.handle_author(child)
if child.tagName == "chapter":
self.handle_chapter(child)

def handle_chapter(self, node):
number = node.getAttribute("number")
print "number:", number
title_node = node.getElementsByTagName("title")
print "title:", self.gettext(title_node)

for child in node.childNodes:
if child.nodeType != Node.ELEMENT_NODE:
continue
if child.tagName == "para":
self.handle_chapter_para(child)

def handle_chapter_para(self, node):
company = ""
company = self.gettext(node.getElementsByTagName("company"))
print "chapter:para:company", company

def handle_author(self, node):
for child in node.childNodes:
if child.nodeType != Node.ELEMENT_NODE:
continue
if child.tagName == "name":
self.handle_author_name(child)
if child.tagName == "affiliation":
print "affiliation:", self.gettext(child.childNodes)

def handle_author_name(self, node):
first = ""
last = ""
for child in node.childNodes:
if child.nodeType != Node.ELEMENT_NODE:
continue
if child.tagName == "first":
first = self.gettext(child.childNodes)
if child.tagName == 'last':
last = self.gettext(child.childNodes)

print "firstname:%s,lastname:%s" % (first, last)

def gettext(self, nodelist):
retlist = []
for node in nodelist:
if node.nodeType == Node.TEXT_NODE:
retlist.append(node.wholeText)
elif node.hasChildNodes:
retlist.append(self.gettext(node.childNodes))

return re.sub('\s+', " ", ''.join(retlist))

if __name__=="__main__":
doc = minidom.parse("simple.xml")
sample = SampleScanner(doc)

Python解析xml文檔實例 http://www.linuxidc.com/Linux/2012-02/54760.htm

《Python核心編程第二版》.(Wesley J. Chun ).[高清PDF中文版] http://www.linuxidc.com/Linux/2013-06/85425.htm

《Python開發技術詳解》.( 周偉,宗傑).[高清PDF掃描版+隨書視頻+代碼] http://www.linuxidc.com/Linux/2013-11/92693.htm

Python腳本獲取Linux系統信息 http://www.linuxidc.com/Linux/2013-08/88531.htm

在Ubuntu下用Python搭建桌面算法交易研究環境 http://www.linuxidc.com/Linux/2013-11/92534.htm

Python 語言的發展簡史 http://www.linuxidc.com/Linux/2014-09/107206.htm

Python 的詳細介紹：請點這裡
Python 的下載地址：請點這裡

上一篇文章：支持https但不驗證證書的HttpClient
下一篇文章： C++多態實現的機制

Linux編程

Java中采用Dom4j解析XML文件

Ruby解析XML文件

Android之 AndroidManifest.xml 文件解析

使用SAX解析XML文件

Android解析XML文件

使用Python創建xml文件

Python解析xml文檔實例

Android SAX解析xml文件

相關文章

DOM解析XML格式文件實例

Python文件和目錄操作函數總結

Python文件和目錄操作實例代碼

Android 中使用Pull解析XML文件

Dom4j解析帶有命名空間的XML文件

Python文件或目錄操作的常用函數

Java操作XML文件--修改節點

Java操作XML文件--讀取內容

Android開發之XML文件的解析的三種方法

Python解析XML字符串

Android PULL解析xml文件

Android使用Pull解析器解析XML文件

Linux編程

SHELL編程

PERL編程