大气层煮月亮

【目标检测】一文干翻xml文件的读取

前言

在目标检测中xml文件的读取非常常见，常常要用到labelimg、labelme等标注软件，打标时往往需要打开xml文件，但奈何一直没找到一篇完整的文章，故自己打算手写一篇。下面介绍利用python解析xml文件的方法。

【目标检测】一文搞定xml文件的读取

.xml实例

.py实战中得真知：

1、获取根节点标签中的文本。

.py

vscode.output

2、获取标签中的子节点的属性（多个）。

.py

vscode.output

番外阅读

.xml实例

这是一个在目标检测中十分常见的.xml文件，今天我们就以它来作为例子！


	VOC2007
	000001.jpg
	
		The VOC2007 Database
		PASCAL VOC2007
		flickr
		341012865
	
	
		Fried Camels
		Jinky the Fruit Bat
	
	
		353
		500
		3
	
	0

.py实战中得真知：

tree = ET.parse(xml_path) 读取xml文档
root = tree.getroot() 获取根节点

解析：ET.parse()将xml文件读入到dom,返回一个etree对象，可以通过etree的getroot()、find()等函数对树的根节点和某个子节点进行访问。如findall("object")则返回所有的object节点，还可以通过.text访问节点的文本属性。

1、获取根节点标签中的文本。

.py

import xml.etree.ElementTree as ET
def get_JPGImgName(xmlpath):
    dom=ET.parse(xmlpath)
    root=dom.getroot()
    #print(root.find('filename').text)
    return root.find('filename').text

if __name__ == '__main__':
    print(get_JPGImgName(r'VOC2007_Annotations\000001.xml'))

vscode.output

2、获取