XML(eXtensible Markup Language)是国际标准化组织的标准通用标记语言SGML的子集。由以下规范组成:
在PHP5之前的版本对XML的处理存在很多的问题,例如XML工具之间只具有简单的关联、每个工具不能一起协同工作等,PHP5中的新XML扩展具有下列特点:
HTML和XML都是SGML的子集,所以他们有很大的相似性。下面是XML相对于HTML具有的独特性:
<?xml version="1.0"?> <shows> <show> <name>Simpsons</name> <channel>FOX</channel> <start>8:00 PM</start> <duration>30</duration> </show> <show> <name>Law & Order</name> <channel>NBC</channel> <start>8:00 PM</start> <duration>60</duration> </show> </shows>
形式良好的XML文档须具备下列特征:
// 创建一个新的文档
$dom = new DOMDocument('1.0'); // 创建一个根元素<book>并将其添加到文档
$book = $dom->appendChild($dom->createElement('book')); // 创建一个title子元素,并添加到$book中 $title = $book->appendChild($dom->createElement('title')); // 设置title元素的文本及cover属性 $title->appendChild($dom->createTextNode('PHP Cookbook')); $title->setAttribute('cover', 'soft'); // 创建并将author元素添加到$book中 $sklar = $book->appendChild($dom->createElement('author')); //添加文本到author节点
$sklar->appendChild($dom->createTextNode('Sklar')); $trachtenberg = $book->appendChild($dom->createElement('author')); $trachtenberg->appendChild($dom->createTextNode('Trachtenberg')); // 输出完美格式化的XML文档
$dom->formatOutput = true; echo $dom->saveXML();
输出内容如下:
<?xml version="1.0"?> <book?> <cover="soft">PHP Cookbook</title> </book>
常用有三种方式来解析XML文件
XML示例文件如下(address-book.xml):
<?xml version="1.0"?> <address-book> <person id="1"> <!--David Sklar--> <firstname>David</firstname> <lastname>Sklar</lastname> <city>New York</city> <state>NY</state> <email>[email protected]</email> </person> <person id="2"> <!--Adam Trachtenberg--> <firstname>Adam</firstname> <lastname>Trachtenberg</lastname> <city>San Francisco</city> <state>CA</state> <email>[email protected]</email> </person> </address-book>
通过SimpleXML方式:
$sx = simplexml_load_file('address-book.xml'); foreach ($sx->person as $person) { $firstname_text_value = $person->firstname; $lastname_text_value = $person->lastname; print "$firstname_text_value $lastname_text_value\n"; }
通过DOM扩展:
$dom = new DOMDocument; $dom->load('address-book.xml'); foreach ($dom->getElementsByTagname('person') as $person) { $firstname = $person->getElementsByTagname('firstname'); $firstname_text_value = $firstname->item(0)->firstChild->nodeValue; $lastname = $person->getElementsByTagname('lastname'); $lastname_text_value = $lastname->item(0)->firstChild->nodeValue; print "$firstname_text_value $lastname_text_value\n"; }
通过XMLReader扩展:
$reader = new XMLReader(); $reader->open('card-catalog.xml'); while ($reader->read()) { if ($reader->nodeType == XMLREADER::ELEMENT && $reader->localName == 'author') { $reader->read(); print $reader->value . "\n"; } }
在SimpleXML和DOM扩展中都有XPath
//SimpleXml示例 $emails = $s->xpath('/address-book/preson/email'); //DOM扩展示例 $xpath = new DOMXPath($dom); $email = $xpath->query('/address-book/preson/email');
在PHP中,DOM扩展支持基于DTD,XML Schema和RelaxNG的验证,而SimpleXML则只提供了XML Schema验证。
如果数据来源为其他格式,需要经过编码为UTF-8格式,下面是通过iconv库进行转换的示例
$utf_8 = iconv('ISO-8859-1', 'UTF-8', $iso_8859_1);
其他参考资料:《PHP高级开发技术与实例》 清华大学出版社