PacosonSWJTU

dom4j-cookbook

【0】README

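1) This article is translated from http://dom4j.sourceforge.net/dom4j-1.6.1/cookbook.html

2) Intro:

2.1) dom4j is an object model that represents an XML tree in memory. It offers an easy-to-use API with powerful features for processing and manipulating XML, and it works together with XPath, XSLT, SAX, JAXP, and DOM.

2.2) dom4j is designed around interfaces, which makes its implementation strategy highly configurable. To create your own XML tree implementation you only need to provide a DocumentFactory implementation. This makes it easy to reuse dom4j code while extending dom4j with the features you need.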

【1】Reading XML data

1) Intro: dom4j ships with a set of builder classes that parse XML data and create a tree-like object structure. The following code reads XML data:

import java.io.File;
import java.net.URL;

import org.dom4j.Document;
import org.dom4j.DocumentException;
import org.dom4j.io.SAXReader;

public class DeployFileLoaderSample {
 /** dom4j object model representation of an XML document. Note: we use the interface, not its implementation. */
 private Document doc;

 /**
  * Loads a document from a file.
  * @param aFile the data source
  * @throws DocumentException if parsing fails
  */
 public void parseWithSAX(File aFile) throws DocumentException {
  SAXReader xmlReader = new SAXReader();
  this.doc = xmlReader.read(aFile);
 }

 /**
  * Loads a document from a URL.
  * @param aURL the data source
  * @throws DocumentException if parsing fails
  */
 public void parseWithSAX(URL aURL) throws DocumentException {
  SAXReader xmlReader = new SAXReader();
  this.doc = xmlReader.read(aURL);
 }

 /** Returns the most recently parsed document, or null if nothing has been parsed yet. */
 public Document getDoc() {
  return doc;
 }
}

2) The code above shows how SAXReader builds a complete dom4j tree from a given file. The org.dom4j.io package contains a set of classes for creating and serializing XML objects. The read() method is overloaded so that you can pass objects representing different kinds of sources:

java.lang.String - a system ID: a String that contains a URI, e.g. the URL of an XML file
java.net.URL - a Uniform Resource Locator that points at the XML data
java.io.InputStream - an open input stream that delivers XML data
java.io.Reader - a character stream; the encoding scheme is chosen when the Reader is created
org.xml.sax.InputSource - a single input source for an XML entity
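The source types listed above can all be produced from the same XML text using only JDK classes. The following is a minimal sketch, not part of the cookbook: the class name XmlSources, the sample XML, and the system ID "memory:demo.xml" are made up for illustration. Each returned object is of a type that an overload of SAXReader.read() accepts.

```java
import java.io.ByteArrayInputStream;
import java.io.InputStream;
import java.io.Reader;
import java.io.StringReader;
import java.nio.charset.StandardCharsets;

import org.xml.sax.InputSource;

/** Hypothetical helper that wraps one XML string in several of the source types listed above. */
final class XmlSources {
    /** Sample document, assumed for illustration only. */
    static final String XML = "<project><name>demo</name></project>";

    /** Byte stream: the parser has to work out the encoding itself (here the bytes are UTF-8). */
    static InputStream asStream() {
        return new ByteArrayInputStream(XML.getBytes(StandardCharsets.UTF_8));
    }

    /** Character stream: the text is already decoded, so no encoding detection is needed. */
    static Reader asReader() {
        return new StringReader(XML);
    }

    /** SAX input source: a character stream plus an optional system ID. */
    static InputSource asInputSource() {
        InputSource source = new InputSource(asReader());
        source.setSystemId("memory:demo.xml"); // hypothetical identifier, assumed for this sketch
        return source;
    }
}
```

Passing a Reader or an InputSource built from a Reader is the safer choice when the text has already been decoded in your program, because the parser then never has to guess the byte encoding.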

2.1) Adding further overloads of this kind makes the loader class (DeployFileLoaderSample above) more flexible; the code is the same as above.

3) Test case:

@Test
 public void readXML() {
  String base = System.getProperty("user.dir") + File.separator
    + "src" + File.separator;
 
  DeployFileLoaderSample sample = new DeployFileLoaderSample();
  try { // via parameter of URL type.
   sample.parseWithSAX(new URL("file:" + base + "pom.xml"));
   Document doc = sample.getDoc();
   System.out.println(doc.asXML());
  } catch (Exception e) {
   e.printStackTrace();
  }
 
  try { // via parameter of File type.
   sample.parseWithSAX(new File(base + "pom.xml"));
   Document doc = sample.getDoc();
   System.out.println(doc.asXML());
  } catch (Exception e) {
   e.printStackTrace();
  }
 }

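【2】Integrating dom4j with other XML APIs

1) Intro: dom4j also provides classes for integrating with the two original XML processing APIs, SAX and DOM.

2) The DOMReader class converts an existing DOM tree into a dom4j tree. It can convert a DOM document, a DOM node branch, or a single element. The code is as follows: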

import java.net.URL;

import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;

import org.dom4j.Document;
import org.dom4j.io.DOMReader;

public class DOMIntegratorSample {
 
 public DOMIntegratorSample() {}
 
 /** Parses the resource at the given URL into a W3C DOM document; returns null on failure. */
 public org.w3c.dom.Document parse(URL url) {
  DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
  try {
   DocumentBuilder builder = factory.newDocumentBuilder();
   return builder.parse(url.toString());
  } catch (Exception e) {
   e.printStackTrace();
   return null;
  }
 }
 
 /** Converts a W3C DOM document into a dom4j document. */
 public Document buildDocment(org.w3c.dom.Document domDocument) {
  DOMReader xmlReader = new DOMReader();
  return xmlReader.read(domDocument);
 }
}

public String base = System.getProperty("user.dir") + File.separator
   + "src" + File.separator;

@Test // test case
 public void testIntegrate() {
  DOMIntegratorSample sample = new DOMIntegratorSample();
  try {
   org.w3c.dom.Document doc = sample.parse(new URL("file:"+ base + "pom.xml"));
   Document doc4j  = sample.buildDocment(doc);
   System.out.println(doc4j.asXML());
  } catch (Exception e) {
   e.printStackTrace();
  }
 }
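The W3C DOM side of this integration needs only the JDK. As a minimal sketch, assuming the XML is already available as a String rather than at a URL, the DOM document can be built as shown here; the class name DomFromString is hypothetical. The result is an org.w3c.dom.Document of the kind that buildDocment() above takes as its argument.

```java
import java.io.StringReader;

import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;

import org.w3c.dom.Document;
import org.xml.sax.InputSource;

/** Hypothetical helper: builds a W3C DOM document from in-memory XML text. */
final class DomFromString {
    /** Parses the text with JAXP; returns null on failure, mirroring parse(URL) above. */
    static Document parse(String xml) {
        try {
            DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
            DocumentBuilder builder = factory.newDocumentBuilder();
            // Wrap the String in a Reader so that no byte encoding is involved.
            return builder.parse(new InputSource(new StringReader(xml)));
        } catch (Exception e) {
            e.printStackTrace();
            return null;
        }
    }
}
```

Returning null keeps the sketch consistent with the sample class; in production code, letting the exception propagate is usually easier to debug.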

【3】Secrets of DocumentFactory

1) Intro: creating a Document from scratch. The code is as follows:

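import org.dom4j.Document;
import org.dom4j.DocumentFactory;

public class GranuatedDeployFileCreator {
 private DocumentFactory factory;
 private Document doc;

 public GranuatedDeployFileCreator() {
  this.factory = DocumentFactory.getInstance(); // returns the shared singleton factory
 }
 public void generateDoc(String aRootElement) {
  doc = factory.createDocument();
  doc.addElement(aRootElement); // the new element becomes the document's root
 }
 // Accessor used by the test case
 public Document getDoc() {
  return doc;
 }
}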

1.1) Test case:

@Test
 public void testGenerateDoc() {
  GranuatedDeployFileCreator creator = new GranuatedDeployFileCreator();
 
  creator.generateDoc("project");
  Document doc = creator.getDoc();
  System.out.println(doc.asXML());
 }

2) The Document and Element interfaces provide a number of helper methods for building XML documents dynamically in a simple way:

import org.dom4j.Document;
import org.dom4j.DocumentHelper;
import org.dom4j.Element;

public class Foo {
 
 public Foo() {}
 
 public Document createDocument() {
  Document document = DocumentHelper.createDocument();
  Element root = document.addElement("root");
  // addElement/addAttribute/addText each return the element, so the calls can be chained
  root.addElement("author").addAttribute("name", "Toby").addAttribute("location", "Germany")
    .addText("Tobias Rademacher");
  root.addElement("author").addAttribute("name", "James").addAttribute("location", "UK")
    .addText("James Strachan");
  return document;
 }
}

2.1) Test case:

@Test
 public void testCreateDocByHelper() {
  Foo foo = new Foo();
 
  Document doc = foo.createDocument();
  System.out.println(doc.asXML());
 }

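2.2) dom4j is an interface-based API. This means that DocumentFactory and the reader classes in dom4j always work with the org.dom4j interfaces rather than their implementation classes. The Java Collections API and the W3C DOM take the same approach.

2.3) Once you have parsed or created a document, you will want to serialize it to disk or to a plain stream. dom4j provides a set of classes that serialize your dom4j tree in four ways: XML, HTML, DOM, and SAX events.

【4】Serializing to XML

1) Intro: The XMLWriter constructor accepts an output stream together with the required character encoding. A Writer is easier to use than an OutputStream because it is character-based and therefore has fewer encoding issues. The write() methods of XMLWriter are overloaded, so you can write out individual dom4j objects as needed.

2) The code is as follows: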

// Serialize to XML
import java.io.OutputStream;

import org.dom4j.Document;
import org.dom4j.io.OutputFormat;
import org.dom4j.io.XMLWriter;

public class DeployFileCreator3 {
 private Document doc;
 public DeployFileCreator3(Document doc) {
  this.doc = doc;
 }
 
 public void serializetoXML(OutputStream out, String aEncodingScheme) throws Exception {
  OutputFormat outformat = OutputFormat.createPrettyPrint();
  outformat.setEncoding(aEncodingScheme); // also written into the XML declaration
  XMLWriter writer = new XMLWriter(out, outformat);
  writer.write(this.doc);
  writer.flush();
  writer.close(); // also closes the underlying stream
 }
 
}

3) Test case:

@Test
 public void testSerializetoXML() {
  Foo foo = new Foo();
 
  Document doc = foo.createDocument();
  DeployFileCreator3 creator = new DeployFileCreator3(doc);
  try {
   creator.serializetoXML(new FileOutputStream(base + "serializable.xml"),
     "UTF-8");
   System.out.println("serializable successfully.");
  } catch (Exception e) {
   e.printStackTrace();
  }
 }

<?xml version="1.0" encoding="UTF-8"?>
<root>
  <author name="Toby" location="Germany">Tobias Rademacher</author>
  <author name="James" location="UK">James Strachan</author>
</root>

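【4.1】Customizing the output format

1) Intro: you can define the XML output format yourself, including the character encoding (the aEncodingScheme parameter above):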

// Customize the output format.
import java.io.OutputStream;

import org.dom4j.Document;
import org.dom4j.io.OutputFormat;
import org.dom4j.io.XMLWriter;

public class DeployFileCreator4 {
 private Document doc;
 private OutputFormat outFormat;
 public DeployFileCreator4(Document doc) {
  this.outFormat = OutputFormat.createPrettyPrint();
  this.doc = doc;
 }
 public DeployFileCreator4(Document doc, OutputFormat outFormat) {
  this.doc = doc;
  this.outFormat = outFormat;
 }
 public void writeAsXML(OutputStream out) throws Exception {
  XMLWriter writer = new XMLWriter(out, this.outFormat);
  writer.write(this.doc);
  writer.flush(); // push any buffered output to the stream; the caller still owns and closes it
 }
 public void writeAsXML(OutputStream out, String encoding) throws Exception {
  this.outFormat.setEncoding(encoding);
  this.writeAsXML(out);
 }
}

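2) An interesting feature of OutputFormat is the ability to set the character encoding. It is good practice to set XMLWriter's encoding through this mechanism, because the same encoding is then used both to encode the bytes written to the OutputStream and to write the XML declaration.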
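To see why the declared encoding and the actual byte encoding should agree, here is a minimal JDK-only sketch that does by hand what a single encoding setting takes care of. It does not use dom4j, and the class name EncodingDemo and its write() method are hypothetical. The same encoding name drives both the XML declaration and the OutputStreamWriter that turns characters into bytes.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.OutputStreamWriter;
import java.io.UncheckedIOException;
import java.io.Writer;
import java.nio.charset.Charset;

/** Hypothetical demo: one encoding name controls both the declaration and the bytes. */
final class EncodingDemo {
    /**
     * Writes a tiny XML document to memory and returns its bytes.
     * The text is assumed to contain no markup characters, so no escaping is done.
     */
    static byte[] write(String text, String encoding) {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        // try-with-resources closes the writer, which flushes everything into 'out'
        try (Writer writer = new OutputStreamWriter(out, Charset.forName(encoding))) {
            writer.write("<?xml version=\"1.0\" encoding=\"" + encoding + "\"?>");
            writer.write("<author>" + text + "</author>");
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return out.toByteArray();
    }
}
```

If the bytes were produced with one charset while the declaration named another, a parser that trusts the declaration would misread any non-ASCII character. Setting the encoding in one place avoids that mismatch.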

3) Test case:

@Test
 public void testCustomizeOutputFormat() {
  Foo foo = new Foo();
 
  Document doc = foo.createDocument();
  OutputFormat format = OutputFormat.createCompactFormat();
  format.setEncoding("UTF-8");
  DeployFileCreator4 creator = new DeployFileCreator4(
    doc, format);
  try {
   creator.writeAsXML(new FileOutputStream(base + "customizeFormat.xml"));
   System.out.println("successful customize format");
  } catch (Exception e) {
   e.printStackTrace();
  }
 }

<?xml version="1.0" encoding="UTF-8"?>
<root><author name="Toby" location="Germany">Tobias Rademacher</author><author name="James" location="UK">James Strachan</author></root>

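【5】Printing HTML

1) Intro: HTMLWriter takes a dom4j tree and formats it to a stream as HTML. This formatter is similar to XMLWriter, but it outputs the text of CDATA and entity sections rather than their serialized XML form, and it supports the many HTML elements that have no closing tag, such as <br>.

2) The code is as follows: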

import java.io.OutputStream;

import org.dom4j.Document;
import org.dom4j.io.HTMLWriter;
import org.dom4j.io.OutputFormat;

public class PrintHTML {
	private Document doc;
	private OutputFormat outFormat;

	public PrintHTML(Document doc) {
		this.outFormat = OutputFormat.createPrettyPrint();
		this.doc = doc;
	}

	public PrintHTML(Document doc, OutputFormat outFormat) {
		this.doc = doc;
		this.outFormat = outFormat;
	}

	/** Writes the document as HTML; the caller still owns and closes the stream. */
	public void writeAsHTML(OutputStream out) throws Exception {
		HTMLWriter writer = new HTMLWriter(out, this.outFormat);
		writer.write(this.doc);
		writer.flush();
	}
}

3) Test case:

@Test
	public void testPrintHTML() {
		Foo foo = new Foo();
		
		Document doc = foo.createDocument();
		PrintHTML creator = new PrintHTML(doc);
		try {
			creator.writeAsHTML(new FileOutputStream(base + "printHtml.html"));
			System.out.println("PrintHTML successfully");
		} catch (Exception e) {
			e.printStackTrace();
		}
	}
