NOTE: We’re currently working on documenting these sections. We believe the information here is accurate; however, be aware that we are still working on this chapter. Additional information will be added as we go, which should make this chapter more solid.


Introducing WebDriver

The primary new feature in Selenium 2.0 is the integration of the WebDriver API. WebDriver is designed to provide a simpler, more concise programming interface in addition to addressing some limitations in the Selenium-RC API. Selenium-WebDriver was developed to better support dynamic web pages where elements of a page may change without the page itself being reloaded. WebDriver’s goal is to supply a well-designed object-oriented API that provides improved support for modern advanced web-app testing problems.


How Does WebDriver ‘Drive’ the Browser Compared to Selenium-RC?

Selenium-WebDriver makes direct calls to the browser using each browser’s native support for automation. How these direct calls are made, and the features they support, depend on the browser you are using. Information on each ‘browser driver’ is provided later in this chapter.


For those familiar with Selenium-RC, this is quite different from what you are used to. Selenium-RC worked the same way for each supported browser. It ‘injected’ JavaScript functions into the browser when the browser was loaded and then used that JavaScript to drive the AUT within the browser. WebDriver does not use this technique. Again, it drives the browser directly using the browser’s built-in support for automation.

WebDriver and the Selenium-Server

Whether you need the Selenium-Server depends on how you intend to use Selenium-WebDriver. If your browser and tests all run on the same machine, and your tests use only the WebDriver API, then you do not need to run the Selenium-Server; WebDriver will run the browser directly.

There are, though, some reasons to use the Selenium-Server with Selenium-WebDriver:

  • You are using Selenium-Grid to distribute your tests over multiple machines or virtual machines (VMs).

  • You want to connect to a remote machine that has a particular browser version that is not on your current machine.

  • You are not using the Java bindings (that is, you are using Python, C#, or Ruby) and would like to use HtmlUnit Driver.

Setting Up a Selenium-WebDriver Project

Installing Selenium means setting up a project in a development environment so you can write a program using Selenium. How you do this depends on your programming language and your development environment.


The easiest way to set up a Selenium 2.0 Java project is to use Maven. Maven will download the Java bindings (the Selenium 2.0 Java client library) and all their dependencies, and will create the project for you, using a Maven pom.xml (project configuration) file. Once you’ve done this, you can import the Maven project into your preferred IDE, such as IntelliJ IDEA or Eclipse.

First, create a folder to contain your Selenium project files. Then, to use Maven, you need a pom.xml file. This can be created with a text editor. We won’t teach the details of pom.xml files or of using Maven, since there are already excellent references on this. Your pom.xml file will look something like this. Create this file in the folder you created for your project.
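As a rough guide, a minimal pom.xml might be sketched as follows. This is an illustration, not a definitive configuration. It assumes the Java bindings are published under the Maven coordinates org.seleniumhq.selenium:selenium-java. The project coordinates (com.example, selenium-example) are hypothetical, and SELENIUM_VERSION is a placeholder that must be replaced with a real release number before Maven can resolve the dependency.

```xml
<?xml version="1.0" encoding="UTF-8"?>
<project xmlns="http://maven.apache.org/POM/4.0.0"
         xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
         xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 http://maven.apache.org/xsd/maven-4.0.0.xsd">
    <modelVersion>4.0.0</modelVersion>

    <!-- Hypothetical coordinates for your own project; rename freely -->
    <groupId>com.example</groupId>
    <artifactId>selenium-example</artifactId>
    <version>1.0</version>

    <dependencies>
        <dependency>
            <!-- Selenium Java client library (assumed coordinates) -->
            <groupId>org.seleniumhq.selenium</groupId>
            <artifactId>selenium-java</artifactId>
            <!-- Placeholder: replace with the current release number -->
            <version>SELENIUM_VERSION</version>
        </dependency>
    </dependencies>
</project>
```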



Be sure you specify the most current version of the Selenium dependency. Releases were frequent immediately after the release of Selenium 2.0, so check the Maven download page for the current release and edit the dependency in your pom.xml accordingly.

Now, from a command line, cd into the project directory and run Maven as follows.

mvn clean install

This will download Selenium and all its dependencies and will add them to the project.


Finally, import the project into your preferred development environment. For those not familiar with this, we’ve provided an appendix that shows how: see Importing a maven project into IntelliJ IDEA and Importing a maven project into Eclipse.

Introducing the Selenium-WebDriver API by Example

WebDriver is a tool for automating web application testing, and in particular for verifying that applications work as expected. It aims to provide a friendly API that’s easy to explore and understand, and easier to use than the Selenium-RC (1.0) API, which will help to make your tests easier to read and maintain. It’s not tied to any particular test framework, so it can be used equally well in a unit testing project or from a plain old “main” method. This section introduces WebDriver’s API and helps you get started with it. Start by setting up a WebDriver project if you haven’t already. This was described in the previous section, Setting Up a Selenium-WebDriver Project.


Once your project is set up, you can see that WebDriver acts just like any normal library: it is entirely self-contained, and you usually don’t need to remember to start any additional processes or run any installers before using it, as opposed to the proxy server with Selenium-RC.

Note: Additional steps are required to use ChromeDriver, Opera Driver, Android Driver, and iOS Driver.

You’re now ready to write some code. An easy way to get started is this example, which searches for the term “Cheese” on Google and then outputs the result page’s title to the console.


package org.openqa.selenium.example;

import org.openqa.selenium.By;
import org.openqa.selenium.WebDriver;
import org.openqa.selenium.WebElement;
import org.openqa.selenium.firefox.FirefoxDriver;
import org.openqa.selenium.support.ui.ExpectedCondition;
import org.openqa.selenium.support.ui.WebDriverWait;

public class Selenium2Example  {
    public static void main(String[] args) {
        // Create a new instance of the Firefox driver
        // Notice that the remainder of the code relies on the interface, 
        // not the implementation.
        WebDriver driver = new FirefoxDriver();

        // And now use this to visit Google
        // (the address is assumed here; it is missing from this copy of the listing)
        driver.get("https://www.google.com");
        // Alternatively the same thing can be done like this
        // driver.navigate().to("https://www.google.com");

        // Find the text input element by its name
        WebElement element = driver.findElement(By.name("q"));

        // Enter something to search for
        element.sendKeys("Cheese!");

        // Now submit the form. WebDriver will find the form for us from the element
        element.submit();

        // Check the title of the page
        System.out.println("Page title is: " + driver.getTitle());

        // Google's search is rendered dynamically with JavaScript.
        // Wait for the page to load, timeout after 10 seconds
        (new WebDriverWait(driver, 10)).until(new ExpectedCondition<Boolean>() {
            public Boolean apply(WebDriver d) {
                return d.getTitle().toLowerCase().startsWith("cheese!");
            }
        });

        // Should see: "cheese! - Google Search"
        System.out.println("Page title is: " + driver.getTitle());

        // Close the browser
        driver.quit();
    }
}

In upcoming sections, you will learn more about how to use WebDriver for things such as navigating forward and backward in your browser’s history, and how to test web sites that use frames and windows. We also provide more thorough discussions and examples.


Selenium-WebDriver API Commands and Operations

Fetching a Page

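The first thing you’re likely to want to do with WebDriver is navigate to a page. The normal way to do this is by calling “get” on the driver with the address of the page, for example driver.get(url), where url is a String holding that address.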



Depending on several factors, including the OS/Browser combination, WebDriver may or may not wait for the page to load. In some circumstances, WebDriver may return control before the page has finished, or even started, loading. To ensure robustness, you need to wait for the element(s) to exist in the page using Explicit and Implicit Waits.
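The idea behind waiting explicitly can be sketched in plain Java: poll a condition repeatedly until it holds or a timeout elapses. The sketch below is not Selenium code and is not how WebDriver implements its waits; the class name PollingWaitSketch, the helper waitUntil, and the pageReady condition are all hypothetical stand-ins for "the element now exists in the page".

```java
import java.time.Duration;
import java.util.function.BooleanSupplier;

// Plain-Java illustration of polling with a timeout (hypothetical helper, not a Selenium API).
final class PollingWaitSketch {

    /**
     * Polls the condition until it returns true or the timeout elapses.
     * Returns true if the condition was met, false if the wait timed out
     * or the waiting thread was interrupted.
     */
    public static boolean waitUntil(BooleanSupplier condition, Duration timeout, Duration pollInterval) {
        long deadline = System.nanoTime() + timeout.toNanos();
        while (true) {
            if (condition.getAsBoolean()) {
                return true;
            }
            if (System.nanoTime() >= deadline) {
                return false;
            }
            try {
                Thread.sleep(pollInterval.toMillis());
            } catch (InterruptedException e) {
                // Preserve the interrupt status and give up waiting.
                Thread.currentThread().interrupt();
                return false;
            }
        }
    }

    public static void main(String[] args) {
        long start = System.currentTimeMillis();
        // Stand-in for a page that finishes loading roughly 200 ms after the request.
        BooleanSupplier pageReady = () -> System.currentTimeMillis() - start >= 200;

        boolean loaded = waitUntil(pageReady, Duration.ofSeconds(2), Duration.ofMillis(50));
        System.out.println("Condition met before timeout: " + loaded);
    }
}
```

The design choice worth noting is that the condition is checked before the deadline, so a condition that is already true returns immediately, and a slow page costs at most one poll interval beyond the moment it becomes ready.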


Locating UI Elements (WebElements)


Locating elements in WebDriver can be done on the WebDriver instance itself or on a WebElement. Each of the language bindings exposes a “Find Element” and a “Find Elements” method. The first returns a WebElement object, or throws an exception if no element matches. The latter returns a list of WebElements, which may be empty if no DOM elements match the query.
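The difference between the two contracts matters for how calling code is written: a single-result lookup must be guarded against an exception, while a multi-result lookup must be checked for an empty list. The plain-Java model below illustrates only that contract. It is not Selenium code: the class FindContractSketch and the helpers findOne and findAll are hypothetical, plain strings stand in for page elements, and java.util.NoSuchElementException stands in for whatever exception the real bindings throw.

```java
import java.util.Arrays;
import java.util.List;
import java.util.NoSuchElementException;
import java.util.function.Predicate;
import java.util.stream.Collectors;

// Plain-Java model of "find one or throw" versus "find all, possibly none" (not a Selenium API).
final class FindContractSketch {

    /** Returns the first item matching the query, or throws if nothing matches. */
    public static <T> T findOne(List<T> items, Predicate<? super T> query) {
        return items.stream()
                .filter(query)
                .findFirst()
                .orElseThrow(() -> new NoSuchElementException("no item matched the query"));
    }

    /** Returns every item matching the query; the list is empty if nothing matches. */
    public static <T> List<T> findAll(List<T> items, Predicate<? super T> query) {
        return items.stream().filter(query).collect(Collectors.toList());
    }

    public static void main(String[] args) {
        // Strings stand in for elements on a page.
        List<String> ids = Arrays.asList("header", "search-box", "footer");

        String one = findOne(ids, id -> id.startsWith("search"));
        System.out.println("findOne matched: " + one);

        // No match: the multi-result lookup simply returns an empty list.
        List<String> none = findAll(ids, id -> id.equals("sidebar"));
        System.out.println("findAll match count: " + none.size());

        // No match: the single-result lookup signals the failure with an exception.
        try {
            findOne(ids, id -> id.equals("sidebar"));
        } catch (NoSuchElementException e) {
            System.out.println("findOne threw: " + e.getMessage());
        }
    }
}
```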

The “Find” methods take a locator or query object called “By”. “By” strategies are listed below.



By ID

This is the most efficient and preferred way to locate an element. Common pitfalls for UI developers are having non-unique ids on a page and auto-generating the id; both should be avoided. A class on an HTML element is more appropriate than an auto-generated id.


Example of how to find an element whose id attribute is “coolestWidgetEvah”:


WebElement element = driver.findElement(By.id("coolestWidgetEvah"));

By Class Name

“Class” in this case refers to the attribute on the DOM element. Often in practical use there are many DOM elements with the same class name, thus finding multiple elements becomes the more practical option over finding the first element.


Example of how to find an element that looks like this:
