Angela㐅cc

潭州课堂25班：Ph201805201 爬虫基础第八课 selenium (课堂笔记）

Selenium笔记（1）安装和简单使用

简介

Selenium是一个用于Web应用程序测试的工具。

Selenium测试直接运行在浏览器中，就像真正的用户在操作一样。支持的浏览器包括IE（7, 8, 9, 10, 11），Firefox，Safari，Chrome，Opera等。

这个工具的主要功能包括：测试与浏览器的兼容性——测试你的应用程序看是否能够很好得工作在不同浏览器和操作系统之上。测试系统功能——创建回归测试检验软件功能和用户需求。

而用在爬虫上则是模拟正常用户访问网页并获取数据。

安装

ChromeDriver（浏览器驱动）安装

使用selenium驱动chrome浏览器需要下载chromedriver，而且chromedriver版本需要与chrome的版本对应，版本错误的话则会运行报错。

Chromedriver下载地址：https://chromedriver.storage.googleapis.com/index.html

Chromedriver与Chrome版本映射表：

chromedriver版本	支持的Chrome版本
v2.37	v64-66
v2.36	v63-65
v2.35	v62-64
v2.34	v61-63
v2.33	v60-62
v2.32	v59-61
v2.31	v58-60
v2.30	v58-60
v2.29	v56-58
v2.28	v55-57
v2.27	v54-56
v2.26	v53-55
v2.25	v53-55
v2.24	v52-54
v2.23	v51-53

Mac/Linux

下载完成解压后，将文件移动至/usr/local/bin目录中，则可以正常使用。

Windows

也可将驱动文件许放在脚本文件下

下载完成解压后，将文件移动到一个配置了环境变量的文件夹中，例如你的Python安装文件夹。

Selenium安装

Selenium的安装非常简单，直接pip就可以搞定。

pip install selenium

简单使用

Chrome无界面运行

这是chrome浏览器2017年发布的新特性，需要unix版本的chrome版本高于57，windows版本的chrome版本高于58。

使用selenium无界面运行chrome的代码如下:

from selenium import webdriver
from selenium.webdriver.chrome.options import Options

# 实例化一个启动参数对象 chrome_options = Options() # 设置浏览器以无界面方式运行 chrome_options.add_argument('--headless') # 官方文档表示这一句在之后的版本会消失，但目前版本需要加上此参数 chrome_options.add_argument('--disable-gpu') # 设置浏览器参数时最好固定好窗口大小，窗口大小不同会在解析网页时出现不同的结果 chrome_options.add_argument('--window-size=1366,768') # 启动浏览器 browser = webdriver.Chrome(chrome_options=chrome_options)

运行上述代码，则会打开一个无界面chrome浏览器的空白页，去掉headless那一句可以看到效果。

Selenium简单例子

这是一个打开百度首页，在输入框中输入Python，并点击搜索的例子。

from selenium import webdriver
from selenium.webdriver.common.by import By
from selenium.webdriver.support import expected_conditions as EC from selenium.webdriver.support.wait import WebDriverWait # 打开一个Chrome浏览器 browser = webdriver.Chrome() # 请求百度首页 browser.get('https://www.baidu.com') # 找到输入框位置 input = WebDriverWait(browser, 10).until( EC.presence_of_element_located((By.XPATH, '//*[@id="kw"]')) ) # 在输入框中输入Python input.send_keys('Python') # 找到输入按钮 button = WebDriverWait(browser, 10).until( EC.element_to_be_clickable( (By.XPATH, '//*[@id="su"]')) ) # 点击一次输入按钮 button.click() browser.quit()

# -*- coding: utf-8 -*-
# 斌彬电脑
# @Time : 2018/9/6 0006 5:08


#  开启谷歌浏览器
from selenium import webdriver
drt = webdriver.Chrome()
drt.get('http://www.baidu.com')

from selenium.webdriver.support.wait import WebDriverWait
from selenium.webdriver.support import expected_conditions as EC
from selenium.webdriver.common.by import By
# 找到搜索框，
input = WebDriverWait(drt, 10).until(EC.presence_of_element_located((By.XPATH,'//input[@id="kw"]')))
input.send_keys('123')
# 找到百度一下按钮
btn = WebDriverWait(drt, 10).until(EC.element_to_be_clickable((By.XPATH,'//*[@id="su"]')))
btn.click()
#关闭浏览器
# drt.quit()

Selenium笔记（2）Chrome Webdriver启动选项

在Selenium中使用不同的Webdriver可能会有不一样的方法，有些相同的操作会得到不一样的结果，本文主要介绍的是Chrome()的使用方法。

其他Webdriver可以查阅官方文档。

Chrome WebDriver Options

简介

这是一个Chrome的参数对象，在此对象中使用add_argument()方法可以添加启动参数，添加完毕后可以在初始化Webdriver对象时将此Options对象传入，则可以实现以特定参数启动Chrome。

例子

from selenium import webdriver
from selenium.webdriver.chrome.options import Options

# 实例化一个启动参数对象 chrome_options = Options() # 添加启动参数 chrome_options.add_argument('--window-size=1366,768') # 将参数对象传入Chrome，则启动了一个设置了窗口大小的Chrome browser = webdriver.Chrome(chrome_options=chrome_options)

常用的启动参数

启动参数	作用
--user-agent=""	设置请求头的User-Agent
--window-size=1366,768	设置浏览器分辨率
--headless	无界面运行
--start-maximized	最大化运行
--incognito	隐身模式
--disable-javascript	禁用javascript
--disable-infobars	禁用浏览器正在被自动化程序控制的提示

完整启动参数可以到此页面查看：

https://peter.sh/experiments/chromium-command-line-switches/

禁用图片加载

Chrome的禁用图片加载参数设置比较复杂，如下所示：

prefs = {
    'profile.default_content_setting_values' : {
        'images' : 2
    }
}
options.add_experimental_option('prefs',prefs)

禁用浏览器弹窗

使用浏览器时常常会有弹窗弹出，以下选项可以禁止弹窗：

prefs = {  
    'profile.default_content_setting_values' :  {  
        'notifications' : 2  
     }  
}  
options.add_experimental_option('prefs',prefs)

完整文档

class selenium.webdriver.chrome.options.Options

Bases: object

Method

__init__()

add_argument(argument)

Adds an argument to the listArgs:Sets the arguments
add_encoded_extension(extension)

Adds Base64 encoded string with extension data to a list that will be used to extract it to the ChromeDriverArgs:extension: Base64 encoded string with extension data
add_experimental_option(name, value)

Adds an experimental option which is passed to chrome.Args:name: The experimental option name. value: The option value.
add_extension(extension)

Adds the path to the extension to a list that will be used to extract it to the ChromeDriverArgs:extension: path to the *.crx file
set_headless(headless=True)

Sets the headless argumentArgs:headless: boolean value indicating to set the headless option
to_capabilities()

Creates a capabilities with all the options that have been set andreturns a dictionary with everything

Values

KEY = 'goog:chromeOptions'

arguments

Returns a list of arguments needed for the browser
binary_location

Returns the location of the binary otherwise an empty string
debugger_address

Returns the address of the remote devtools instance
experimental_options

Returns a dictionary of experimental options for chrome.
extensions

Returns a list of encoded extensions that will be loaded into chrome
headless

Returns whether or not the headless argument is set

Chrome WebDriver对象

简介

这个对象继承自selenium.webdriver.remote.webdriver.WebDriver，这个类会在下一章讲到，Chrome的WebDriver作为子类增添了几个方法。

指定chromedriver.exe的位置

chromedriver.exe一般可以放在环境文件中，但是有时候为了方便部署项目，或者为了容易打包，我们可以将chromedriver.exe放到我们的项目目录中，然后在初始化Chrome Webdriver对象时，传入chromedriver.exe的路径。

如下所示：

from selenium import webdriver
browser = webdriver.Chrome(executable_path='chromedriver.exe')

完整文档

class selenium.webdriver.chrome.webdriver.WebDriver(executable_path='chromedriver', port=0, options=None,service_args=None, desired_capabilities=None, service_log_path=None, chrome_options=None)

Bases: selenium.webdriver.remote.webdriver.WebDriver

Controls the ChromeDriver and allows you to drive the browser.

You will need to download the ChromeDriver executable fromhttp://chromedriver.storage.googleapis.com/index.html

__init__(executable_path='chromedriver', port=0, options=None, service_args=None, desired_capabilities=None,service_log_path=None, chrome_options=None)

Creates a new instance of the chrome driver.

Starts the service and then creates new instance of chrome driver.

Args:
- executable_path - path to the executable. If the default is used it assumes the executable is in the $PATHport
- port you would like the service to run, if left as 0, a free port will be found.
- desired_capabilities: Dictionary object with non-browser specific capabilities only, such as “proxy” or “loggingPref”.
- options: this takes an instance of ChromeOptions
create_options()

get_network_conditions()

Gets Chrome network emulation settings.

Returns:A dict. For example:

{‘latency’: 4, ‘download_throughput’: 2, ‘upload_throughput’: 2, ‘offline’: False}
launch_app(id)

Launches Chrome app specified by id.
quit()

Closes the browser and shuts down the ChromeDriver executable that is started when starting the ChromeDriver

set_network_conditions(**network_conditions)

Sets Chrome network emulation settings.

Args:

network_conditions: A dict with conditions specification.

Usage:

driver.set_network_conditions(offline=False, latency=5, # additional latency (ms)
                              download_throughput=500 * 1024, # maximal throughput upload_throughput=500 * 1024) # maximal throughput

Note: ‘throughput’ can be used to set both (for download and upload).

Selenium笔记（3）Remote Webdriver

简介

selenium.webdriver.remote.webdriver.WebDriver 这个类其实是所有其他Webdriver的父类，例如Chrome Webdriver，Firefox Webdriver都是继承自这个类。这个类中实现了每个Webdriver间相通的方法。

常用操作

get(url)

在当前浏览器会话中访问传入的url地址。

用法：
```
driver.get('https://www.baidu.com')
```
close()

关闭浏览器当前窗口。
quit()

退出webdriver并关闭所有窗口。
refresh()

刷新当前页面。
title

获取当前页的标题。
page_source

获取当前页渲染后的源代码。
current_url

获取当前页面的url。
window_handles

获取当前会话中所有窗口的句柄。

查找元素

Webdriver对象中内置了查找节点元素的方法，使用非常方便。

单个查找

以下是查找单个元素的方法：

方法	作用
`find_element_by_xpath`()	通过`Xpath`查找
`find_element_by_class_name`()	通过`class属性`查找
`find_element_by_css_selector`()	通过`css选择器`查找
`find_element_by_id`()	通过`id`查找
`find_element_by_link_text`()	通过`链接文本`查找
`find_element_by_name`()	通过`name属性`进行查找
`find_element_by_partial_link_text`()	通过`链接文本的部分匹配`查找
`find_element_by_tag_name`()	通过`标签名`查找

查找后返回的是一个Webelement对象。

多个查找

上面的方法都是将第一个找到的元素进行返回，而将所有匹配的元素进行返回使用的是find_elements_by_*方法。

注：将其中的element加上一个s，则是对应的多个查找方法。

此方法返回的是一个Webelement对象组成的列表。

通过私有方法进行查找

除了以上的多种查找方式，还有两种私有方法find_element()和find_elements()可以使用：

例子：

from selenium.webdriver.common.by import By

driver.find_element(By.XPATH, '//button[text()="Some text"]')
driver.find_elements(By.XPATH, '//button')

By这个类是专门用来查找元素时传入的参数，这个类中有以下属性：

ID = "id"
XPATH = "xpath"
LINK_TEXT = "link text"
PARTIAL_LINK_TEXT = "partial link text"
NAME = "name" TAG_NAME = "tag name" CLASS_NAME = "class name" CSS_SELECTOR = "css selector"

操作Cookie

add_cookie(cookie_dict)

给当前会话添加一个cookie。

cookie_dict: 一个字典对象，必须要有"name"和"value"两个键，可选的键有：“path”, “domain”, “secure”, “expiry” 。

用法：

driver.add_cookie({‘name’ : ‘foo’, ‘value’ : ‘bar’})
driver.add_cookie({‘name’ : ‘foo’, ‘value’ : ‘bar’, ‘path’ : ‘/’})
driver.add_cookie({‘name’ : ‘foo’, ‘value’ : ‘bar’, ‘path’ : ‘/’, ‘secure’:True})

get_cookie(name)

按name获取单个Cookie，没有则返回None。
get_cookies()

获取所有Cookie，返回的是一组字典。
delete_all_cookies()¶

删除所有Cookies。
delete_cookie(name)

按name删除指定cookie。

获取截屏

get_screenshot_as_base64()

获取当前窗口的截图保存为一个base64编码的字符串。
get_screenshot_as_file(filename)

获取当前窗口的截图保存为一个png格式的图片，filename参数为图片的保存地址，最后应该以.png结尾。如果出现IO错误，则返回False。

用法：
```
driver.get_screenshot_as_file(‘/Screenshots/foo.png’)
```
get_screenshot_as_png()

获取当前窗口的截图保存为一个png格式的二进制字符串。

获取窗口信息

get_window_position(windowHandle='current')

获取当前窗口的x,y坐标。
get_window_rect()

获取当前窗口的x,y坐标和当前窗口的高度和宽度。
get_window_size(windowHandle='current')

获取当前窗口的高度和宽度。

切换

switch_to_frame(frame_reference)

将焦点切换到指定的子框架中
switch_to_window(window_name)

切换窗口

执行JS代码

execute_async_script(script, *args)

在当前的window/frame中异步执行JS代码。

script：是你要执行的JS代码。

*args：是你的JS代码执行要传入的参数。

用法：

script = “var callback = arguments[arguments.length - 1]; ”
script2 = “window.setTimeout(function(){ callback(‘timeout’) }, 3000);” 
driver.execute_async_script(script + script2)

execute_script(script, *args)

在当前的window/frame中同步执行JS代码。

script：是你要执行的JS代码。

*args：是你的JS代码执行要传入的参数。

完整文档

class selenium.webdriver.remote.webdriver.``WebDriver(command_executor='http://127.0.0.1:4444/wd/hub',desired_capabilities=None, browser_profile=None, proxy=None, keep_alive=False, file_detector=None, options=None)

Bases: object

Controls a browser by sending commands to a remote server. This server is expected to be running the WebDriver wire protocol as defined at

https://github.com/SeleniumHQ/selenium/wiki/JsonWireProtocol 。

Attributes:
- session_id - String ID of the browser session started and controlled by this WebDriver.
- capabilities - Dictionaty of effective capabilities of this browser session as returned
  
  by the remote server. See https://github.com/SeleniumHQ/selenium/wiki/DesiredCapabilities
- command_executor - remote_connection.RemoteConnection object used to execute commands.
- error_handler - errorhandler.ErrorHandler object used to handle errors.
__init__(command_executor='http://127.0.0.1:4444/wd/hub', desired_capabilities=None, browser_profile=None, proxy=None,keep_alive=False, file_detector=None, options=None)

Create a new driver that will issue commands using the wire protocol.

Args:
- command_executor - Either a string representing URL of the remote server or a customremote_connection.RemoteConnection object. Defaults to ‘http://127.0.0.1:4444/wd/hub’.
- desired_capabilities - A dictionary of capabilities to request whenstarting the browser session. Required parameter.
- browser_profile - A selenium.webdriver.firefox.firefox_profile.FirefoxProfile object.Only used if Firefox is requested. Optional.
- proxy - A selenium.webdriver.common.proxy.Proxy object. The browser session willbe started with given proxy settings, if possible. Optional.
- keep_alive - Whether to configure remote_connection.RemoteConnection to useHTTP keep-alive. Defaults to False.
- file_detector - Pass custom file detector object during instantiation. If None,then default LocalFileDetector() will be used.
- options - instance of a driver options.Options class

add_cookie(cookie_dict)

Adds a cookie to your current session.

Args:

cookie_dict: A dictionary object, with required keys - “name” and “value”;optional keys - “path”, “domain”, “secure”, “expiry”

Usage:

driver.add_cookie({‘name’ : ‘foo’, ‘value’ : ‘bar’})
driver.add_cookie({‘name’ : ‘foo’, ‘value’ : ‘bar’, ‘path’ : ‘/’})
driver.add_cookie({‘name’ : ‘foo’, ‘value’ : ‘bar’, ‘path’ : ‘/’, ‘secure’:True})

back()

Goes one step backward in the browser history.

Usage:

driver.back()
close()

Closes the current window.Usage:driver.close()
create_web_element(element_id)

Creates a web element with the specified element_id.
delete_all_cookies()

Delete all cookies in the scope of the session.

Usage:

driver.delete_all_cookies()
delete_cookie(name)

Deletes a single cookie with the given name.

Usage:

driver.delete_cookie(‘my_cookie’)
execute(driver_command, params=None)

Sends a command to be executed by a command.CommandExecutor.

Args:
- driver_command: The name of the command to execute as a string.
- params: A dictionary of named parameters to send with the command.
Returns:

The command’s JSON response loaded into a dictionary object.
execute_async_script(script, *args)

Asynchronously Executes JavaScript in the current window/frame.

Args:
- script: The JavaScript to execute.
- *args: Any applicable arguments for your JavaScript.
Usage:
```
script = “var callback = arguments[arguments.length - 1]; ” “window.setTimeout(function(){ callback(‘timeout’) }, 3000);”
driver.execute_async_script(script)
```
execute_script(script, *args)

Synchronously Executes JavaScript in the current window/frame.

Args:
- script: The JavaScript to execute.
- *args: Any applicable arguments for your JavaScript.
Usage:
```
driver.execute_script(‘return document.title;’)
```
file_detector_context(*args, **kwds)

Overrides the current file detector (if necessary) in limited context. Ensures the original file detector is set afterwards.

Example:
```
with webdriver.file_detector_context(UselessFileDetector):
    someinput.send_keys(‘/etc/hosts’)
```
Args:
- file_detector_class - Class of the desired file detector. If the class is differentfrom the current file_detector, then the class is instantiated with args and kwargs and used as a file detector during the duration of the context manager.
- args - Optional arguments that get passed to the file detector class duringinstantiation.
- kwargs - Keyword arguments, passed the same way as args.
find_element(by='id', value=None)

‘Private’ method used by the find_element_by_* methods.

Usage:

Use the corresponding find_element_by_* instead of this.

Return type:

WebElement
forward()

Goes one step forward in the browser history.

Usage:

driver.forward()
fullscreen_window()

Invokes the window manager-specific ‘full screen’ operation
get(url)

Loads a web page in the current browser session.
get_cookie(name)

Get a single cookie by name. Returns the cookie if found, None if not.

Usage:

driver.get_cookie(‘my_cookie’)
get_cookies()

Returns a set of dictionaries, corresponding to cookies visible in the current session.

Usage:

driver.get_cookies()
get_log(log_type)

Gets the log for a given log type

Args:
- log_type: type of log that which will be returned
Usage:

driver.get_log(‘browser’) driver.get_log(‘driver’) driver.get_log(‘client’) driver.get_log(‘server’)
get_screenshot_as_base64()

Gets the screenshot of the current window as a base64 encoded stringwhich is useful in embedded images in HTML.

Usage:

driver.get_screenshot_as_base64()
get_screenshot_as_file(filename)

Saves a screenshot of the current window to a PNG image file. ReturnsFalse if there is any IOError, else returns True. Use full paths in your filename.

Args:
- filename: The full path you wish to save your screenshot to. This should end with a .png extension.
Usage:

driver.get_screenshot_as_file(‘/Screenshots/foo.png’)
get_screenshot_as_png()

Gets the screenshot of the current window as a binary data.

Usage:

driver.get_screenshot_as_png()
get_window_position(windowHandle='current')

Gets the x,y position of the current window.

Usage:

driver.get_window_position()
get_window_rect()

Gets the x, y coordinates of the window as well as height and width of the current window.

Usage:

driver.get_window_rect()
get_window_size(windowHandle='current')

Gets the width and height of the current window.

Usage:

driver.get_window_size()
implicitly_wait(time_to_wait)

Sets a sticky timeout to implicitly wait for an element to be found,or a command to complete. This method only needs to be called one time per session. To set the timeout for calls to execute_async_script, see set_script_timeout.

Args:
- time_to_wait: Amount of time to wait (in seconds)
Usage:

driver.implicitly_wait(30)
maximize_window()

Maximizes the current window that webdriver is using
minimize_window()

Invokes the window manager-specific ‘minimize’ operation
quit()

Quits the driver and closes every associated window.

Usage:

driver.quit()
refresh()

Refreshes the current page.

Usage:

driver.refresh()
save_screenshot(filename)

Saves a screenshot of the current window to a PNG image file. ReturnsFalse if there is any IOError, else returns True. Use full paths in your filename.

Args:
- filename: The full path you wish to save your screenshot to. This should end with a .png extension.
Usage:

driver.save_screenshot(‘/Screenshots/foo.png’)
set_page_load_timeout(time_to_wait)

Set the amount of time to wait for a page load to completebefore throwing an error.

Args:
- time_to_wait: The amount of time to wait
Usage:

driver.set_page_load_timeout(30)
set_script_timeout(time_to_wait)

Set the amount of time that the script should wait during anexecute_async_script call before throwing an error.

Args:
- time_to_wait: The amount of time to wait (in seconds)
Usage:

driver.set_script_timeout(30)
set_window_position(x, y, windowHandle='current')

Sets the x,y position of the current window. (window.moveTo)

Args:
- x: the x-coordinate in pixels to set the window position
- y: the y-coordinate in pixels to set the window position
Usage:

driver.set_window_position(0,0)
set_window_rect(x=None, y=None, width=None, height=None)

Sets the x, y coordinates of the window as well as height and width of the current window.

Usage:

driver.set_window_rect(x=10, y=10) driver.set_window_rect(width=100, height=200) driver.set_window_rect(x=10, y=10, width=100, height=200)
set_window_size(width, height, windowHandle='current')

Sets the width and height of the current window. (window.resizeTo)

Args:
- width: the width in pixels to set the window to
- height: the height in pixels to set the window to
Usage:

driver.set_window_size(800,600)
start_client()

Called before starting a new session. This method may be overridden to define custom startup behavior.
start_session(capabilities, browser_profile=None)

Creates a new session with the desired capabilities.

Args:
- browser_name - The name of the browser to request.
- version - Which browser version to request.platform - Which platform to request the browser on.
- javascript_enabled - Whether the new session should support JavaScript.
- browser_profile - A selenium.webdriver.firefox.firefox_profile.FirefoxProfile object. Only used if Firefox is requested.
stop_client()

Called after executing a quit command. This method may be overridden to define custom shutdown behavior.
switch_to_active_element()

Deprecated use driver.switch_to.active_element
switch_to_alert()

Deprecated use driver.switch_to.alert
switch_to_default_content()

Deprecated use driver.switch_to.default_content
switch_to_frame(frame_reference)

Deprecated use driver.switch_to.frame
switch_to_window(window_name)

Deprecated use driver.switch_to.window
application_cache

Returns a ApplicationCache Object to interact with the browser app cache
current_url

Gets the URL of the current page.

Usage:

driver.current_url
current_window_handle

Returns the handle of the current window.

Usage:

driver.current_window_handle
desired_capabilities

returns the drivers current desired capabilities being used
file_detector

log_types

Gets a list of the available log types

Usage:

driver.log_types
mobile

name

Returns the name of the underlying browser for this instance.

Usage:

name = driver.name
orientation

Gets the current orientation of the device

Usage:

orientation = driver.orientation
page_source

Gets the source of the current page.

Usage:

driver.page_source
switch_to

Returns:
- SwitchTo: an object containing all options to switch focus into
Usage:

element = driver.switch_to.active_element alert = driver.switch_to.alert driver.switch_to.default_content() driver.switch_to.frame(‘frame_name’) driver.switch_to.frame(1) driver.switch_to.frame(driver.find_elements_by_tag_name(“iframe”)[0]) driver.switch_to.parent_frame() driver.switch_to.window(‘main’)
title

Returns the title of the current page.

Usage:

title = driver.title
window_handles

Returns the handles of all windows within the current session.

Usage:

driver.window_handles

Selenium笔记（4）Webelement

这是通过find方法找到的页面元素，此对象提供了多种方法，让我们可以与页面元素进行交互，例如点击、清空。

方法

clear()清空

如果当前元素中有文本，则清空文本
click()单击

点击当前元素
get_attribute(name)获取属性

获取元素的attribute/property

优先返回完全匹配属性名的值，如果不存在，则返回属性名中包含name的值。
screenshot(filename) 获取截图

获取当前元素的截图，保存为png，最好用绝对路径，（谷歌上用不了，火狐可以）。
send_keys(value) 模拟键入元素

给当前元素模拟输入

webelement的此方法在Chrome中应该是有bug，无法使用。
submit()提交表单

提交表单

在页面元素中，同样提供find_elements_by_*等查找方法，可以将查找范围限制到当前元素。

属性

text

获取当前元素的文本内容
tag_name

获取当前元素的标签名
size

获取当前元素的大小
screenshot_as_png

将当前元素截屏并保存为png格式的二进制数据
screenshot_as_base64

将当前元素截屏并保存为base64编码的字符串
rect

获取一个包含当前元素大小和位置的字典
parent

获取当前元素的父节点
location

当前元素的位置
id

当前元素的id值，主要用来selenium内部使用，可以用来判断两个元素是否是同一个元素

Keys

我们经常需要模拟键盘的输入，当输入普通的值时，在send_keys()方法中传入要输入的字符串就好了。

但是我们有时候会用到一些特殊的按键，这时候就需要用到我们的Keys类。

简例

from selenium.webdriver.common.keys import Keys

elem.send_keys(Keys.CONTROL, 'c')

属性

这个Keys类有很多属性，每个属性对应一个按键。所有的属性如下所示：

ADD = u'\ue025'
ALT = u'\ue00a'
ARROW_DOWN = u'\ue015'
ARROW_LEFT = u'\ue012'
ARROW_RIGHT = u'\ue014' ARROW_UP = u'\ue013' BACKSPACE = u'\ue003' BACK_SPACE = u'\ue003' CANCEL = u'\ue001' CLEAR = u'\ue005' COMMAND = u'\ue03d' CONTROL = u'\ue009' DECIMAL = u'\ue028' DELETE = u'\ue017' DIVIDE = u'\ue029' DOWN = u'\ue015' END = u'\ue010' ENTER = u'\ue007' EQUALS = u'\ue019' ESCAPE = u'\ue00c' F1 = u'\ue031' F10 = u'\ue03a' F11 = u'\ue03b' F12 = u'\ue03c' F2 = u'\ue032' F3 = u'\ue033' F4 = u'\ue034' F5 = u'\ue035' F6 = u'\ue036' F7 = u'\ue037' F8 = u'\ue038' F9 = u'\ue039' HELP = u'\ue002' HOME = u'\ue011' INSERT = u'\ue016' LEFT = u'\ue012' LEFT_ALT = u'\ue00a' LEFT_CONTROL = u'\ue009' LEFT_SHIFT = u'\ue008' META = u'\ue03d' MULTIPLY = u'\ue024' NULL = u'\ue000' NUMPAD0 = u'\ue01a' NUMPAD1 = u'\ue01b' NUMPAD2 = u'\ue01c' NUMPAD3 = u'\ue01d' NUMPAD4 = u'\ue01e' NUMPAD5 = u'\ue01f' NUMPAD6 = u'\ue020' NUMPAD7 = u'\ue021' NUMPAD8 = u'\ue022' NUMPAD9 = u'\ue023' PAGE_DOWN = u'\ue00f' PAGE_UP = u'\ue00e' PAUSE = u'\ue00b' RETURN = u'\ue006' RIGHT = u'\ue014' SEMICOLON = u'\ue018' SEPARATOR = u'\ue026' SHIFT = u'\ue008' SPACE = u'\ue00d' SUBTRACT = u'\ue027' TAB = u'\ue004' UP = u'\ue013'

Selenium笔记（5）动作链

简介

一般来说我们与页面的交互可以使用Webelement的方法来进行点击等操作。但是，有时候我们需要一些更复杂的动作，类似于拖动，双击，长按等等。

这时候就需要用到我们的Action Chains（动作链）了。

简例

from selenium.webdriver import ActionChains

element = driver.find_element_by_name("source")
target = driver.find_element_by_name("target")

actions = ActionChains(driver)
actions.drag_and_drop(element, target)
actions.perform()

在导入动作链模块以后，需要声明一个动作链对象，在声明时将webdriver当作参数传入，并将对象赋值给一个actions变量。

然后我们通过这个actions变量，调用其内部附带的各种动作方法进行操作。

注：在调用各种动作方法后，这些方法并不会马上执行，而是会按你代码的顺序存储在ActionChains对象的队列中。当你调用perform()时，这些动作才会依次开始执行。

常用动作方法

click(on_element=None)

左键单击传入的元素，如果不传入的话，点击鼠标当前位置。
context_click(on_element=None)

右键单击。
double_click(on_element=None)

双击。
click_and_hold(on_element=None)

点击并抓起
drag_and_drop(source, target)

在source元素上点击抓起，移动到target元素上松开放下。
drag_and_drop_by_offset(source, xoffset, yoffset)

在source元素上点击抓起，移动到相对于source元素偏移xoffset和yoffset的坐标位置放下。
send_keys(*keys_to_send)

将键发送到当前聚焦的元素。
send_keys_to_element(element, *keys_to_send)

将键发送到指定的元素。
reset_actions()

清除已经存储的动作。

完整文档

class selenium.webdriver.common.action_chains.``ActionChains(driver)

Bases: object

ActionChains are a way to automate low level interactions such as mouse movements, mouse button actions, key press, and context menu interactions. This is useful for doing more complex actions like hover over and drag and drop.

Generate user actions.

When you call methods for actions on the ActionChains object, the actions are stored in a queue in the ActionChains object. When you call perform(), the events are fired in the order they are queued up.

ActionChains can be used in a chain pattern:

menu = driver.find_element_by_css_selector(".nav")
hidden_submenu = driver.find_element_by_css_selector(".nav #submenu1")

ActionChains(driver).move_to_element(menu).click(hidden_submenu).perform()

Or actions can be queued up one by one, then performed.:

menu = driver.find_element_by_css_selector(".nav")
hidden_submenu = driver.find_element_by_css_selector(".nav #submenu1")

actions = ActionChains(driver)
actions.move_to_element(menu)
actions.click(hidden_submenu)
actions.perform()

Either way, the actions are performed in the order they are called, one after another.

__init__(driver)

Creates a new ActionChains.

Args:
- driver: The WebDriver instance which performs user actions.
click(on_element=None)

Clicks an element.

Args:
- on_element: The element to click. If None, clicks on current mouse position.
click_and_hold(on_element=None)

Holds down the left mouse button on an element.

Args:
- on_element: The element to mouse down. If None, clicks on current mouse position.
context_click(on_element=None)

Performs a context-click (right click) on an element.

Args:
- on_element: The element to context-click. If None, clicks on current mouse position.
double_click(on_element=None)

Double-clicks an element.

Args:
- on_element: The element to double-click. If None, clicks on current mouse position.
drag_and_drop(source, target)

Holds down the left mouse button on the source element,then moves to the target element and releases the mouse button.

Args:
- source: The element to mouse down.
- target: The element to mouse up.
drag_and_drop_by_offset(source, xoffset, yoffset)

Holds down the left mouse button on the source element,then moves to the target offset and releases the mouse button.

Args:
- source: The element to mouse down.
- xoffset: X offset to move to.
- yoffset: Y offset to move to.
key_down(value, element=None)

Sends a key press only, without releasing it.Should only be used with modifier keys (Control, Alt and Shift).

Args:
- value: The modifier key to send. Values are defined in Keys class.
- element: The element to send keys. If None, sends a key to current focused element.
Example, pressing ctrl+c:
```
ActionChains(driver).key_down(Keys.CONTROL).send_keys('c').key_up(Keys.CONTROL).perform() 
```
key_up(value, element=None)

Releases a modifier key.

Args:
- value: The modifier key to send. Values are defined in Keys class.
- element: The element to send keys. If None, sends a key to current focused element.
Example, pressing ctrl+c:
```
ActionChains(driver).key_down(Keys.CONTROL).send_keys('c').key_up(Keys.CONTROL).perform()
```
move_by_offset(xoffset, yoffset)

Moving the mouse to an offset from current mouse position.

Args:
- xoffset: X offset to move to, as a positive or negative integer.
- yoffset: Y offset to move to, as a positive or negative integer.
move_to_element(to_element)

Moving the mouse to the middle of an element.

Args:
- to_element: The WebElement to move to.
move_to_element_with_offset(to_element, xoffset, yoffset)

Move the mouse by an offset of the specified element.Offsets are relative to the top-left corner of the element.

Args:
- to_element: The WebElement to move to.
- xoffset: X offset to move to.
- yoffset: Y offset to move to.
pause(seconds)

Pause all inputs for the specified duration in seconds
perform()

Performs all stored actions.
release(on_element=None)

Releasing a held mouse button on an element.

Args:
- on_element: The element to mouse up. If None, releases on current mouse position.
reset_actions()

Clears actions that are already stored on the remote end.
send_keys(*keys_to_send)

Sends keys to current focused element.

Args:
- keys_to_send: The keys to send. Modifier keys constants can be found in the ‘Keys’ class.
send_keys_to_element(element, *keys_to_send)

Sends keys to an element.

Args:
- element: The element to send keys.
- keys_to_send: The keys to send. Modifier keys constants can be found in the ‘Keys’ class.

Selenium笔记（6）等待

简介

在selenium操作浏览器的过程中，每一次请求url，selenium都会等待页面加载完毕以后，才会将操作权限再次交给我们的程序。

但是，由于ajax和各种JS代码的异步加载问题，所以我们在使用selenium的时候常常会遇到操作的元素还没有加载出来，就会引发报错。为了解决这个问题，Selenium提供了几种等待的方法，让我们可以等待元素加载完毕后，再进行操作。

显式等待

例子

from selenium import webdriver
from selenium.webdriver.common.by import By
from selenium.webdriver.support.ui import WebDriverWait from selenium.webdriver.support import expected_conditions as EC driver = webdriver.Chrome() driver.get("http://somedomain/url_that_delays_loading") try: element = WebDriverWait(driver, 10).until( EC.presence_of_element_located((By.ID, "myDynamicElement")) ) finally: driver.quit()

在这个例子中，我们在查找一个元素的时候，不再使用find_element_by_*这样的方式来查找元素，而是使用了WebDriverWait。

try代码块中的代码的意思是：在抛出元素不存在异常之前，最多等待10秒。在这10秒中，WebDriverWait会默认每500ms运行一次until之中的内容，而until中的EC.presence_of_element_located则是检查元素是否已经被加载，检查的元素则通过By.ID这样的方式来进行查找。

就是说，在10秒内，默认每0.5秒检查一次元素是否存在，存在则将元素赋值给element这个变量。如果超过10秒这个元素仍不存在，则抛出超时异常。

Expected Conditions

Expected Conditions这个类提供了很多种常见的检查条件可以供我们使用。

title_is
title_contains
presence_of_element_located
visibility_of_element_located
visibility_of
presence_of_all_elements_located
text_to_be_present_in_element
text_to_be_present_in_element_value
frame_to_be_available_and_switch_to_it
invisibility_of_element_located
element_to_be_clickable
staleness_of
element_to_be_selected
element_located_to_be_selected
element_selection_state_to_be
element_located_selection_state_to_be
alert_is_present

例子：

from selenium.webdriver.support import expected_conditions as EC

wait = WebDriverWait(driver, 10)
# 等待直到元素可以被点击 element = wait.until(EC.element_to_be_clickable((By.ID, 'someid')))

隐式等待

隐式等待指的是，在webdriver中进行find_element这一类查找操作时，如果找不到元素，则会默认的轮询等待一段时间。

这个值默认是0，可以通过以下方式进行设置：

from selenium import webdriver

driver = webdriver.Chrome()
driver.implicitly_wait(10) # 单位是秒
driver.get("http://somedomain/url_that_delays_loading") myDynamicElement = driver.find_element_by_id("myDynamicElement")

Selenium笔记（7）异常

完整文档

Exceptions that may happen in all the webdriver code.

- exceptionselenium.common.exceptions.``ElementClickInterceptedException(msg=None,screen=None, stacktrace=None)
  
  Bases: selenium.common.exceptions.WebDriverExceptionThe Element Click command could not be completed because the element receiving the events is obscuring the element that was requested clicked.
- exceptionselenium.common.exceptions.``ElementNotInteractableException(msg=None,screen=None, stacktrace=None)¶
  
  Bases:selenium.common.exceptions.InvalidElementStateExceptionThrown when an element is present in the DOM but interactions with that element will hit another element do to paint order
- exceptionselenium.common.exceptions.``ElementNotSelectableException(msg=None,screen=None, stacktrace=None)
  
  Bases:selenium.common.exceptions.InvalidElementStateExceptionThrown when trying to select an unselectable element.For example, selecting a ‘script’ element.
- exceptionselenium.common.exceptions.``ElementNotVisibleException(msg=None,screen=None, stacktrace=None)
  
  Bases:selenium.common.exceptions.InvalidElementStateExceptionThrown when an element is present on the DOM, but it is not visible, and so is not able to be interacted with.Most commonly encountered when trying to click or read text of an element that is hidden from view.
- exceptionselenium.common.exceptions.``ErrorInResponseException(response,msg)
  
  Bases:selenium.common.exceptions.WebDriverExceptionThrown when an error has occurred on the server side.This may happen when communicating with the firefox extension or the remote driver server.__init__(response, msg)
- exceptionselenium.common.exceptions.``ImeActivationFailedException(msg=None,screen=None, stacktrace=None)
  
  Bases:selenium.common.exceptions.WebDriverExceptionThrown when activating an IME engine has failed.
- exceptionselenium.common.exceptions.``ImeNotAvailableException(msg=None,screen=None, stacktrace=None)
  
  Bases:selenium.common.exceptions.WebDriverExceptionThrown when IME support is not available. This exception is thrown for every IME-related method call if IME support is not available on the machine.
- exceptionselenium.common.exceptions.``InsecureCertificateException(msg=None,screen=None, stacktrace=None)
  
  Bases:selenium.common.exceptions.WebDriverExceptionNavigation caused the user agent to hit a certificate warning, which is usually the result of an expired or invalid TLS certificate.
- exceptionselenium.common.exceptions.``InvalidArgumentException(msg=None,screen=None, stacktrace=None)
  
  Bases: selenium.common.exceptions.WebDriverExceptionThe arguments passed to a command are either invalid or malformed.
- exceptionselenium.common.exceptions.``InvalidCookieDomainException(msg=None,screen=None, stacktrace=None)
  
  Bases:selenium.common.exceptions.WebDriverExceptionThrown when attempting to add a cookie under a different domain than the current URL.
- exceptionselenium.common.exceptions.``InvalidCoordinatesException(msg=None,screen=None, stacktrace=None)
  
  Bases: selenium.common.exceptions.WebDriverExceptionThe coordinates provided to an interactions operation are invalid.
- exceptionselenium.common.exceptions.``InvalidElementStateException(msg=None,screen=None, stacktrace=None)
  
  Bases: selenium.common.exceptions.WebDriverException
- exceptionselenium.common.exceptions.``InvalidSelectorException(msg=None,screen=None, stacktrace=None)
  
  Bases:selenium.common.exceptions.NoSuchElementExceptionThrown when the selector which is used to find an element does not return a WebElement. Currently this only happens when the selector is an xpath expression and it is either syntactically invalid (i.e. it is not a xpath expression) or the expression does not select WebElements (e.g. “count(//input)”).
- exceptionselenium.common.exceptions.``InvalidSessionIdException(msg=None,screen=None, stacktrace=None)
  
  Bases:selenium.common.exceptions.WebDriverExceptionOccurs if the given session id is not in the list of active sessions, meaning the session either does not exist or that it’s not active.
- exceptionselenium.common.exceptions.``InvalidSwitchToTargetException(msg=None,screen=None, stacktrace=None)
  
  Bases:selenium.common.exceptions.WebDriverExceptionThrown when frame or window target to be switched doesn’t exist.
- exceptionselenium.common.exceptions.``JavascriptException(msg=None,screen=None, stacktrace=None)
  
  Bases: selenium.common.exceptions.WebDriverExceptionAn error occurred while executing JavaScript supplied by the user.
- exceptionselenium.common.exceptions.``MoveTargetOutOfBoundsException(msg=None,screen=None, stacktrace=None)
  
  Bases:selenium.common.exceptions.WebDriverExceptionThrown when the target provided to the ActionsChains move() method is invalid, i.e. out of document.
- exceptionselenium.common.exceptions.``NoAlertPresentException(msg=None,screen=None, stacktrace=None)
  
  Bases:selenium.common.exceptions.WebDriverExceptionThrown when switching to no presented alert.This can be caused by calling an operation on the Alert() class when an alert is not yet on the screen.
- exceptionselenium.common.exceptions.``NoSuchAttributeException(msg=None,screen=None, stacktrace=None)
  
  Bases:selenium.common.exceptions.WebDriverExceptionThrown when the attribute of element could not be found.You may want to check if the attribute exists in the particular browser you are testing against. Some browsers may have different property names for the same property. (IE8’s .innerText vs. Firefox .textContent)
- exceptionselenium.common.exceptions.``NoSuchCookieException(msg=None,screen=None, stacktrace=None)
  
  Bases: selenium.common.exceptions.WebDriverExceptionNo cookie matching the given path name was found amongst the associated cookies of the current browsing context’s active document.
- exceptionselenium.common.exceptions.``NoSuchElementException(msg=None,screen=None, stacktrace=None)
  
  Bases:selenium.common.exceptions.WebDriverExceptionThrown when element could not be found.If you encounter this exception, you may want to check the following:Check your selector used in your find_by…Element may not yet be on the screen at the time of the find operation, (webpage is still loading) see selenium.webdriver.support.wait.WebDriverWait() for how to write a wait wrapper to wait for an element to appear.
- exceptionselenium.common.exceptions.``NoSuchFrameException(msg=None,screen=None, stacktrace=None)
  
  Bases:selenium.common.exceptions.InvalidSwitchToTargetExceptionThrown when frame target to be switched doesn’t exist.
- exceptionselenium.common.exceptions.``NoSuchWindowException(msg=None,screen=None, stacktrace=None)
  
  Bases:selenium.common.exceptions.InvalidSwitchToTargetExceptionThrown when window target to be switched doesn’t exist.To find the current set of active window handles, you can get a list of the active window handles in the following way:print driver.window_handles
- exceptionselenium.common.exceptions.``RemoteDriverServerException(msg=None,screen=None, stacktrace=None)
  
  Bases: selenium.common.exceptions.WebDriverException
- exceptionselenium.common.exceptions.``ScreenshotException(msg=None,screen=None, stacktrace=None)
  
  Bases: selenium.common.exceptions.WebDriverExceptionA screen capture was made impossible.
- exceptionselenium.common.exceptions.``SessionNotCreatedException(msg=None,screen=None, stacktrace=None)
  
  Bases: selenium.common.exceptions.WebDriverExceptionA new session could not be created.
- exceptionselenium.common.exceptions.``StaleElementReferenceException(msg=None,screen=None, stacktrace=None)
  
  Bases:selenium.common.exceptions.WebDriverExceptionThrown when a reference to an element is now “stale”.Stale means the element no longer appears on the DOM of the page.Possible causes of StaleElementReferenceException include, but not limited to:You are no longer on the same page, or the page may have refreshed since the element was located.The element may have been removed and re-added to the screen, since it was located. Such as an element being relocated. This can happen typically with a javascript framework when values are updated and the node is rebuilt.Element may have been inside an iframe or another context which was refreshed.
- exceptionselenium.common.exceptions.``TimeoutException(msg=None,screen=None, stacktrace=None)
  
  Bases:selenium.common.exceptions.WebDriverExceptionThrown when a command does not complete in enough time.
- exceptionselenium.common.exceptions.``UnableToSetCookieException(msg=None,screen=None, stacktrace=None)
  
  Bases:selenium.common.exceptions.WebDriverExceptionThrown when a driver fails to set a cookie.
- exceptionselenium.common.exceptions.``UnexpectedAlertPresentException(msg=None,screen=None, stacktrace=None, alert_text=None)
  
  Bases:selenium.common.exceptions.WebDriverExceptionThrown when an unexpected alert is appeared.Usually raised when when an expected modal is blocking webdriver form executing any more commands.__init__(msg=None, screen=None,stacktrace=None, alert_text=None)
- exceptionselenium.common.exceptions.``UnexpectedTagNameException(msg=None,screen=None, stacktrace=None)
  
  Bases:selenium.common.exceptions.WebDriverExceptionThrown when a support class did not get an expected web element.
- exceptionselenium.common.exceptions.``UnknownMethodException(msg=None,screen=None, stacktrace=None)
  
  Bases: selenium.common.exceptions.WebDriverExceptionThe requested command matched a known URL but did not match an method for that URL.
- exceptionselenium.common.exceptions.``WebDriverException(msg=None,screen=None, stacktrace=None)
  
  Bases: exceptions.ExceptionBase webdriver exception.__init__(msg=None, screen=None,stacktrace=None)

Selenium笔记（8）常见的坑

用Xpath查找数据时无法直接获取节点属性

通常在我们使用xpath时，可以使用@class的方式直接获取节点的属性，如下所示：

page.xpath('//div/a/@class')

但在Selenium中不支持这种用法，只能在找到节点后，使用get_attribute(name)方法来获取属性：

page.xpath('//div/a').get_attribute('class')

同样的，Selenium同样不支持Xpath中的string()，text()这类的方法，只能获取元素节点。

使用了WebDriverWait以后仍然无法找到元素

有很多时候，一个简单的元素，明明也加了显式等待，但就是找不到，代码在仔细查看过后也没有问题后，多半是以下这几种情况：

由于分辨率设置的原因，查找的元素当前是不可见的。
某些页面的元素是需要向下滚动页面才会加载的。
由于某些其他元素的短暂遮挡，所以无法定位到。

1.分辨率原因

这时候应该设置好分辨率，使当前元素能够显示到页面中。

2.需要滚动页面

有些页面为了性能的考虑，页面下方不在当前屏幕中的元素是不会加载的，只有当页面向下滚动时才会继续加载。

而selenium本身不提供向下滚动的方法，所以我们需要去用JS去滚动页面：

driver.execute_script("window.scrollTo(0, document.body.scrollHeight)")

网上查到的一些滚动方式在Chrome上无效。但这一句是有效的。

3.由于其他元素的遮挡

有时候因为一些弹出元素的原因，如果还使用EC.presence_of_element_located()的话，我们需要定位的元素就无法被找到，这个时候我们就应该改变我们判断元素的方法：

element = WebDriverWait(driver, 10).until(
    EC.visibility_of_element_located((By.XPATH, ''))
)

使用EC.visibility_of_element_located()方法可以在等待到当前元素可见后，才获取元素。

在我们找不到元素，或者跟元素无法交互时，应该多去根据当前的情况，灵活选择显式等待的判断方式。

转载于:https://www.cnblogs.com/gdwz922/p/9596008.html

你可能感兴趣的:(爬虫,javascript,测试,ViewUI)

移动开发领域 MVP 模式的在线旅游应用开发与预订移动开发前沿旅游 ai
移动开发领域MVP模式的在线旅游应用开发与预订关键词：MVP模式、移动开发、在线旅游、预订系统、架构设计摘要：本文以在线旅游应用的预订功能开发为场景，深入解析MVP（Model-View-Presenter）模式在移动开发中的实践价值。通过“餐厅服务”的生活化类比、核心概念拆解、Kotlin代码实战以及旅游场景的具体应用，帮助开发者理解MVP如何解耦界面与业务逻辑，提升代码可维护性和可测试性。背景
爬虫小结 Crescent_P python小项目 python 数据分析
python爬虫小组作业上周布置了python的小组作业,每一组要求爬取老师指定的信息,本组抽到的题目如下:从中国银行网址：http://www.boc.cn/sourcedb/whpj/获取主要外汇（美元、欧元、英镑、加拿大元、澳大利亚元、日元、韩元、新台币、澳门元和港币）的牌价信息，计算出它们的每天平均价。要求把今年5月份每天平均价格保存到Excel文件中，每种外汇的数据保存在一个工作表中，并
Python 爬虫实战：抓取华尔街日报付费文章摘要的全方位指南 Python爬虫项目 python 爬虫开发语言信息可视化数据分析
引言在全球化的信息时代，获取高质量的新闻内容对于研究、投资和决策具有重要意义。《华尔街日报》（TheWallStreetJournal，简称WSJ）作为国际知名的财经媒体，其文章内容备受关注。然而，WSJ的大部分内容属于付费订阅，普通用户无法直接访问。本文将深入探讨如何使用Python爬虫技术，结合最新的工具和方法，抓取WSJ的付费文章摘要。一、了解目标网站结构1.1WSJ网站结构分析WSJ的官方
Python爬虫实战：使用最新技术爬取头条新闻数据 Python爬虫项目 2025年爬虫实战项目 python 爬虫开发语言 scrapy 音视频
一、前言：Python爬虫在现代数据获取中的重要性在当今信息爆炸的时代，数据已经成为最宝贵的资源之一。作为数据获取的重要手段，网络爬虫技术在各个领域发挥着越来越重要的作用。Python凭借其简洁的语法、丰富的库生态系统和强大的社区支持，已经成为网络爬虫开发的首选语言。本文将详细介绍如何使用Python及其最新的爬虫技术来爬取头条新闻数据。我们将从基础概念讲起，逐步深入到高级技巧，最后给出完整的爬虫
Python爬虫实战：爬取ETF基金持仓变化 Python爬虫项目 python 爬虫开发语言信息可视化数据分析
1.项目背景ETF（Exchange-TradedFund，交易型开放式指数基金）作为一种在交易所上市交易的基金，其持仓信息对于投资者具有重要参考价值。了解ETF的持仓变化，可以帮助投资者判断市场趋势和资金流向。本文将通过Python爬虫技术，自动化地获取ETF基金的持仓变化数据，进行存储和分析。2.技术选型与环境准备2.1技术选型编程语言：Python3.8+爬虫框架：Scrapy数据解析：Be
Python 爬虫实战：实时采集外汇汇率数据的全方位指南 Python爬虫项目 python 爬虫开发语言信息可视化数据分析
引言在全球化的金融市场中，外汇汇率的实时数据对于投资者、企业和研究人员来说至关重要。通过自动化的方式获取这些数据，不仅可以提高效率，还能为决策提供及时的支持。本文将深入探讨如何使用Python爬虫技术，结合最新的工具和方法，实时采集外汇汇率数据。一、外汇汇率数据的获取途径1.1使用官方API接口许多金融机构和数据提供商提供了官方的API接口，供开发者获取外汇汇率数据。例如：AlphaVantage
抓包工具fiddler详细使用教程金丝猴也是猿 http udp https websocket 网络安全网络协议 tcp/ip
抓包工具的使用技巧与配置指南各位做测试的同学想必对抓包工具并不陌生，Fiddler是大家常用的工具之一，但除了Fiddler，还有一款功能强大的抓包工具——SniffMaster（抓包大师），它在某些场景下表现尤为出色。今天我们将结合Fiddler和SniffMaster的使用技巧，为大家提供一份全面的抓包配置指南。Web端抓包配置Fiddler的HTTPS配置打开Fiddler，进入Tools-
Python 领域 pytest 的测试用例的可维护性设计
Python领域pytest的测试用例的可维护性设计关键词：pytest、测试用例、可维护性、测试框架、自动化测试、测试设计模式、重构摘要：本文深入探讨了如何在Python测试框架pytest中设计可维护的测试用例。我们将从测试用例可维护性的核心原则出发，分析pytest的特性和最佳实践，介绍多种提高测试代码可维护性的设计模式和技巧。文章包含实际代码示例、项目实战案例以及可维护性评估指标，帮助开发
FastAPI依赖注入：构建高可维护API的核心理念与实战源滚滚AI编程 fastapi log4j
依赖注入（DependencyInjection,DI）作为FastAPI的核心设计模式，通过解耦组件依赖关系、提升代码复用性和可测试性，已成为现代API开发的基石。本文将深入解析其工作原理、高级特性及企业级应用场景。一、依赖注入的核心价值解耦与模块化将数据库连接、认证逻辑等基础设施与业务逻辑分离，避免代码冗余。示例：路由函数无需手动创建数据库连接，通过Depends(get_db)自动注入[ci
[3-02-01].第14节：三方整合 - SpringData整合Redis集群 1.01^1000 阶段03：企业框架 spring boot
Redis大纲一、SpringBoot整合主从架构的Redis：1.1.问题说明：1.在Sentinel集群监管下的Redis哨兵架构中，其节点会因为自动故障转移而发生变化，Redis的客户端必须感知这种变化，及时更新连接信息2.SpringBoot中的RedisTemplate底层利用lettuce实现了节点的感知和自动切换，我们需要进行配置才可以实现这种动态上下线的情况。下面，我们通过一个测试
Python爬虫小白入门指南，成为大牛必须经历的三个阶段
学习任何一门技术，都应该带着目标去学习，目标就像一座灯塔，指引你前进，很多人学着学着就学放弃了，很大部分原因是没有明确目标，所以，一定要明确学习目的，在你准备学爬虫前，先问问自己为什么要学习爬虫。有些人是为了一份工作，有些人是为了好玩，也有些人是为了实现某个黑科技功能。不过可以肯定的是，学会了爬虫能给你的工作提供很多便利。小白入门必读作为零基础小白，大体上可分为三个阶段去实现。第一阶段是入门，掌握
【车载测试之CAPL编程系列】：【16】函数定义(2)
车载测试CAPL编程系列：CAPL中的函数定义(2)目录函数定义的基本形式参数类型与返回值函数重载（Overload）返回值限制：不能返回数组AI总结函数定义的基本形式CAPL函数定义具有灵活性，可根据需求设计无返回值、无参数的函数。无返回值、无参数的函数返回值类型：若函数无返回值，可声明为void，且void关键字可省略（CAPL特性，区别于C语言）。参数：允许无参数，但必须保留空括号()。示例
[晕事]今天做了件晕事83: pen test mzhan017 英语学习笔记晕事英语学习
这个缩写，就不能顾名思义了，而且pen是一个独立的单词，从读音上来说还容易和pain混淆，所以导致初接触者有些困扰。所以这个pentest的缩写，有些失败。全写是penetrationtest：渗透测试。https://en.wikipedia.org/wiki/Penetration_test修改建议是改成penetest，至少可以和pen在书写上区分，在读音是也可以区分，就读“排你test”。
Python爬虫在社交平台数据挖掘中的应用：深入探索用户互动程序员威哥 python 爬虫数据挖掘
引言社交媒体已经成为全球用户互动的主要平台，每天都有大量的信息生成，用户之间的互动行为如点赞、评论、分享、转发等构成了宝贵的数据资源。如何利用这些互动数据为商业决策、用户行为分析以及产品优化提供支持，已经成为数据科学与大数据分析领域的一个重要课题。Python作为一款强大的编程语言，凭借其丰富的爬虫库和数据分析工具，已经成为挖掘社交平台数据的重要工具。在本文中，我们将通过Python爬虫技术，深入
Python 爬虫实战：精准抓取母婴电商平台数据，深入分析用户评价洞察市场趋势程序员威哥最新爬虫实战项目 python 爬虫开发语言
前言随着生活水平的提高，越来越多的年轻父母开始关注母婴产品的质量和品牌。而母婴电商平台成为了他们选择和购买产品的主要渠道之一。母婴产品市场也因此变得异常活跃且充满竞争。在这样的市场环境下，用户评价不仅反映了产品的实际质量，也揭示了消费者的需求和偏好，成为品牌决策的核心依据之一。Python爬虫是获取电商平台用户评价数据、产品详情、价格等关键信息的强大工具。通过抓取和分析这些数据，品牌商可以实时了解
*Python爬虫应用：从社交媒体数据中提取有价值的用户行为洞察程序员威哥 python 爬虫媒体
引言在现代数字化时代，社交媒体已成为获取用户行为数据的重要来源。每秒钟，数百万条信息在平台上传播，用户的互动行为——点赞、评论、分享、关注等，构成了大量宝贵的行为数据。企业和个人通过分析这些数据，不仅可以理解用户需求、改进产品，还能精准制定营销策略。然而，如何高效地抓取、分析并从中提取有价值的用户行为洞察？这正是Python爬虫和数据分析技术的优势所在。本文将介绍如何利用Python爬虫从社交媒体
python 异步编程：协程与 asyncio 花_城 Python 开发语言后端异步协程
文章目录一、协程（coroutine）1.1协程的概念1.2实现协程的方式二、asyncio异步编程2.1事件循环2.2快速上手2.3运行协程2.4await关键字2.5可等待对象2.5.1协程2.5.2任务（Task）2.5.3asyncio.Future三、concurrent.futures.Future（补充）3.1爬虫案例（asyncio+不支持异步的模块）四、asyncio异步迭代器五
进阶之App 测试一只舰性能测试
App知识点什么是activityActivity一个应用程序的组件，它提供一个屏幕来与用户交互。Activity:应用程序中，一个Activity就相当于手机屏幕，它是一种可以包含用户界面的组件，主要用于和用户进行交互。一个应用程序可以包含许多活动，比如事件的点击，一般都会触发一个新的Activity。Activity生命周期四种状态:1、运行2、暂停3、停止4、系统回收（killed）Andr
Python 爬虫实战：如何搭建高效的分布式爬虫架构，突破数据抓取极限程序员威哥 python 爬虫分布式
随着互联网数据量的飞速增长，单一爬虫在抓取大量数据时的效率和稳定性往往无法满足需求。在这种情况下，分布式爬虫架构应运而生。分布式爬虫通过多节点并行工作，可以大大提高数据抓取的速度，同时减少单点故障的风险。本文将深入探讨如何使用Python构建一个高效的分布式爬虫架构，从架构设计到技术实现，帮助你突破数据抓取的极限。一、什么是分布式爬虫？分布式爬虫系统将爬虫任务拆分为多个子任务，分布到不同的服务器或
iOS App抓包工具排查后台唤醒引发请求异常代码背锅人日志 http udp https websocket 网络安全网络协议 tcp/ip
在一次iOSApp优化后台推送处理时，我们发现部分用户在通过推送唤醒App后，进入页面会出现数据加载失败。此时日志中并无请求发起记录，后端也未接收到该用户的访问。由于问题只发生在App由后台被唤醒的场景中，常规功能测试完全无法覆盖。我们通过一次完整的抓包分析流程，还原了App在后台唤醒后的请求链（如使用Sniffmaster进行iOS真机抓包），最终找到了隐藏的问题。背景：推送唤醒后页面数据加载失
如何让AI真正理解你的意图（自适应Prompt实战指南） nine是个工程师大语言模型人工智能 prompt
目前的LLM模型，在理解用户意图方面，正在使用自适应Prompt技术，来提升模型的理解能力。目前使用deepseek推理模型能明显看到自适应的一个过程。前言：为什么你的AI总是"答非所问"？相信很多人都遇到过这样的情况：你问：“帮我写一个Python爬虫”AI答：给你一堆理论知识和完整教程（你只想要简单代码）你问：“推荐一部电影”AI答：推荐了《教父》（你想看轻松喜剧）你问：“解释一下机器学习”A
#TypeScript高频面试题总结（2025版）沈大大520 typescript 前端面试
本文将分享TypeScript高频面试题的一些面试点以及相应的示列作者：沈大大更新时间：2025-03-11前言TypeScript作为JavaScript的超集，已经成为前端开发中不可或缺的技术。本文整理了最常见的TypeScript面试题，从基础到高级，帮助你全面准备技术面试。基础概念篇1.TypeScript与JavaScript的区别是什么？TypeScript是JavaScript的超集
H5页面点击调起腾讯/百度/高德地图APP
注意：在手机端测试时发现了一个问题，用百度浏览器只能调用百度地图app的，对腾讯/高德地图是无效的，于是我用qq浏览器测试，结果发现qq浏览器是都可以调起的。一：腾讯地图（api文档）window.open(`http://apis.map.qq.com/uri/v1/marker?marker=coord:${this.latitude},${this.longitude};addr:${thi
【PTA数据结构 | C语言版】在单链表 list 的第 i 个位置上插入元素 x
本专栏持续输出数据结构题目集，欢迎订阅。文章目录题目代码题目请编写程序，将n个整数插入初始为空的单链表，第i个整数插入在第i个位置上。注意：i代表位序，从1开始。插入结束后，输出链表长度，并顺序输出链表中的每个结点的数值。最后，尝试将最后一个整数插入到链表的第0个、第n+2个位置上，以测试错误信息的输出。输入格式：输入首先在第一行给出正整数n（≤20）；随后一行给出n个int范围内的整数，数字间以
rk3566开发之rknn npu 部署三十度角阳光的问候 rknn npu rk3566 目标检测
目录NPU使用RKNN模型非RKNN模型RKNN-Toolkit2工具RKNNNPU测试代码如下main.ccssd.cc调用ssd模型进行目标检测测试ssd.hqt中调用rknnnpu接口NPU使用RK3566内置NPU模块。使用该NPU需要下载RKNNSDK，RKNNSDK为带有NPU的RK3566/RK3568芯片平台提供编程接口，能够帮助用户部署使用RKNN-Toolkit2导出的RKNN
LabVIEW串口通信实战教程：上位机与下位机数据交互安检
本文还有配套的精品资源，点击获取简介：LabVIEW作为一种图形化编程工具，非常适合开发用于测试、测量和控制的应用程序。本文介绍了一个LabVIEW串口通信实例——“串口助手.vi”，通过它可以作为上位机接收下位机通过串口发送的数据。文章详细解释了LabVIEW中串口通信的关键技术点，包括串口配置、打开和关闭串口、数据读取与写入、错误处理、数据解析、用户界面设计、事件结构以及实时监控。掌握这些技术
实现顶部固定与平滑滑动二级菜单的网页导航设计
本文还有配套的精品资源，点击获取简介：现代网页设计中，高效的导航菜单对用户体验至关重要。本设计涵盖固定在顶部的导航栏和二级菜单项的平滑滑动效果。通过CSS实现导航栏的固定定位，而JavaScript则负责二级菜单的平滑过渡动画。包含的文件如HTML结构、JavaScript交互逻辑、CSS样式和可能的图像资源，共同构建了这种流行的导航菜单布局。1.顶部固定、二级栏目之间相互滑动的导航菜单在现代网页
快速启动静态网络服务器的Run工具使用指南闫泽华
本文还有配套的精品资源，点击获取简介：本文介绍了如何使用run工具，一个通过npm全局安装的Node.js包，来启动一个简单的静态文件服务器。介绍了npm的作用，以及如何全局安装run。随后，文章解释了run工具的用途，包括从任何目录快速启动静态网站服务器的能力，并讨论了它在开发、测试和演示中的应用。还涉及了使用run工具时涉及的一些基本任务，如处理HTTP请求和返回静态资源，以及提供了源代码文件
鲲鹏+银河麒麟v10离线安装docker
寻找软件源据说银河麒麟基于CentOS7，但是通过测试最终添加CentOS8的源才可以用，因为他喵的CentOS7只有x86_64，而CentOS8才有aarch64，厂商的话都信不得哦。手动配置了CentOS8的源后，yummakecache可以正常缓存，但是yum-yupdate会出现多个依赖错误问题，通过yum-yinstall可以安装软件，但是依赖问题依然很难受。最终在配置好CentOS8
system Verilog：clocking中定义信号为input和output的区别加载-ing system verilog
在SystemVerilog中，clocking块用于定义时钟块，这通常用于描述时钟边缘和同步的输入/输出行为，特别是在测试平台和硬件接口描述中。在下述两个代码示例中，主要区别在于a被定义为一个input还是output。当a被定义为input时：systemverilogclockingcb@(posedgeclk);inputa;endclocking这意味着a是一个从被测试设计（DUT）到测
html 周华华 html
js 1，数组的排列 var arr=[1,4,234,43,52,]; for(var x=0;x<arr.length;x++){ for(var y=x-1;y<arr.length;y++){ if(arr[x]<arr[y]){ &
【Struts2 四】Struts2拦截器 bit1129 struts2拦截器
Struts2框架是基于拦截器实现的，可以对某个Action进行拦截，然后某些逻辑处理，拦截器相当于AOP里面的环绕通知，即在Action方法的执行之前和之后根据需要添加相应的逻辑。事实上，即使struts.xml没有任何关于拦截器的配置，Struts2也会为我们添加一组默认的拦截器，最常见的是，请求参数自动绑定到Action对应的字段上。 Struts2中自定义拦截器的步骤是：
make:cc 命令未找到解决方法 daizj linux 命令未知 make cc
安装rz sz程序时，报下面错误： [root@slave2 src]# make posix cc -O -DPOSIX -DMD=2 rz.c -o rz make: cc：命令未找到 make: *** [posix] 错误 127 系统：centos 6.6 环境：虚拟机错误原因：系统未安装gcc，这个是由于在安
Oracle之Job应用周凡杨 oracle job
最近写服务，服务上线后，需要写一个定时执行的SQL脚本，清理并更新数据库表里的数据，应用到了Oracle 的 Job的相关知识。在此总结一下。一：查看相关job信息 1、相关视图 dba_jobs all_jobs user_jobs dba_jobs_running 包含正在运行
多线程机制朱辉辉33 多线程
转至http://blog.csdn.net/lj70024/archive/2010/04/06/5455790.aspx 程序、进程和线程：程序是一段静态的代码，它是应用程序执行的蓝本。进程是程序的一次动态执行过程，它对应了从代码加载、执行至执行完毕的一个完整过程，这个过程也是进程本身从产生、发展至消亡的过程。线程是比进程更小的单位，一个进程执行过程中可以产生多个线程，每个线程有自身的
web报表工具FineReport使用中遇到的常见报错及解决办法（一）老A不折腾 web报表 finereport java报表报表工具
FineReport使用中遇到的常见报错及解决办法（一）这里写点抛砖引玉，希望大家能把自己整理的问题及解决方法晾出来，Mark一下，利人利己。出现问题先搜一下文档上有没有，再看看度娘有没有，再看看论坛有没有。有报错要看日志。下面简单罗列下常见的问题，大多文档上都有提到的。 1、address pool is full：含义：地址池满，连接数超过并发数上
mysql rpm安装后没有my.cnf 林鹤霄没有my.cnf
Linux下用rpm包安装的MySQL是不会安装/etc/my.cnf文件的，至于为什么没有这个文件而MySQL却也能正常启动和作用，在这儿有两个说法，第一种说法，my.cnf只是MySQL启动时的一个参数文件，可以没有它，这时MySQL会用内置的默认参数启动，第二种说法，MySQL在启动时自动使用/usr/share/mysql目录下的my-medium.cnf文件，这种说法仅限于r
Kindle Fire HDX root并安装谷歌服务框架之后仍无法登陆谷歌账号的问题 aigo root
原文：http://kindlefireforkid.com/how-to-setup-a-google-account-on-amazon-fire-tablet/ Step 4: Run ADB command from your PC On the PC, you need install Amazon Fire ADB driver and instal
javascript 中var提升的典型实例 alxw4616 JavaScript
// 刚刚在书上看到的一个小问题,很有意思.大家一起思考下吧 myname = 'global'; var fn = function () { console.log(myname); // undefined var myname = 'local'; console.log(myname); // local }; fn() // 上述代码实际上等同于以下代码 m
定时器和获取时间的使用百合不是茶时间的转换定时器
定时器:定时创建任务在游戏设计的时候用的比较多 Timer();定时器 TImerTask();Timer的子类由 Timer 安排为一次执行或重复执行的任务。定时器类Timer在java.util包中。使用时，先实例化，然后使用实例的schedule(TimerTask task, long delay)方法，设定
JDK1.5 Queue bijian1013 java thread java多线程 Queue
JDK1.5 Queue LinkedList： LinkedList不是同步的。如果多个线程同时访问列表，而其中至少一个线程从结构上修改了该列表，则它必须保持外部同步。（结构修改指添加或删除一个或多个元素的任何操作；仅设置元素的值不是结构修改。）这一般通过对自然封装该列表的对象进行同步操作来完成。如果不存在这样的对象，则应该使用 Collections.synchronizedList 方
http认证原理和https bijian1013 http https
一.基础介绍在URL前加https://前缀表明是用SSL加密的。你的电脑与服务器之间收发的信息传输将更加安全。 Web服务器启用SSL需要获得一个服务器证书并将该证书与要使用SSL的服务器绑定。 http和https使用的是完全不同的连接方式，用的端口也不一样,前者是80，后
【Java范型五】范型继承 bit1129 java
定义如下一个抽象的范型类，其中定义了两个范型参数，T1，T2 package com.tom.lang.generics; public abstract class SuperGenerics<T1, T2> { private T1 t1; private T2 t2; public abstract void doIt(T
【Nginx六】nginx.conf常用指令(Directive) bit1129 Directive
1. worker_processes 8; 表示Nginx将启动8个工作者进程，通过ps -ef|grep nginx,会发现有8个Nginx Worker Process在运行 nobody 53879 118449 0 Apr22 ? 00:26:15 nginx: worker process
lua 遍历Header头部 ronin47 lua header 遍历　
local headers = ngx.req.get_headers() ngx.say("headers begin", "<br/>") ngx.say("Host : ", he
java-32.通过交换a,b中的元素，使[序列a元素的和]与[序列b元素的和]之间的差最小(两数组的差最小)。 bylijinnan java
import java.util.Arrays; public class MinSumASumB { /** * Q32.有两个序列a,b，大小都为n,序列元素的值任意整数，无序. * * 要求：通过交换a,b中的元素，使[序列a元素的和]与[序列b元素的和]之间的差最小。 * 例如: * int[] a = {100,99,98,1,2,3
redis 开窍的石头 redis
在redis的redis.conf配置文件中找到# requirepass foobared 把它替换成requirepass 12356789 后边的12356789就是你的密码打开redis客户端输入config get requirepass 返回 redis 127.0.0.1:6379> config get requirepass 1) "require
[JAVA图像与图形]现有的GPU架构支持JAVA语言吗？ comsci java语言
无论是opengl还是cuda，都是建立在C语言体系架构基础上的，在未来，图像图形处理业务快速发展，相关领域市场不断扩大的情况下，我们JAVA语言系统怎么从这么庞大，且还在不断扩大的市场上分到一块蛋糕，是值得每个JAVAER认真思考和行动的事情
安装ubuntu14.04登录后花屏了怎么办 cuiyadll ubuntu
这个情况，一般属于显卡驱动问题。可以先尝试安装显卡的官方闭源驱动。按键盘三个键：CTRL + ALT + F1 进入终端，输入用户名和密码登录终端：安装amd的显卡驱动 sudo apt-get install fglrx 安装nvidia显卡驱动 sudo ap
SSL 与数字证书的基本概念和工作原理 darrenzhu 加密 ssl 证书密钥签名
SSL 与数字证书的基本概念和工作原理 http://www.linuxde.net/2012/03/8301.html SSL握手协议的目的是或最终结果是让客户端和服务器拥有一个共同的密钥，握手协议本身是基于非对称加密机制的，之后就使用共同的密钥基于对称加密机制进行信息交换。 http://www.ibm.com/developerworks/cn/webspher
Ubuntu设置ip的步骤 dcj3sjt126com ubuntu
在单位的一台机器完全装了Ubuntu Server，但回家只能在XP上VM一个，装的时候网卡是DHCP的，用ifconfig查了一下ip是192.168.92.128,可以ping通。转载不是错： Ubuntu命令行修改网络配置方法 /etc/network/interfaces打开后里面可设置DHCP或手动设置静态ip。前面auto eth0，让网卡开机自动挂载. 1. 以D
php包管理工具推荐 dcj3sjt126com PHP Composer
http://www.phpcomposer.com/ Composer是 PHP 用来管理依赖（dependency）关系的工具。你可以在自己的项目中声明所依赖的外部工具库（libraries），Composer 会帮你安装这些依赖的库文件。中文文档入门指南下载安装包列表 Composer 中国镜像
Gson使用四（TypeAdapter） eksliang json gson Gson自定义转换器 gsonTypeAdapter
转载请出自出处：http://eksliang.iteye.com/blog/2175595 一.概述 Gson的TypeAapter可以理解成自定义序列化和返序列化二、应用场景举例例如我们通常去注册时（那些外国网站），会让我们输入firstName，lastName,但是转到我们都
JQM控件之Navbar和Tabs gundumw100 html xml css
在JQM中使用导航栏Navbar是简单的。只需要将data-role="navbar"赋给div即可： <div data-role="navbar"> <ul> <li><a href="#" class="ui-btn-active&qu
利用归并排序算法对大文件进行排序 iwindyforest java 归并排序大文件分治法 Merge sort
归并排序算法介绍，请参照Wikipeida zh.wikipedia.org/wiki/%E5%BD%92%E5%B9%B6%E6%8E%92%E5%BA%8F 基本思想：大文件分割成行数相等的两个子文件，递归（归并排序）两个子文件，直到递归到分割成的子文件低于限制行数低于限制行数的子文件直接排序两个排序好的子文件归并到父文件直到最后所有排序好的父文件归并到输入
iOS UIWebView URL拦截啸笑天 UIWebView
本文译者：candeladiao，原文：URL filtering for UIWebView on the iPhone说明：译者在做app开发时，因为页面的javascript文件比较大导致加载速度很慢，所以想把javascript文件打包在app里，当UIWebView需要加载该脚本时就从app本地读取，但UIWebView并不支持加载本地资源。最后从下文中找到了解决方法，第一次翻译，难免有
索引的碎片整理SQL语句 macroli sql
SET NOCOUNT ON DECLARE @tablename VARCHAR (128) DECLARE @execstr VARCHAR (255) DECLARE @objectid INT DECLARE @indexid INT DECLARE @frag DECIMAL DECLARE @maxfrag DECIMAL --设置最大允许的碎片数量,超过则对索引进行碎片
Angularjs同步操作http请求with $promise qiaolevip 每天进步一点点学习永无止境 AngularJS 纵观千象
// Define a factory app.factory('profilePromise', ['$q', 'AccountService', function($q, AccountService) { var deferred = $q.defer(); AccountService.getProfile().then(function(res) {
hibernate联合查询问题 sxj19881213 sql Hibernate HQL 联合查询
最近在用hibernate做项目，遇到了联合查询的问题，以及联合查询中的N+1问题。针对无外键关联的联合查询，我做了HQL和SQL的实验，希望能帮助到大家。（我使用的版本是hibernate3.3.2） 1 几个常识：（1）hql中的几种join查询，只有在外键关联、并且作了相应配置时才能使用。（2）hql的默认查询策略，在进行联合查询时，会产
struts2.xml wuai struts
<?xml version="1.0" encoding="UTF-8" ?> <!DOCTYPE struts PUBLIC "-//Apache Software Foundation//DTD Struts Configuration 2.3//EN" "http://struts.apache

潭州课堂25班：Ph201805201 爬虫基础 第八课 selenium (课堂笔记）

Selenium笔记（1）安装和简单使用

简介

安装

ChromeDriver（浏览器驱动）安装

Mac/Linux

Windows

Selenium安装

简单使用

Chrome无界面运行

Selenium简单例子

Selenium笔记（2）Chrome Webdriver启动选项

Chrome WebDriver Options

简介

例子

常用的启动参数

禁用图片加载

禁用浏览器弹窗

完整文档

Method

Values

Chrome WebDriver对象

简介

指定chromedriver.exe的位置

完整文档

Selenium笔记（3）Remote Webdriver

简介

常用操作

查找元素

单个查找

多个查找

通过私有方法进行查找

操作Cookie

获取截屏

获取窗口信息

切换

执行JS代码

完整文档

Selenium笔记（4）Webelement

方法

属性

Keys

简例

属性

Selenium笔记（5）动作链

简介

简例

常用动作方法

完整文档

Selenium笔记（6）等待

简介

显式等待

例子

Expected Conditions

隐式等待

Selenium笔记（7）异常

完整文档

Selenium笔记（8）常见的坑

用Xpath查找数据时无法直接获取节点属性

使用了WebDriverWait以后仍然无法找到元素

1.分辨率原因

2.需要滚动页面

3.由于其他元素的遮挡

你可能感兴趣的:(爬虫,javascript,测试,ViewUI)

潭州课堂25班：Ph201805201 爬虫基础第八课 selenium (课堂笔记）