urllib.request.urlretrieve and adding headers

Downloading an image:

import urllib.request

# url = "http://www.baidu.com/"
#
# response = urllib.request.urlretrieve(url, "hh.html")
#
# print(response)

image_url = "https://img04.sogoucdn.com/net/a/04/link?url=https%3A%2F%2Fi02piccdn.sogoucdn.com%2F7b90f00ce282f336&appid=122"
print(urllib.request.urlretrieve(image_url, "lz.png"))
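
What the print above shows is a tuple: urlretrieve returns the local file name together with the response headers. Below is a minimal sketch of unpacking that tuple. The helper name download and the file names source.bin and copy.bin are made up for illustration, and a local file:// URL stands in for the image URL so that the snippet runs without network access.

```python
import tempfile
import urllib.request
from pathlib import Path


def download(url, dest):
    """Save url to dest; return the local path and the response headers."""
    # urlretrieve returns a (filename, headers) tuple.
    local_path, headers = urllib.request.urlretrieve(url, dest)
    return local_path, headers


# Offline demo: a temporary file plays the role of the remote image.
with tempfile.TemporaryDirectory() as tmp:
    src = Path(tmp) / "source.bin"
    src.write_bytes(b"fake image bytes")
    dest = str(Path(tmp) / "copy.bin")

    local_path, headers = download(src.as_uri(), dest)
    copied = Path(local_path).read_bytes()
```

For a real download, the same helper could be called as download(image_url, "lz.png").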

Request headers:

import urllib.request

url = "http://www.baidu.com/"

headers = {
    'User-Agent': 'Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/67.0.3396.99 Safari/537.36'
}

request = urllib.request.Request(url=url, headers=headers)

response = urllib.request.urlopen(request)

print(response)
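
Note that print(response) only shows the response object, not the page content. The sketch below checks, without touching the network, that the header really is attached to the Request, and shows one possible way to read the status code and body afterwards. The helper names build_request and fetch and the shortened User-Agent string are assumptions for illustration, not part of the original notes.

```python
import urllib.request


def build_request(url, user_agent):
    """Build a Request carrying a custom User-Agent header (hypothetical helper)."""
    return urllib.request.Request(url=url, headers={"User-Agent": user_agent})


def fetch(request, timeout=10):
    """Send the request; return (status code, body bytes). Needs network access."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.status, response.read()


request = build_request("http://www.baidu.com/", "Mozilla/5.0 (Windows NT 6.1; WOW64)")

# Request stores header names capitalized as "User-agent", so look them up that way.
has_ua = request.has_header("User-agent")
sent_ua = request.get_header("User-agent")
method = request.get_method()  # "GET", because no data was attached
```

With network access, status, body = fetch(request) would return the status code and the raw bytes of the page, and body.decode("utf-8") would turn them into text, assuming the page is UTF-8 encoded.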

Error: urllib.error.URLError: 
Fix: the problem was in the URL itself, so check the URL and correct it.
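
The error message above is cut off, so the exact cause is unknown. As a rough sketch of how a bad URL can surface, the snippet below catches URLError and reports its reason. The helper name try_open and the misspelled scheme "htp" are made up for illustration. A URL with no scheme at all raises ValueError rather than URLError, so that case is caught as well.

```python
import urllib.error
import urllib.request


def try_open(url, timeout=5):
    """Return (True, status) on success, or (False, reason) on failure."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as response:
            return True, response.status
    except urllib.error.URLError as exc:
        # Covers unknown schemes, DNS failures, refused connections, HTTP errors.
        return False, str(exc.reason)
    except ValueError as exc:
        # Raised when the URL has no scheme, e.g. "www.baidu.com".
        return False, str(exc)


# Both calls fail before any network traffic is sent.
ok_typo, reason_typo = try_open("htp://www.baidu.com/")   # misspelled scheme
ok_bare, reason_bare = try_open("www.baidu.com")          # missing scheme
```

Printing the reason is usually enough to tell a malformed URL apart from a network problem.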
