Web Scraping Must-Read: The Simplest Approach & Program for Scraping Baidu Translate

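First, capture the traffic with Fiddler to find the request URL and the form_data payload. Then spoof your User-Agent and send the form to the server as a POST request. The response comes back as JSON: convert it to a dictionary, pick out the string you need, and print it. Invalid input is reported with an error message.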
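
The request-building step can be sketched on its own, without touching the network. This is a minimal sketch, not a definitive implementation: the helper name build_request and the shortened User-Agent string are assumptions made for illustration, while the URL and the kw form field come from the program in this post.

```python
import urllib.parse
import urllib.request


def build_request(word, url="https://fanyi.baidu.com/sug"):
    """Build (but do not send) the POST request for one query word.

    build_request is a hypothetical helper used only for illustration.
    """
    # URL-encode the form, then convert it to bytes, which urllib needs for a body
    body = urllib.parse.urlencode({"kw": word}).encode()
    headers = {
        # Shortened, illustrative UA string (an assumption, not a required value)
        "User-Agent": "Mozilla/5.0 (Windows NT 10.0; WOW64)",
        "X-Requested-With": "XMLHttpRequest",
    }
    # Supplying data= turns the request into a POST; headers go in by keyword
    return urllib.request.Request(url, data=body, headers=headers)


req = build_request("hello")
print(req.get_method())  # POST
print(req.data)          # b'kw=hello'
```

Inspecting the Request object this way is a quick check that the form and headers are attached as intended before anything is sent to the server.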

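import json
import urllib.parse
import urllib.request

init_url = "https://fanyi.baidu.com/sug"

word = input("input your word:")
# Build the form data
form_data = {"kw": word}
form_data = urllib.parse.urlencode(form_data).encode()

# X-Requested-With is its own header, not part of the User-Agent string
headers = {
    "User-Agent": "Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 "
                  "(KHTML, like Gecko) Chrome/63.0.3239.132 Safari/537.36",
    "X-Requested-With": "XMLHttpRequest",
}
try:
    # Pass headers by keyword: the second positional argument of Request is data
    request = urllib.request.Request(init_url, headers=headers)
    # Supplying the form data makes urlopen send a POST request
    response = urllib.request.urlopen(request, form_data)

    # Parse the JSON body with json.loads; eval on remote data is unsafe
    temp_json = json.loads(response.read().decode())
    tran_word = temp_json['data']
    print("The word means:", tran_word[0]['v'])

except (IndexError, KeyError):
    # An empty or missing 'data' list means no translation was found
    print("input error! please input again.....")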
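
The parsing step can also be tried offline. The sketch below is hedged: first_meaning is a hypothetical helper, and the two sample bodies are made up to mirror the fields the program reads (data and v). The errno and k fields are assumptions about the response shape, not something this post confirms.

```python
import json


def first_meaning(body):
    """Return the first translation in a response body, or None if there is none.

    first_meaning is a hypothetical helper used only for illustration.
    """
    payload = json.loads(body)
    # Treat a missing or empty 'data' list as "no result"
    entries = payload.get("data") or []
    if not entries:
        return None
    return entries[0].get("v")


# Hypothetical sample bodies (assumed shape, not captured from the server)
hit = '{"errno": 0, "data": [{"k": "hello", "v": "int. hi"}]}'
miss = '{"errno": 0, "data": []}'

print(first_meaning(hit))   # int. hi
print(first_meaning(miss))  # None
```

Returning None instead of raising lets the caller decide how to report a bad input, rather than relying on an IndexError.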
