抖音视频爬取

首先要用手机连接电脑或者安装安卓模拟器,不断刷新抖音,可以从Fiddle里找到视频接口,同时也要将Fiddle里的Rules中的Customize Rules进行相应设置
抖音视频爬取_第1张图片
上图代码为:

    static function OnBeforeResponse(oSession: Session) {
	if (m_Hide304s && oSession.responseCode == 304) {
		oSession["ui-hide"] = "true";
	}
	if (oSession.uriContains("https://aweme-eagle-hl.snssdk.com/aweme/v1/")){
		var strBody=oSession.GetResponseBodyAsString();
		var sps = oSession.PathAndQuery.slice(-58,);
		//FiddlerObject.alert(sps)
		var curDate = new Date().getTime();   
		var filename = "C:/Users/Administrator/Desktop/抖音" + "/" + sps + curDate + ".json"; 
		var sw : System.IO.StreamWriter;  
		if (System.IO.File.Exists(filename)){  
			sw = System.IO.File.AppendText(filename);  
			sw.Write(strBody);  
		}  
		else{  
			sw = System.IO.File.CreateText(filename);  
			sw.Write(strBody);  
		}  
			
		sw.Close();  
		sw.Dispose();  
	}
    }

以下是在pycharm中进行编辑的代码:

import json
import os
import requests
order=os.listdir(r'C:\Users\Administrator\Desktop\抖音') #打开文件夹
start_urls=[]#创建空列表
for file in order: #遍历文件
    path=r'C:\Users\Administrator\Desktop\抖音\\'+file  #文件路径
    with open(path,'r',encoding='utf-8')as f:
        data1=f.read()  
    data=json.loads(data1)  #读取文件
    for ele in data['aweme_list']:  #遍历视频链接
        url=ele['video']['play_addr']['url_list']
        if url:
            start_urls.append(url[0])   #如果找到视频链接,添加到创建的空列表里
n=0
for i in start_urls:
    n+=1
    response=requests.get(i)
    with open('douyin/'+str(n)+'.mp4','wb')as f:
        f.write(response.content)   #保存视频

你可能感兴趣的:(抖音视频爬取)