1. 登陆自己的人人网,进入到网页 http://friend.renren.com/myfriendlistx.do#item_0 ,得到如下图的页面:
右键查看源代码,将下图中自己好友列表的一行复制出来,保存为 myFriends.dat (保存为一行的文本)。
2. 运行下图的源代码:
import urllib,urllib2,cookielib
import re
def MyFriends(filename):
patt2='"id":(\d+),"vip":[\s\S]+?"name":"([\s\S]+?)",\
"head":"([\s\S]+?)","groups":\["([\s\S]+?)"\]' #id,name,head,groups
patt='"id":(\d+),'
f=open(filename,'r')
for eachLine in f:
friendList=re.findall(patt,eachLine)
return friendList
def Login(t,JSESSIONID):
cookie={'t':t,'JSESSIONID':JSESSIONID}
cookies=''.join(x+'='+cookie[x]+';' for x in cookie)
return cookies
def ShareFriends(cookies,friendID):
opener=urllib2.build_opener(urllib2.HTTPCookieProcessor(\
cookielib.CookieJar()))
urllib2.install_opener(opener)
req = urllib2.Request('http://friend.renren.com/shareFriends?\
t=0.8142646627966315&p={"init":true,"uid":true,"uhead":true,\
"uname":true,"group":true,"net":true,\
"param":{"guest":'+friendID+'}}')
req.add_header('Cookie', cookies)
content = urllib2.urlopen(req).read()
patt='"id":(\d+),"netName":"'
friends=re.findall(patt,content)
return friends
def AllFriendsLinkCSV(cookies,friendList,filename):
f=open(filename,'w')
f.write('Source,Target\n')
for friendID in friendList:
f.write('xxxx,'+friendID+'\n') # replace XXXX with your ID
for friendID in friendList:
friends=ShareFriends(cookies,friendID)
for ID in friends:
f.write(friendID+','+ID+'\n')
f.close()
def main():
friendList=MyFriends('myFriends.data2')
t,JSESSIONID='xxxxx','xxxxx'
cookies=Login(t,JSESSIONID)
AllFriendsLinkCSV(cookies,friendList,'FriendsLink.csv')
if __name__=='__main__':
main()
ShareFriends 用于获取和某个好友的共同好友
AllFriendsLinkCSV 按照Gephi的指定格式将好友关系保存为csv文件。
main函数中的t,JSESSIONID通过如下方式获取后,取代上面代码中的xxxxx:
在人人页面中右键,审查元素,资源选项下的红色覆盖部分
3. 替换t和JSESSIONID值后运行上述代码,得到 FriendsLink.csv。按照下图在Gephi中导入csv文件:
回到概览就可以看见网络图:
经过一系列处理后就可以得到自己的社交图啦~
参考文献:
[1] 人人cookie登陆: http://www.guokr.com/post/317472/
[2] Gephi中文教程: https://www.udemy.com/gephi
[3] Coursera 上的Social network analysis课程: https://www.coursera.org/course/sna