Python实现对网站目录扫描

一个很简单的版本,以后会做进一步的修改:多线程,从文件中读取,跟据Head头判断等等.

需要提供一个网站和不存在页面的错误提示

CODE:

#!/usr/bin/env python
# -*- coding:utf-8 -*-
import urllib
import urlparse
import HTMLParser

class CheckHTML(HTMLParser.HTMLParser):
    available = True
    def check_data(self,data,ErrorSym):
        error = ErrorSym
        if error in data:
            self.available = False

target = raw_input("Please enter your target:")
errorsyb = raw_input("Enter the errorsyb:(eg:404 Error)")
check_list = ['admin','inc','data','include','count']

for li in check_list:
    new_urls = urlparse.urljoin("http://"+target+"/",li)
    d = urllib.urlopen(new_urls)
    data = d.read()
    check = CheckHTML()
    check.check_data(data,errorsyb)
    if check.available:
        print "%s------------->Found!"%new_urls
    else:
        print "%s------------->Not Found!"%new_urls

--------------------------------------------------------------------------------------------

The Result maybe like this :

Please enter your target:www.fuck.com
Enter the errorsyb:(eg:404 Error)HTTP 错误 404
http://www.fuck.com/admin------------->Found!
http://www.fuck.com/inc------------->Not Found!
http://www.fuck.com/data------------->Found!
http://www.fuck.com/include------------->Found!
http://www.fuck.com/count------------->Found!


   

你可能感兴趣的:(python,职场,目录,扫描,休闲)