String filePath=this.getServletConfig().getServletContext().getRealPath("/");
System.out.println("Protocol: " + request.getProtocol());
System.out.println("Scheme: " + request.getScheme());
System.out.println("Server Name: " + request.getServerName() );
System.out.println("Server Port: " + request.getServerPort());
System.out.println("Protocol: " + request.getProtocol());
System.out.println("Server Info: " + getServletConfig().getServletContext().getServerInfo());
System.out.println("Remote Addr: " + request.getRemoteAddr());
System.out.println("Remote Host: " + request.getRemoteHost());
System.out.println("Character Encoding: " + request.getCharacterEncoding());
System.out.println("Content Length: " + request.getContentLength());
System.out.println("Content Type: "+ request.getContentType());
System.out.println("Auth Type: " + request.getAuthType());
System.out.println("HTTP Method: " + request.getMethod());
System.out.println("Path Info: " + request.getPathInfo());
System.out.println("Path Trans: " + request.getPathTranslated());
System.out.println("Query String: " + request.getQueryString());
System.out.println("Remote User: " + request.getRemoteUser());
System.out.println("Session Id: " + request.getRequestedSessionId());
System.out.println("Request URI: " + request.getRequestURI());
System.out.println("Request URL: " + request.getRequestURL());
System.out.println("Servlet Path: " + request.getServletPath());
System.out.println("Accept: " + request.getHeader("Accept"));
System.out.println("Host: " + request.getHeader("Host"));
System.out.println("Referer : " + request.getHeader("Referer"));
System.out.println("Accept-Language : " + request.getHeader("Accept-Language"));
System.out.println("Accept-Encoding : " + request.getHeader("Accept-Encoding"));
System.out.println("User-Agent : " + request.getHeader("User-Agent"));
System.out.println("Connection : " + request.getHeader("Connection"));
System.out.println("Cookie : " + request.getHeader("Cookie"));
System.out.println("Created : " + session.getCreationTime());
System.out.println("LastAccessed : " + session.getLastAccessedTime());
-------------------------------------------------------------
Protocol: HTTP/1.1
Scheme: http
Server Name: localhost
Server Port: 8080
Protocol: HTTP/1.1
Server Info: Apache Tomcat/6.0.10
Remote Addr: 127.0.0.1
Remote Host: 127.0.0.1
Character Encoding: null
Content Length: 2676
Content Type: multipart/form-data; boundary=---------------------------7da26197405d2
Auth Type: null
HTTP Method: POST
Path Info: null
Path Trans: null
Query String: null
Remote User: null
Session Id: D6A3440E9AE430D4D4C64D3AF5918797
Request URI: /Struts2.5/pro.jsp
Request URL: http://localhost:8080/Struts2.5/pro.jsp
Servlet Path: /pro.jsp
Accept: */*
Host: localhost:8080
Referer : http://localhost:8080/Struts2.5/
Accept-Language : zh-cn
Accept-Encoding : gzip, deflate
User-Agent : Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1; CIBA)
Connection : Keep-Alive
Cookie : JSESSIONID=D6A3440E9AE430D4D4C64D3AF5918797
Created : 1271739994968
LastAccessed : 1271739994968
-------------------
可以通过HTTP header 内的 HTTP USER AGENT 识别引擎爬虫
一些常用的爬虫
百度爬虫
Baiduspider+(+http://www.baidu.com/search/spider.htm)
雅虎爬虫,分别是雅虎中国和美国总部的爬虫
Mozilla/5.0 (compatible; Yahoo! Slurp China; http://misc.yahoo.com.cn/help.html)
Mozilla/5.0 (compatible; Yahoo! Slurp; http://help.yahoo.com/help/us/ysearch/slurp)
新浪爱问爬虫
iaskspider/2.0(+http://iask.com/help/help_index.html)
Mozilla/5.0 (compatible; iaskspider/1.0; MSIE 6.0)
搜狗爬虫
Sogou web spider/3.0(+http://www.sogou.com/docs/help/webmasters.htm#07″)
Sogou Push Spider/3.0(+http://www.sogou.com/docs/help/webmasters.htm#07″)
【早期用法:“sogou spider”】
Google爬虫
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)
Google AdSense广告内容匹配爬虫
Mediapartners-Google/2.1
网易爬虫
Mozilla/5.0 (compatible; YodaoBot/1.0; http://www.yodao.com/help/webmaster/spider/;)
【早期采用“ OutfoxBot/0.5 (for internet experiments; http://”;
[email protected])”】
Alexa排名爬虫
ia_archiver
MSN爬虫
msnbot/1.0 (+http://search.msn.com/msnbot.htm)
特点未知
msnbot-media/1.0 (+http://search.msn.com/msnbot.htm)
据称为北大天网的搜索引擎爬虫程序
P.Arthur 1.1
看来是Qihoo的
Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; QihooBot 1.0)
Gigabot搜索引擎爬虫
Gigabot/2.0 (http://www.gigablast.com/spider.html)