在JSoup模拟登录网站(以校园内网为例)中我们只是模拟登录了华北电力大学内网,如下所示:
假设现在我们要访问其中某一个网站:新教务系统,其url为:
https://202-204-74-178.web.ncepu.edu.cn/jwglxt/xtgl/login_slogin.html
教务系统登录也需要账号密码,相当于我们经历了两次模拟登录。
我们先利用在JSoup模拟登录网站(以校园内网为例)中获得的cookies进入到新教务系统的登录页面:
public static void get_jwxt(Map<String, String> cookies) throws IOException {
String url_login = "https://202-204-74-178.web.ncepu.edu.cn/jwglxt/xtgl/login_slogin.html";
Document document = Jsoup.connect(url_login).cookies(cookies).post();
System.out.println(document);
}
这里面的cookies就是上一步登录获得的cookies。
如果没有这个cookies,我们像正常访问一个网页去访问新教务系统登录页面:
String url_login = "https://202-204-74-178.web.ncepu.edu.cn/jwglxt/xtgl/login_slogin.html";
Document document = Jsoup.connect(url_login).get();
System.out.println(document);
那么很不幸,打印出来的是内网的登录界面,也就是说:我们被强制退出了内网。
模拟登录新教务系统比较复杂,换成访问中国知网:
public static void get_cnki(Map<String, String> cookies) throws IOException {
String url = "https://www-cnki-net-443.web.ncepu.edu.cn/";
Connection connection = Jsoup.connect(url).cookies(cookies);
connection.header("User-Agent", "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/83.0.4103.61 Safari/537.36"); // 配置模拟浏览器
Response res = connection.execute(); // 获取响应
Document d = Jsoup.parse(res.body());
System.out.println(d);
}
打印:
<html xmlns="http://www.w3.org/1999/xhtml">
<head>head>
<body>
?
<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
<title>中国知网title>
<meta name="keywords" content="中国知网,数字出版,知识发现,知识服务,知识管理,数字出版,增强出版,CAJ-N,网络首发,CNKI首发,数字图书馆,学术文献,期刊,博士论文,硕士论文,会议论文,报纸,年鉴,统计数据,专利,科技成果,标准,法规,古籍,工具书,引文,图片搜索,外文文献">
<meta name="description" content="中国知网知识发现网络平台—面向海内外读者提供中国学术文献、外文文献、学位论文、报纸、会议、年鉴、工具书等各类资源统一检索、统一导航、在线阅读和下载服务。涵盖基础科学、文史哲、工程科技、社会科学、农业、经济与管理科学、医药卫生、信息科技等十大领域。">
<meta http-equiv="X-UA-Compatible" content="IE=edge,chrome=1">
<meta name="viewport" content="width=device-width, initial-scale=1.0, maximum-scale=1.0, user-scalable=no">
<link rel="stylesheet" href="https://piccache-cnki-net.web.ncepu.edu.cn/kdn/index/kns7/nvsmCSS_min/kindex.min.css?v=1.1">
<link rel="stylesheet" href="https://piccache-cnki-net.web.ncepu.edu.cn/kdn/index/kns7/nvsmCSS_min/royalslider.min.css?v=1.1">
<link rel="stylesheet" href="https://piccache-cnki-net.web.ncepu.edu.cn/kdn/index/kns7/nvsmCSS_min/rs-default.min.css?v=1.1">
<link rel="stylesheet" href="https://piccache-cnki-net.web.ncepu.edu.cn/kdn/index/kns7/nvsmCSS_min/index.min.css?v=3.1">
<script type="text/javascript" src="https://piccache-cnki-net.web.ncepu.edu.cn/kdn/index/kns7/nvsmScripts/jQuery-1.11.3.min.js?v=1.1">script>
<script type="text/javascript" src="https://piccache-cnki-net.web.ncepu.edu.cn/kdn/index/kns7/nvsmScripts/jquery.cookie.js?v=1.0">script>
<script type="text/javascript" src="https://piccache-cnki-net.web.ncepu.edu.cn/kdn/index/kns7/nvsmScripts/min/GlobalVar.min.js?v=1.21">script>
<script type="text/javascript" src="https://piccache-cnki-net.web.ncepu.edu.cn/kdn/index/kns7/nvsmScripts/min/cachespace.min.js?v=1.1">script>
<script type="text/javascript" src="https://piccache-cnki-net.web.ncepu.edu.cn/kdn/index/kns7/nvsmScripts/min/fieldJson.min.js?v=1.3">script>
<script type="text/javascript" src="https://piccache-cnki-net.web.ncepu.edu.cn/kdn/index/kns7/nvsmScripts/min/cnkisug.min.js?v=1.1">script>
<script type="text/javascript">
function CheckTerminalType() {
try {
var a = document.referrer;
if (a && a.toLowerCase().indexOf('wap-cnki-net.web.ncepu.edu.cn') > -1) {
return
}
var d = navigator.userAgent;
var b = d.toLowerCase();
if (b.indexOf('android') > -1 || b.indexOf('iphone') > -1) {
location.href = 'https://wap-cnki-net.web.ncepu.edu.cn/touch/web/guide'
}
} catch (c) {
}
};
CheckTerminalType();
script>
<div class="banner-main">
<div class="advbtn">
<a class="close-adv" href="javascript:void(0);" onclick="$(this).parents('.banner-main').slideUp();">Xa>
div>
<div class="banner-main-img">
<ul>
<li style="display: block; background-color: #041f68;"><a href="http://www.castscs.org.cn/?m=news&a=view&id=176532" target="_blank"><img src="https://piccache-cnki-net.web.ncepu.edu.cn/index/images2009/other/2020/topbanner02.jpg?0821">a>li>
<li style="background-color: #ba0e04;"> <a href="https://k-cnki-net.web.ncepu.edu.cn/Theme/Index/101" target="_blank"> <img src="https://piccache-cnki-net.web.ncepu.edu.cn/index/images2009/other/2020/topbanner03.jpg?08070813" usemap="#referemap" alt="全国政协“委员讲堂”"> <map name="referemap"> <area shape="rect" class="hotpoint" coords="1349,39,1451,67" hidefocus="true" onfocus="this.blur()" target="_blank" rel="noopener noreferrer" href="https://zxlzta-cnki-net.web.ncepu.edu.cn/"> map> a> li>
<li style="background-color: #3870cd;"> <img src="https://piccache-cnki-net.web.ncepu.edu.cn/index/images2009/other/2020/topbanner01.jpg?07070813" usemap="#buttonmap"> <map name="buttonmap"> <area shape="rect" class="hotpoint" coords="694,51,819,76" hidefocus="true" onfocus="this.blur()" target="_blank" rel="noopener noreferrer" href="//www.wjx.cn/jq/75177669.aspx"> <area shape="rect" class="hotpoint" coords="889,51,1013,76" hidefocus="true" onfocus="this.blur()" target="_blank" rel="noopener noreferrer" href="https://x-cnki-net.web.ncepu.edu.cn/search?value=4"> <area shape="rect" class="hotpoint" coords="1083,51,1206,76" hidefocus="true" onfocus="this.blur()" target="_blank" rel="noopener noreferrer" href="https://kf-cnki-net.web.ncepu.edu.cn/im/client/1oekjx"> map> li>
<li style="background-color: #093486;"><a href="https://piccache-cnki-net.web.ncepu.edu.cn/index/images2009/other/2020/proposal.html" target="_blank"><img src="https://piccache-cnki-net.web.ncepu.edu.cn/index/images2009/other/2020/proposalBanner.jpg?02110813">a>li>
ul>
div>
<div class="ui-controls">
<ul>
<li class="current">1li>
<li>2li>
<li>3li>
<li>4li>
ul>
div>
div>
<div class="wrapper section1">
<div class="header">
<div class="head-main">
<a class="link-home" href="https://www-cnki-net.web.ncepu.edu.cn">a>
<div class="head-right">
<div id="headerBox">
<script type="text/javascript" src="https://login-cnki-net-443.web.ncepu.edu.cn/TopLogin/api/loginapi/get?type=top&returnurl=https%3A%2F%2Fwww-cnki-net-443.web.ncepu.edu.cn%2f&localCSS=">script>
div>
<ul class="link-list">
<li><a href="javascript:void(0);" class="new-entrance">升级公告a>li>
<li><a href="https://kns8-cnki-net.web.ncepu.edu.cn/kns/" target="_blank" style="color:#ff9c00">新版入口a>li>
<li> <a href="javascript:void(0)">版本a>
<ul class="sublink-3">
<li><a href="https://m-cnki-net.web.ncepu.edu.cn/mcnkidown/index.html" target="_blank">手机版a>li>
<li><a href="https://oversea-cnki-net.web.ncepu.edu.cn" target="_blank">ENGLISHa>li>
<li><a href="https://www-cnki-net.web.ncepu.edu.cn/old" target="_blank">旧版a>li>
ul> li>
<li> <a href="javascript:void(0)">充值a>
<ul>
<li><a href="https://zhifu-cnki-net.web.ncepu.edu.cn/" target="_blank">充值中心a>li>
<li><a href="https://vipcard-cnki-net.web.ncepu.edu.cn/ec/skwd/skwd.htm" target="_blank">购买知网卡a>li>
ul> li>
<li> <a href="javascript:void(0)">帮助a>
<ul>
<li><a href="https://www-cnki-net.web.ncepu.edu.cn/subpages/webMap.html" target="_blank">网站地图a>li>
<li><a href="https://service-cnki-net.web.ncepu.edu.cn/helpcenter/" target="_blank">帮助中心a>li>
ul> li>
<li><a href="https://elib-cnki-net.web.ncepu.edu.cn/" target="_blank">个人/机构馆a>li>
<li><a href="https://my-cnki-net.web.ncepu.edu.cn/" target="_blank">我的CNKIa>li>
ul>
div>
div>
div>
<div class="searchmain">
<ul class="search-tab">
<li class="on" val="doc">文献检索li>
<li val="ele">知识元检索li>
<li val="ref">引文检索li>
ul>
<div class="search-form">
<input type="hidden" id="curdbcode" name="curdbcode" value="SCDB">
<input type="hidden" id="txt_sug" name="txt_sug" value="主题">
<div class="input-box">
<div class="sort reopt" id="DBFieldBox">
<div class="sort-default">
<span>主题span>
<i class="sort-icon">▼i>
div>
<ul class="sort-list" id="DBFieldList">
<li class="cur" val="0" field="SU$%=|"><a href="javascript:void(0);">主题a>li>
<li val="1" field="KY$=|"><a href="javascript:void(0);">关键词a>li>
<li val="2" field="TI$%=|"><a href="javascript:void(0);">篇名a>li>
<li val="3" field="FT$%=|"><a href="javascript:void(0);">全文a>li>
<li val="4" field="AU$=|"><a href="javascript:void(0);">作者a>li>
<li val="5" field="AF$%"><a href="javascript:void(0);">单位a>li>
<li val="6" field="AB$%=|"><a href="javascript:void(0);">摘要a>li>
<li val="7" field="RF$%=|"><a href="javascript:void(0);">被引文献a>li>
<li val="8" field="CLC$=|??"><a href="javascript:void(0);">中图分类号a>li>
<li val="9" field="LY$%=|"><a href="javascript:void(0);">文献来源a>li>
ul>
div>
<input class="search-input" id="txt_SearchText" name="txt_SearchText" type="text" autocomplete="off" placeholder="中文文献、外文文献" value="" maxlength="100" style="color: rgb(125, 125, 125);">
<input class="search-btn" type="button" value="">
div>
<div class="readvce">
<a href="https://kns-cnki-net.web.ncepu.edu.cn/kns/brief/result.aspx?dbprefix=scdb" target="_blank" id="highSearch">高级检索a>
<a href="https://navi-cnki-net.web.ncepu.edu.cn/knavi" target="_blank">出版物检索a>
div>
<div class="option-list">
<div class="wx">
<span class="tit">跨库span>
<ul>
<li class="selected" id="CJFQ" val="CJFQ" onclick="CrossDb(this)"><i>i><a href="https://kns-cnki-net.web.ncepu.edu.cn/kns/brief/result.aspx?dbprefix=CJFQ" target="_blank">学术期刊a>li>
<li class="selected" id="CDMD" val="CDFD,CMFD" onclick="CrossDb(this)"><i>i><a href="https://kns-cnki-net.web.ncepu.edu.cn/kns/brief/result.aspx?dbprefix=CDMD" target="_blank">博硕a>li>
<li class="selected" id="CIPD" val="CPFD,IPFD" onclick="CrossDb(this)"><i>i><a href="https://kns-cnki-net.web.ncepu.edu.cn/kns/brief/result.aspx?dbprefix=CIPD" target="_blank">会议a>li>
<li class="selected" id="CCND" val="CCND" onclick="CrossDb(this)"><i>i><a href="https://kns-cnki-net.web.ncepu.edu.cn/kns/brief/result.aspx?dbprefix=CCND" target="_blank">报纸a>li>
<li id="CYFD" val="CYFD" onclick="CrossDb(this)"><i>i><a href="https://kns-cnki-net.web.ncepu.edu.cn/kns/brief/result.aspx?dbprefix=CYFD" target="_blank">年鉴a>li>
<li id="SCOD" val="SCOD" onclick="CrossDb(this)"><i>i><a href="https://kns-cnki-net.web.ncepu.edu.cn/kns/brief/result.aspx?dbprefix=SCOD" target="_blank">专利a>li>
<li id="CISD" val="CISD" onclick="CrossDb(this)"><i>i><a href="https://kns-cnki-net.web.ncepu.edu.cn/kns/brief/result.aspx?dbprefix=CISD" target="_blank">标准a>li>
<li id="SNAD" val="SNAD" onclick="CrossDb(this)"><i>i><a href="https://kns-cnki-net.web.ncepu.edu.cn/kns/brief/result.aspx?dbprefix=SNAD" target="_blank">成果a>li>
ul>
<span class="tit">单库span>
<ul>
<li id="WWBD" val="WWBD" onclick="SingleDb(this)"><i>i><a href="https://kns-cnki-net.web.ncepu.edu.cn/kns/brief/result.aspx?dbprefix=WWBD" target="_blank">图书a>li>
<li id="GXDB" val="GXDB_SECTION" onclick="SingleDb(this)"><i>i><a href="https://kns-cnki-net.web.ncepu.edu.cn/kns/brief/result.aspx?dbprefix=GXDB_SECTION" target="_blank">古籍a>li>
<li id="CLKD" val="CLKD" onclick="LinkDb(this,'https://lawnew-cnki-net.web.ncepu.edu.cn')"><a href="javascript:void(0)">法律法规a>li>
<li id="gwkt" val="gwkt" onclick="LinkDb(this,'https://r-cnki-net.web.ncepu.edu.cn/kns/brief/result.aspx?dbprefix=gwkt')"><a href="javascript:void(0)">政府文件a>li>
<li id="scef" val="scef" onclick="LinkDb(this,'https://r-cnki-net.web.ncepu.edu.cn//KNS/brief/result.aspx?dbprefix=scef')"><a href="javascript:void(0)">企业标准a>li>
<li id="kjbg" val="kjbg" onclick="LinkDb(this,'https://r-cnki-net.web.ncepu.edu.cn/KNS/brief/result.aspx?dbPrefix=kjbg')"><a href="javascript:void(0)">科技报告a>li>
<li id="cgxx" val="cgxx" onclick="LinkDb(this,'https://r-cnki-net.web.ncepu.edu.cn/KNS/brief/result.aspx?dbPrefix=cgxx')"><a href="javascript:void(0)">政府采购a>li>
ul>
div>
<ul class="zsy">
<li class="selected" id="qa" val="qa" onclick="SingleDb(this)"><i>i><a href="https://qa-cnki-net.web.ncepu.edu.cn" target="_blank">知识问答a>li>
<li id="CRPD" val="CRPD" onclick="SingleDb(this)"><i>i><a href="https://kns-cnki-net.web.ncepu.edu.cn/kns/brief/result.aspx?dbprefix=CRPD" target="_blank">百科a>li>
<li id="CRDD" val="CRDD" onclick="SingleDb(this)"><i>i><a href="https://cidian-cnki-net.web.ncepu.edu.cn/" target="_blank">词典a>li>
<li id="CRMD" val="CRMD" onclick="SingleDb(this)"><i>i><a href="https://kns-cnki-net.web.ncepu.edu.cn/kns/brief/result.aspx?dbprefix=CRMD" target="_blank">手册a>li>
<li id="CRFD" val="CRFD" onclick="SingleDb(this)"><i>i><a href="https://gongjushu-cnki-net.web.ncepu.edu.cn" target="_blank">工具书a>li>
<li id="IMAGE" val="IMAGE" onclick="SingleDb(this)"><i>i><a href="https://image-cnki-net.web.ncepu.edu.cn" target="_blank">图片a>li>
<li id="CSYD" val="CSYD" onclick="SingleDb(this)"><i>i><a href="https://data-cnki-net.web.ncepu.edu.cn" target="_blank">统计数据a>li>
<li id="CIDX" val="CIDX" onclick="SingleDb(this)"><i>i><a href="https://kns-cnki-net.web.ncepu.edu.cn/kns/brief/default_result.aspx?islist=1&code=CIDX&singleDBName=%E6%8C%87%E6%95%B0" target="_blank">指数a>li>
<li id="METHOD" val="METHOD" onclick="SingleDb(this)"><i>i><a href="https://method-cnki-net.web.ncepu.edu.cn/" target="_blank">方法a>li>
<li id="CONC" val="CONC" onclick="SingleDb(this)"><i>i><a href="https://concept-cnki-net.web.ncepu.edu.cn/" target="_blank">概念a>li>
ul>
<ul class="yw">
<li id="ywk" class="selected" onclick="LinkDb(this,'https://ref-cnki-net.web.ncepu.edu.cn/ref')"><i>i><a href="javascript:void(0)">中国引文数据库a>li>
ul>
div>
div>
div>
<div class="product-content">
<div class="unit unit-box1">
<h4>行业知识服务与知识管理平台h4>
<ul>
<li> <h6>农林牧渔、卫生、科学研究h6> <a href="https://ypt-cnki-net.web.ncepu.edu.cn" target="_blank">农业a> <a href="https://shipin-cnki-net.web.ncepu.edu.cn" target="_blank">食品a> <a href="https://chkd-cnki-net.web.ncepu.edu.cn" target="_blank">医疗a> <a href="https://r-cnki-net.web.ncepu.edu.cn/index/yaoye" target="_blank">药业a> <a href="https://r-cnki-net.web.ncepu.edu.cn/index/cdchy" target="_blank">公共卫生a> <a href="https://r-cnki-net.web.ncepu.edu.cn/index/gtzy" target="_blank">自然资源a> <a href="https://crjt-cnki-net.web.ncepu.edu.cn" target="_blank">海关检验a> <a href="https://r-cnki-net.web.ncepu.edu.cn/index/sthb" target="_blank">生态环境a> <a href="https://r-cnki-net.web.ncepu.edu.cn/index/shuili" target="_blank">水利a> <a href="https://r-cnki-net.web.ncepu.edu.cn/index/qixiang" target="_blank">气象a> <a href="https://r-cnki-net.web.ncepu.edu.cn/index/haiyang" target="_blank">海洋a> <a href="https://r-cnki-net.web.ncepu.edu.cn/index/dizhen" target="_blank">地震a> li>
<li> <h6>建筑、能源、冶炼、交通运输h6> <a href="https://r-cnki-net.web.ncepu.edu.cn/index/up" target="_blank">城乡规划a> <a href="https://r-cnki-net.web.ncepu.edu.cn/index/bmi" target="_blank">建筑材料a> <a href="https://r-cnki-net.web.ncepu.edu.cn/index/bc" target="_blank">建筑a> <a href="https://r-cnki-net.web.ncepu.edu.cn/index/dianli" target="_blank">电力a> <a href="https://r-cnki-net.web.ncepu.edu.cn/index/yjgy" target="_blank">冶金a> <a href="https://oil-cnki-net.web.ncepu.edu.cn" target="_blank">石油a> <a href="https://mt-cnki-net.web.ncepu.edu.cn/" target="_blank">煤炭a> <a href="https://r-cnki-net.web.ncepu.edu.cn/index/jt" target="_blank">交通a> <a href="https://r-cnki-net.web.ncepu.edu.cn/index/chb" target="_blank">船舶a> li>
<li> <h6>制造、信息技术、贸易h6> <a href="https://r-cnki-net.web.ncepu.edu.cn/index/qchy" target="_blank">汽车a> <a href="https://r-cnki-net.web.ncepu.edu.cn/index/jxhy" target="_blank">机械a> <a href="https://r-cnki-net.web.ncepu.edu.cn/index/hkhy" target="_blank">航空a> <a href="https://r-cnki-net.web.ncepu.edu.cn/index/hthy" target="_blank">航天a> <a href="https://myhk-cnki-net.web.ncepu.edu.cn/" target="_blank">民用航空a> <a href="https://r-cnki-net.web.ncepu.edu.cn/index/dzhy" target="_blank">电子a> <a href="https://r-cnki-net.web.ncepu.edu.cn/index/dqhy" target="_blank">电气a> <a href="https://r-cnki-net.web.ncepu.edu.cn/index/jdhy" target="_blank">家电a> <a href="https://r-cnki-net.web.ncepu.edu.cn/index/chemsci" target="_blank">化工a> <a href="https://r-cnki-net.web.ncepu.edu.cn/index/ychy" target="_blank">烟草a> <a href="https://r-cnki-net.web.ncepu.edu.cn/index/fzhy" target="_blank">纺织a> <a href="https://r-cnki-net.web.ncepu.edu.cn/index/mc" target="_blank">信息通信a> <a href="https://smpt-cnki-net.web.ncepu.edu.cn/" target="_blank">商贸a> li>
<li> <h6>党政、社团、国防、法律、金融h6> <a href="https://r-cnki-net.web.ncepu.edu.cn/index/dang" target="_blank">党委a> <a href="https://law-cnki-net.web.ncepu.edu.cn/rdjc/" target="_blank">人大a> <a href="https://r-cnki-net.web.ncepu.edu.cn/index/zhengfu" target="_blank">政府a> <a href="https://zxlzta-cnki-net.web.ncepu.edu.cn/" target="_blank">政协a> <a href="https://r-cnki-net.web.ncepu.edu.cn/index/zhiku" target="_blank">智库a> <a href="https://r-cnki-net.web.ncepu.edu.cn/index/zzrs" target="_blank">党校a> <a href="https://r-cnki-net.web.ncepu.edu.cn/index/shetuan" target="_blank">社团a> <a href="https://r-cnki-net.web.ncepu.edu.cn/index/81" target="_blank">国防a> <a href="https://law-cnki-net.web.ncepu.edu.cn" target="_blank">法律a> <a href="https://r-cnki-net.web.ncepu.edu.cn/index/fin" target="_blank">金融a> <a href="https://kjs-cnki-net.web.ncepu.edu.cn/" target="_blank">会计a> li>
<li> <h6>教育、公共文化、社会服务h6> <a href="https://gj-cnki-net.web.ncepu.edu.cn" target="_blank">高教a> <a href="https://zyjy-cnki-net.web.ncepu.edu.cn/" target="_blank">职教a> <a href="https://cfed-cnki-net.web.ncepu.edu.cn/" target="_blank">基教a> <a href="https://r-cnki-net.web.ncepu.edu.cn/index/lib" target="_blank">公共图书馆a>