如题:
目标:爬取天善最热博文列表(https://blog.hellobi.com/hot/weekly)对应的博文信息存入mysql数据库中。
暂定的博文相关信息有:博文链接(url)、标题(title)、作者(author)、阅读数(readNum)、推荐数(recommendNum)、作者博客地址(blogHomeUrl)、评论数(commentNum)、发布时间(publishTime)、博文内容(content)。
根据相应的信息建表:
CREATE TABLE `hot_weekly_blogs` (
`id` INT(11) NOT NULL AUTO_INCREMENT,
`url` VARCHAR(100) DEFAULT NULL,
`title` VARCHAR(100) DEFAULT NULL,
`author` VARCHAR(50) DEFAULT NULL,
`readNum` INT(11) DEFAULT NULL,
`recommendNum` INT(11) DEFAULT NULL,
`blogHomeUrl` VARCHAR(100) DEFAULT NULL,
`commentNum` INT(11) DEFAULT NULL,
`publishTime` VARCHAR(20) DEFAULT NULL,
`content` MEDIUMTEXT,
PRIMARY KEY (`id`)
) ENGINE=INNODB AUTO_INCREMENT=69 DEFAULT CHARSET=utf8;
1)新建maven工程,添加依赖:
<dependency>
    <groupId>us.codecraft</groupId>
    <artifactId>webmagic-core</artifactId>
    <version>0.7.3</version>
</dependency>
<dependency>
    <groupId>us.codecraft</groupId>
    <artifactId>webmagic-extension</artifactId>
    <version>0.7.3</version>
</dependency>
2)编写数据库工具类:
package com.qingqiuyue.ashura.util;
import java.sql.*;
import java.util.List;
/**
 * Thin JDBC helper around a single shared MySQL connection.
 *
 * <p>The connection is created lazily and cached in a static field (one per
 * JVM). Callers create a {@code DBHelper}, run queries/updates, then call
 * {@link #close()}.</p>
 *
 * <p>NOTE(review): connection parameters are hard-coded; consider moving them
 * to external configuration.</p>
 */
public class DBHelper {
    public static final String driver_class = "com.mysql.jdbc.Driver";
    // Fix: the Connector/J property is "useUnicode" (case-sensitive); the
    // original "useunicode" was silently ignored, risking broken UTF-8.
    public static final String driver_url =
            "jdbc:mysql://localhost/ashura?useUnicode=true&characterEncoding=utf8";
    public static final String user = "root";
    public static final String password = "root";

    // Shared, lazily-created connection — see getConnInstance().
    private static Connection conn = null;
    private PreparedStatement pst = null;
    private ResultSet rst = null;

    /**
     * Opens (or reuses) the shared connection.
     */
    public DBHelper() {
        try {
            conn = DBHelper.getConnInstance();
        } catch (Exception e) {
            e.printStackTrace();
        }
    }

    /**
     * Lazily creates the shared connection. Synchronized so concurrent
     * callers cannot race on the null check.
     *
     * @return the shared connection, or null if it could not be opened
     */
    private static synchronized Connection getConnInstance() {
        if (conn == null) {
            try {
                Class.forName(driver_class);
                conn = DriverManager.getConnection(driver_url, user, password);
                // Fix: only report success after the connection actually
                // opened (the original printed this even on failure).
                System.out.println("连接数据库成功");
            } catch (ClassNotFoundException e) {
                e.printStackTrace();
            } catch (SQLException e) {
                e.printStackTrace();
            }
        }
        return conn;
    }

    /**
     * Releases JDBC resources innermost-first (ResultSet, then
     * PreparedStatement, then Connection).
     *
     * <p>Fix: the original closed the connection first and never reset the
     * static {@code conn} to null, so every DBHelper created after a close()
     * received a dead cached connection.</p>
     */
    public void close() {
        try {
            if (rst != null) {
                rst.close();
                rst = null;
            }
            if (pst != null) {
                pst.close();
                pst = null;
            }
            synchronized (DBHelper.class) {
                if (conn != null) {
                    conn.close();
                    conn = null; // allow a later DBHelper to reconnect
                }
            }
            System.out.println("关闭数据库成功");
        } catch (SQLException e) {
            e.printStackTrace();
        }
    }

    /**
     * Executes a SELECT with optional positional parameters.
     *
     * @param sql       statement with '?' placeholders
     * @param sqlValues values bound in order, may be null or empty
     * @return the ResultSet, or null if the query failed
     */
    public ResultSet executeQuery(String sql, List<?> sqlValues) {
        try {
            pst = conn.prepareStatement(sql);
            if (sqlValues != null && sqlValues.size() > 0) {
                setSqlValues(pst, sqlValues);
            }
            rst = pst.executeQuery();
        } catch (SQLException e) {
            e.printStackTrace();
        }
        return rst;
    }

    /**
     * Executes an INSERT/UPDATE/DELETE with optional positional parameters.
     *
     * @param sql       statement with '?' placeholders
     * @param sqlValues values bound in order, may be null or empty
     * @return affected row count, or -1 on failure
     */
    public int executeUpdate(String sql, List<?> sqlValues) {
        int result = -1;
        try {
            pst = conn.prepareStatement(sql);
            if (sqlValues != null && sqlValues.size() > 0) {
                setSqlValues(pst, sqlValues);
            }
            result = pst.executeUpdate();
        } catch (SQLException e) {
            e.printStackTrace();
        }
        return result;
    }

    /**
     * Binds each value to its 1-based placeholder position.
     */
    private void setSqlValues(PreparedStatement pst, List<?> sqlValues) {
        for (int i = 0; i < sqlValues.size(); i++) {
            try {
                pst.setObject(i + 1, sqlValues.get(i));
            } catch (SQLException e) {
                e.printStackTrace();
            }
        }
    }
}
3)创建对应的实体对象:
package com.qingqiuyue.ashura.domain;
/**
 * Value object holding one crawled blog post.
 *
 * <p>Counter-like fields (readNum, recommendNum, commentNum) are kept as
 * Strings because they are extracted as raw page text; the database layer
 * stores them into INT columns.</p>
 */
public class BlogInfo {
    private String url;           // detail-page URL of the post
    private String title;         // post title
    private String author;        // author display name
    private String readNum;       // view count (raw text)
    private String recommendNum;  // like/recommend count (raw text)
    private String blogHomeUrl;   // author's blog home URL
    private String commentNum;    // comment count (raw text)
    private String publishTime;   // normalized "yyyy-MM-dd"
    private String content;       // post body, HTML included

    public String getUrl() {
        return url;
    }

    public void setUrl(String url) {
        this.url = url;
    }

    public String getTitle() {
        return title;
    }

    public void setTitle(String title) {
        this.title = title;
    }

    public String getAuthor() {
        return author;
    }

    public void setAuthor(String author) {
        this.author = author;
    }

    public String getReadNum() {
        return readNum;
    }

    public void setReadNum(String readNum) {
        this.readNum = readNum;
    }

    public String getRecommendNum() {
        return recommendNum;
    }

    public void setRecommendNum(String recommendNum) {
        this.recommendNum = recommendNum;
    }

    public String getBlogHomeUrl() {
        return blogHomeUrl;
    }

    public void setBlogHomeUrl(String blogHomeUrl) {
        this.blogHomeUrl = blogHomeUrl;
    }

    public String getCommentNum() {
        return commentNum;
    }

    public void setCommentNum(String commentNum) {
        this.commentNum = commentNum;
    }

    public String getPublishTime() {
        return publishTime;
    }

    public void setPublishTime(String publishTime) {
        this.publishTime = publishTime;
    }

    public String getContent() {
        return content;
    }

    public void setContent(String content) {
        this.content = content;
    }

    /**
     * Fix: the crawler logs {@code blog.toString()} for every post; without
     * this override it printed only the identity hash. Content is omitted
     * because it can be very large HTML.
     */
    @Override
    public String toString() {
        return "BlogInfo{url=" + url
                + ", title=" + title
                + ", author=" + author
                + ", readNum=" + readNum
                + ", recommendNum=" + recommendNum
                + ", blogHomeUrl=" + blogHomeUrl
                + ", commentNum=" + commentNum
                + ", publishTime=" + publishTime
                + "}";
    }
}
4)Dao接口层
package com.qingqiuyue.ashura.service;
import com.qingqiuyue.ashura.domain.BlogInfo;
/**
* 博文 数据持久化 接口
* @author Jasmine
*/
/**
 * Persistence contract for crawled blog posts.
 *
 * @author Jasmine
 */
public interface BlogDao {

    /**
     * Persists a single blog post.
     *
     * @param blog the post to store
     * @return number of affected rows (1 on success, -1 on failure)
     */
    int saveBlog(BlogInfo blog);
}
5)Dao实现类
package com.qingqiuyue.ashura.service.impl;
import com.qingqiuyue.ashura.domain.BlogInfo;
import com.qingqiuyue.ashura.service.BlogDao;
import com.qingqiuyue.ashura.util.DBHelper;
import java.util.ArrayList;
import java.util.List;
/**
* 博客 数据库持久化接口 实现
* @author Jasmine
*/
/**
 * JDBC implementation of {@link BlogDao} backed by {@link DBHelper}.
 *
 * @author Jasmine
 */
public class BlogDaoImpl implements BlogDao {

    /**
     * Inserts one post into {@code hot_weekly_blogs} via a parameterized
     * statement (values bound with '?', never string-concatenated).
     *
     * @param blog the post to store
     * @return affected row count, or -1 on failure
     */
    @Override
    public int saveBlog(BlogInfo blog) {
        DBHelper dbhelper = new DBHelper();
        String sql = "INSERT INTO hot_weekly_blogs"
                + "(url,title,author,readNum,recommendNum,blogHomeUrl,commentNum,publishTime,content)"
                + "VALUES (? , ? , ? , ? , ? , ? , ? , ? , ? ) ";
        // Bind values in column order.
        // Fix: the getters already return String — the original "" + get...()
        // concatenation turned a null field into the literal string "null"
        // in the database instead of SQL NULL.
        List<Object> sqlValues = new ArrayList<>();
        sqlValues.add(blog.getUrl());
        sqlValues.add(blog.getTitle());
        sqlValues.add(blog.getAuthor());
        sqlValues.add(blog.getReadNum());
        sqlValues.add(blog.getRecommendNum());
        sqlValues.add(blog.getBlogHomeUrl());
        sqlValues.add(blog.getCommentNum());
        sqlValues.add(blog.getPublishTime());
        sqlValues.add(blog.getContent());
        return dbhelper.executeUpdate(sql, sqlValues);
    }
}
6)编写PageProcessor:
PageProcessor中的process方法是webmagic的核心,负责抽取目标url的逻辑。
package com.qingqiuyue.ashura.webmagic;
import com.qingqiuyue.ashura.service.BlogDao;
import com.qingqiuyue.ashura.service.impl.BlogDaoImpl;
import com.qingqiuyue.ashura.domain.BlogInfo;
import us.codecraft.webmagic.Page;
import us.codecraft.webmagic.Site;
import us.codecraft.webmagic.Spider;
import us.codecraft.webmagic.processor.PageProcessor;
import java.text.SimpleDateFormat;
import java.util.Calendar;
import java.util.Date;
import java.util.regex.Matcher;
import java.util.regex.Pattern;
/**
* Created by Administrator on 2017/6/11.
*/
/**
 * WebMagic processor that crawls the hellobi "hot weekly" blog listing,
 * follows every post's detail page, extracts its fields, and persists each
 * post through {@link BlogDao}.
 */
public class BlogPageProcessor implements PageProcessor {
    // Crawl configuration: 10 retries per failed request, 1 s politeness delay.
    private Site site = Site.me().setRetryTimes(10).setSleepTime(1000);

    // Number of posts scraped so far.
    // NOTE(review): incremented from up to 5 spider threads without
    // synchronization — counts may be slightly off under contention.
    private static int num = 0;

    // Persistence layer used to store each parsed post.
    private BlogDao blogDao = new BlogDaoImpl();

    // Matches an already-normalized "yyyy-MM-dd" date.
    // Fix: compiled once instead of on every detail page.
    private static final Pattern DATE_PATTERN = Pattern.compile("^\\d{4}-\\d{2}-\\d{2}$");

    public static void main(String[] args) throws Exception {
        long startTime, endTime;
        System.out.println("========天善最热博客小爬虫【启动】喽!=========");
        startTime = new Date().getTime();
        Spider.create(new BlogPageProcessor()).addUrl("https://blog.hellobi.com/hot/weekly?page=1").thread(5).run();
        endTime = new Date().getTime();
        System.out.println("========天善最热博客小爬虫【结束】喽!=========");
        System.out.println("一共爬到" + num + "篇博客!用时为:" + (endTime - startTime) / 1000 + "s");
    }

    @Override
    public void process(Page page) {
        // 1. Listing page (entry point): queue every post's detail URL plus
        //    the next listing page.
        if (page.getUrl().regex("https://blog\\.hellobi\\.com/hot/weekly\\?page=\\d+").match()) {
            // Detail-page links.
            page.addTargetRequests(page.getHtml().xpath("//h2[@class='title']/a").links().all());
            // Next listing page.
            page.addTargetRequests(page.getHtml().xpath("//a[@rel='next']").links().all());
        }
        // 2. Detail page: extract the post's fields and persist them.
        else {
            try {
                BlogInfo blog = new BlogInfo();
                // Title.
                String title = page.getHtml().xpath("//h1[@class='clearfix']/a/text()").get();
                // Post URL.
                String url = page.getHtml().xpath("//h1[@class='clearfix']/a/@href").get();
                // Author display name.
                String author = page.getHtml().xpath("//section[@class='sidebar']/div/div/a[@class='aw-user-name']/text()").get();
                // Author's blog home URL.
                String blogHomeUrl = page.getHtml().xpath("//section[@class='sidebar']/div/div/a[@class='aw-user-name']/@href").get();
                // Post body — raw HTML kept as-is; post-processing can strip tags later.
                String content = page.getHtml().xpath("//div[@class='message-content editor-style']").get();
                // Recommend (like) count.
                String recommendNum = page.getHtml().xpath("//a[@class='agree']/b/text()").get();
                // Comment count, e.g. "3 个评论" -> "3".
                String commentNum = page.getHtml().xpath("//div[@class='aw-mod']/div/h2/text()").get().split("个")[0].trim();
                // View count, text after the full-width colon.
                String readNum = page.getHtml().xpath("//div[@class='row']/div/div/div/div/span/text()").get().split(":")[1].trim();
                // Raw publish-time text after the full-width colon; normalized below.
                String time = page.getHtml().xpath("//time[@class='time']/text()").get().split(":")[1].trim();

                SimpleDateFormat df = new SimpleDateFormat("yyyy-MM-dd");
                // Current date (fix: the original called Calendar.getInstance() twice).
                Calendar cal = Calendar.getInstance();
                String publishTime = null;
                Matcher m = DATE_PATTERN.matcher(time);
                if (m.matches()) {
                    // Already "yyyy-MM-dd" — keep as-is.
                    publishTime = time;
                } else if (time.contains("天")) {
                    // e.g. "1天前" -> today minus that many days.
                    int days = Integer.parseInt(time.split("天")[0].trim());
                    cal.add(Calendar.DAY_OF_MONTH, -days);
                    publishTime = df.format(cal.getTime());
                } else {
                    // "x分钟前" / "x小时前" etc. -> today's date.
                    publishTime = df.format(cal.getTime());
                }

                // Populate and persist.
                blog.setUrl(url);
                blog.setTitle(title);
                blog.setAuthor(author);
                blog.setBlogHomeUrl(blogHomeUrl);
                blog.setCommentNum(commentNum);
                blog.setRecommendNum(recommendNum);
                blog.setReadNum(readNum);
                blog.setContent(content);
                blog.setPublishTime(publishTime);
                num++;
                System.out.println("num:" + num + " " + blog.toString());
                blogDao.saveBlog(blog);
            } catch (Exception e) {
                e.printStackTrace();
            }
        }
    }

    @Override
    public Site getSite() {
        return this.site;
    }
}
完事,截图目录结构:
运行以及效果:
PS:1.本来直接转载的,发现有些问题,又截了一遍图
2.代码顺序变了一下,后面的需要引用前面的……
原文链接:
基于webmagic框架的爬虫小Demo:https://ask.hellobi.com/blog/jasmine3happy/8537