sky_551

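High availability with corosync + pacemaker + crmsh

1. Introduction and environment

2. Deploying the high-availability environment

3. Using the crmsh interface

4. Case study

5. Summary

1. Introduction and environment

The previous post covered the theory behind high availability. This post walks through installing and deploying the corosync + pacemaker high-availability stack and demonstrates it with a practical example. corosync provides the cluster's messaging layer, carrying heartbeat and cluster transaction messages. pacemaker works at the resource allocation layer as the cluster resource manager, and crmsh is the command-line interface used to configure resources. Before getting started, here is a look at the common open-source HA stacks and the system environment used for this setup.

Common open-source HA stacks: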

heartbeat v1 + haresources

heartbeat v2 + crm

heartbeat v3 + cluster-glue + pacemaker

corosync + cluster-glue + pacemaker

cman + rgmanager

keepalived + script

System environment used for this test:

[root@nod1 tomcat]# cat /etc/issue
CentOS release 6.4 (Final)
Kernel \r on an \m
[root@nod1 tomcat]# uname -r
2.6.32-358.el6.x86_64

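Both nodes run the same operating system.

2. Deploying the high-availability environment

[root@nod1 ~]# yum -y install pacemaker corosync    # pacemaker and corosync can be installed with yum once a yum repository is configured; note: install them on both nodes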
[root@nod1 ~]# rpm -ql corosync
/etc/corosync
/etc/corosync/corosync.conf.example    # template for the main configuration file
/etc/corosync/corosync.conf.example.udpu
/etc/corosync/service.d
/etc/corosync/uidgid.d
/etc/dbus-1/system.d/corosync-signals.conf
/etc/rc.d/init.d/corosync
/etc/rc.d/init.d/corosync-notifyd
/etc/sysconfig/corosync-notifyd
/usr/bin/corosync-blackbox
/usr/libexec/lcrso
/usr/libexec/lcrso/coroparse.lcrso
/usr/libexec/lcrso/objdb.lcrso
/usr/libexec/lcrso/quorum_testquorum.lcrso
/usr/libexec/lcrso/quorum_votequorum.lcrso
/usr/libexec/lcrso/service_cfg.lcrso
/usr/libexec/lcrso/service_confdb.lcrso
/usr/libexec/lcrso/service_cpg.lcrso
/usr/libexec/lcrso/service_evs.lcrso
/usr/libexec/lcrso/service_pload.lcrso
/usr/libexec/lcrso/vsf_quorum.lcrso
/usr/libexec/lcrso/vsf_ykd.lcrso
/usr/sbin/corosync
/usr/sbin/corosync-cfgtool
/usr/sbin/corosync-cpgtool
/usr/sbin/corosync-fplay
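/usr/sbin/corosync-keygen    # generates the authkey for corosync from the kernel entropy pool; if the pool runs low the command blocks, and you have to keep typing on the keyboard until enough randomness has been gathered to write the authkey file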
/usr/sbin/corosync-notifyd
/usr/sbin/corosync-objctl
/usr/sbin/corosync-pload
/usr/sbin/corosync-quorumtool
/usr/share/doc/corosync-1.4.7
/usr/share/doc/corosync-1.4.7/LICENSE
/usr/share/doc/corosync-1.4.7/SECURITY
/usr/share/man/man5/corosync.conf.5.gz
/usr/share/man/man8/confdb_keys.8.gz
/usr/share/man/man8/corosync-blackbox.8.gz
/usr/share/man/man8/corosync-cfgtool.8.gz
/usr/share/man/man8/corosync-cpgtool.8.gz
/usr/share/man/man8/corosync-fplay.8.gz
/usr/share/man/man8/corosync-keygen.8.gz
/usr/share/man/man8/corosync-notifyd.8.gz
/usr/share/man/man8/corosync-objctl.8.gz
/usr/share/man/man8/corosync-pload.8.gz
/usr/share/man/man8/corosync-quorumtool.8.gz
/usr/share/man/man8/corosync.8.gz
/usr/share/man/man8/corosync_overview.8.gz
/usr/share/snmp/mibs/COROSYNC-MIB.txt
/var/lib/corosync
/var/log/cluster

Generate the authentication file shared by the cluster nodes:

[root@nod1 ~]# corosync-keygen   # generate the authentication file
Corosync Cluster Engine Authentication key generator.
Gathering 1024 bits for key from /dev/random.
Press keys on your keyboard to generate entropy.
Press keys on your keyboard to generate entropy (bits = 80).
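# when the entropy pool runs low the command waits here; you can open another terminal and carry on with the rest of the configuration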
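Before running corosync-keygen it can help to see how much entropy the kernel currently reports. The following is a minimal sketch, assuming a Linux system that exposes the estimate at /proc/sys/kernel/random/entropy_avail; the function names are made up for this illustration, and the 1024-bit figure is taken from the corosync-keygen output above.

```python
from pathlib import Path

# Assumed location of the kernel's entropy estimate on Linux.
ENTROPY_FILE = "/proc/sys/kernel/random/entropy_avail"


def available_entropy_bits(path=ENTROPY_FILE):
    """Return the kernel's current entropy estimate, in bits."""
    return int(Path(path).read_text().strip())


def enough_for_key(available_bits, needed_bits=1024):
    """True if the pool already holds the bits corosync-keygen asks for."""
    return available_bits >= needed_bits


try:
    bits = available_entropy_bits()
except (OSError, ValueError):
    print("entropy estimate not available on this system")
else:
    print(f"{bits} bits available, enough for the key: {enough_for_key(bits)}")
```

If the reported value is well below 1024, expect corosync-keygen to wait for keyboard input as described above.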

Create the corosync configuration file from the template:

[root@nod1 ~]# cd /etc/corosync
[root@nod1 corosync]# cp corosync.conf.example corosync.conf  
[root@nod1 corosync]# ls
corosync.conf.example       service.d   corosync.conf  corosync.conf.example.udpu  uidgid.d
[root@nod1 corosync]# vim corosync.conf
# Please read the corosync.conf.5 manual page
compatibility: whitetank      # stay backward compatible with whitetank, i.e. the older openais 0.80.z releases
totem {     # defines how the corosync instances in the cluster communicate
        version: 2
        # secauth: Enable mutual node authentication. If you choose to
        # enable this ("on"), then do remember to create a shared
        # secret with "corosync-keygen".
        #secauth: off
        secauth: on   # authenticate the nodes with the shared authkey
        threads: 0   # number of threads to start; 0 disables threading, the default is fine
        # interface: define at least one interface to communicate
        # over. If you define more than one interface stanza, you must
        # also set rrp_mode.
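        interface {                   # defines the interface that carries heartbeat and cluster transaction messages
                # Rings must be consecutively numbered, starting at 0.
                ringnumber: 0    # number of the ring this interface belongs to; with a single interface keep the default 0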
                # This is normally the *network* address of the
                # interface to bind to. This ensures that you can use
                # identical instances of this configuration file
                # across all your cluster nodes, without having to
                # modify this option.
                bindnetaddr: 192.168.0.0      # network address to bind to
                # However, if you have multiple physical network
                # interfaces configured for the same subnet, then the
                # network address alone is not sufficient to identify
                # the interface Corosync should bind to. In that case,
                # configure the *host* address of the interface
                # instead:
                # bindnetaddr: 192.168.1.1
                # When selecting a multicast address, consider RFC
                # 2365 (which, among other things, specifies that
                # 239.255.x.x addresses are left to the discretion of
                # the network administrator). Do not reuse multicast
                # addresses across multiple Corosync clusters sharing
                # the same network.
                mcastaddr: 239.255.21.111 # multicast address to use; do not keep the template default
                # Corosync uses the port you specify here for UDP
                # messaging, and also the immediately preceding
                # port. Thus if you set this to 5405, Corosync sends
                # messages over UDP ports 5405 and 5404.
                mcastport: 5405    # port used for messaging between corosync instances; the default is fine
                # Time-to-live for cluster communication packets. The
                # number of hops (routers) that this ring will allow
                # itself to pass. Note that multicast routing must be
                # specifically enabled on most network routers.
                ttl: 1    # time-to-live of the packets; keep the default
        }
}
logging {
        # Log the source file and line where messages are being
        # generated. When in doubt, leave off. Potentially useful for
        # debugging.
        fileline: off
        # Log to standard error. When in doubt, set to no. Useful when
        # running in the foreground (when invoking "corosync -f")
        to_stderr: no
        # Log to a log file. When set to "no", the "logfile" option
        # must not be set.
        to_logfile: yes
        logfile: /var/log/cluster/corosync.log
        # Log to the system log daemon. When in doubt, set to yes.
        to_syslog: no    # do not send logs to syslog
        # Log debug messages (very verbose). When in doubt, leave off.
        debug: off
        # Log messages with time stamps. When in doubt, set to on
        # (unless you are only logging to syslog, where double
        # timestamps can be annoying).
        timestamp: on    # prefix log messages with a timestamp; this adds some CPU overhead
        logger_subsys {
                subsys: AMF
                debug: off
        }
}
# add the following sections
service {
        ver: 0
        name: pacemaker    # start pacemaker as a corosync plugin
}
aisexec {    # user and group that openais runs as; root is already the default, so this section is optional
        user: root
        group: root
}
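The bindnetaddr value is the network address of the subnet the heartbeat interface sits on, and the template comments say that corosync uses mcastport and the port immediately before it. The sketch below derives both from a host address, using Python's standard ipaddress module. The helper names are hypothetical, and the /24 prefix is taken from the ip output shown later in this post.

```python
import ipaddress


def bind_net_addr(host_ip, prefix_len):
    """Network address of the subnet that host_ip/prefix_len belongs to."""
    iface = ipaddress.ip_interface(f"{host_ip}/{prefix_len}")
    return str(iface.network.network_address)


def udp_ports(mcastport):
    """The configured port and the one immediately before it."""
    return (mcastport, mcastport - 1)


print(bind_net_addr("192.168.0.201", 24))  # 192.168.0.0
print(udp_ports(5405))                     # (5405, 5404)
```

Both ports need to be reachable between the nodes if a firewall sits in the path.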

Once corosync-keygen has finished, the authkey file appears in /etc/corosync/:

[root@nod1 corosync]# ls
authkey        corosync.conf.example       service.d
corosync.conf  corosync.conf.example.udpu  uidgid.d
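[root@nod1 corosync]#  scp authkey corosync.conf nod2.test.com:/etc/corosync/   # copy the key and the configuration file to the other node
[root@nod1 corosync]# service corosync start   # start the service; remember to start corosync on the other node as well

Verify that the corosync service started correctly. In a cluster, run these checks on every server.

Check that corosync started: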

[root@nod1 corosync]# grep -e "Corosync Cluster Engine" /var/log/cluster/corosync.log    # check that the corosync cluster engine started
Jul 19 21:45:48 corosync [MAIN  ] Corosync Cluster Engine ('1.4.7'): started and ready to provide service.
[root@nod1 corosync]# grep -e "configuration file" /var/log/cluster/corosync.log     # check that the corosync configuration file was loaded
Jul 19 21:45:48 corosync [MAIN  ] Successfully read main configuration file '/etc/corosync/corosync.conf'.

Check that the TOTEM interface defined above came up:

[root@nod1 corosync]# grep "TOTEM" /var/log/cluster/corosync.log  
Jul 19 21:45:48 corosync [TOTEM ] Initializing transport (UDP/IP Multicast).
Jul 19 21:45:48 corosync [TOTEM ] Initializing transmit/receive security: libtomcrypt SOBER128/SHA1HMAC (mode 0).
Jul 19 21:45:48 corosync [TOTEM ] The network interface [192.168.0.201] is now up.
Jul 19 21:45:48 corosync [TOTEM ] A processor joined or left the membership and a new membership was formed.

Check for errors during startup:

[root@nod1 corosync]# grep "ERROR" /var/log/cluster/corosync.log
Jul 19 21:45:48 corosync [pcmk  ] ERROR: process_ais_conf: You have configured a cluster using the Pacemaker plugin for Corosync. The plugin is not supported in this environment and will be removed very soon.
Jul 19 21:45:48 corosync [pcmk  ] ERROR: process_ais_conf:  Please see Chapter 8 of 'Clusters from Scratch' (http://www.clusterlabs.org/doc) for details on using Pacemaker with CMAN
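# these errors can be ignored: they only say that pacemaker is configured as a corosync plugin, which will no longer be supported in future releases

Check that pacemaker started: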

[root@nod1 corosync]# grep "pcmk_startup" /var/log/cluster/corosync.log
Jul 19 21:45:48 corosync [pcmk  ] info: pcmk_startup: CRM: Initialized
Jul 19 21:45:48 corosync [pcmk  ] Logging: Initialized pcmk_startup
Jul 19 21:45:48 corosync [pcmk  ] info: pcmk_startup: Maximum core file size is: 18446744073709551615
Jul 19 21:45:48 corosync [pcmk  ] info: pcmk_startup: Service: 9
Jul 19 21:45:48 corosync [pcmk  ] info: pcmk_startup: Local hostname: nod1.test.com
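Because these checks have to be repeated on every node, they can be bundled into one small script. This is only a sketch: the patterns are taken from the grep commands and log lines above, the function and key names are invented for the example, and the sample text stands in for the contents of /var/log/cluster/corosync.log.

```python
import re

# One regular expression per check, based on the log lines shown above.
CHECKS = {
    "engine_started": r"Corosync Cluster Engine .* started and ready",
    "config_loaded": r"Successfully read main configuration file",
    "interface_up": r"\[TOTEM \] The network interface \[[^\]]+\] is now up",
    "pacemaker_started": r"pcmk_startup: CRM: Initialized",
}


def check_log(text):
    """Return a dict mapping each check name to True or False."""
    return {name: re.search(pattern, text) is not None
            for name, pattern in CHECKS.items()}


SAMPLE_LOG = """\
Jul 19 21:45:48 corosync [MAIN  ] Corosync Cluster Engine ('1.4.7'): started and ready to provide service.
Jul 19 21:45:48 corosync [MAIN  ] Successfully read main configuration file '/etc/corosync/corosync.conf'.
Jul 19 21:45:48 corosync [TOTEM ] The network interface [192.168.0.201] is now up.
Jul 19 21:45:48 corosync [pcmk  ] info: pcmk_startup: CRM: Initialized
"""

for name, ok in check_log(SAMPLE_LOG).items():
    print(f"{name}: {'ok' if ok else 'MISSING'}")
```

On a real node, read the log file and pass its contents to check_log instead of SAMPLE_LOG.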

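3. Using the crmsh interface

pacemaker has two configuration interfaces: crmsh and pcs. This post uses crmsh.

crmsh depends on the pssh package, so both packages must be installed on every cluster node. They can be downloaded from http://crmsh.github.io/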

[root@nod1 ~]# ls
crmsh-2.1-1.6.x86_64.rpm pssh-2.3.1-2.el6.x86_64.rpm
[root@nod1 ~]# yum install crmsh-2.1-1.6.x86_64.rpm pssh-2.3.1-2.el6.x86_64.rpm

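The crm command of crmsh works in two modes. In command mode it runs a single command and prints the result to the shell's standard output. The other is interactive mode. Plenty of examples of both follow.

Using the crm command:

[root@nod1 ~]# crm      # running crm on its own enters interactive mode
crm(live)#               
crm(live)# help    # show the help text listing the subcommands crm supports

Commonly used crmsh subcommands: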

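status: show the cluster status

configure: configure the cluster

node: manage node state

ra: look up resource agents and their parameters

resource: manage resources, for example stopping a resource or cleaning up its current state (such as recorded errors)

First, take a look at the cluster status: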

[root@nod1 ~]# crm
crm(live)# status
Last updated: Tue Jul 21 21:21:35 2015
Last change: Sun Jul 19 23:01:34 2015
Stack: classic openais (with plugin)             # pacemaker runs as a plugin loaded by corosync (openais)
Current DC: nod1.test.com - partition with quorum    # DC stands for Designated Coordinator; nod1.test.com is currently the cluster's transaction coordinator, and "partition with quorum" means the current partition holds a quorum of votes
Version: 1.1.11-97629de
2 Nodes configured, 2 expected votes   # two nodes are configured and two votes are expected
0 Resources configured   # no cluster resources are configured yet
Online: [ nod1.test.com nod2.test.com ]   # both nodes are online
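The fields that matter most in this output are the DC, the quorum state and the node lists. As an illustration, the sketch below pulls them out of the text printed by crm status. It assumes the output layout shown in this post (other pacemaker versions may print it differently), and parse_status is a name invented for this example.

```python
import re


def parse_status(text):
    """Extract DC, quorum state and node lists from `crm status` output."""
    info = {"dc": None, "quorum": None, "online": [], "offline": []}
    for raw in text.splitlines():
        line = raw.strip()
        m = re.match(r"Current DC: (\S+) - partition (with|WITHOUT) quorum", line)
        if m:
            info["dc"] = m.group(1)
            info["quorum"] = m.group(2) == "with"
            continue
        m = re.match(r"(Online|OFFLINE): \[ (.*?) \]", line)
        if m:
            info[m.group(1).lower()] = m.group(2).split()
    return info


SAMPLE_STATUS = """\
Last updated: Tue Jul 21 21:21:35 2015
Stack: classic openais (with plugin)
Current DC: nod1.test.com - partition with quorum
2 Nodes configured, 2 expected votes
0 Resources configured
Online: [ nod1.test.com nod2.test.com ]
"""

print(parse_status(SAMPLE_STATUS))
```

A monitoring script could feed the output of crm status into parse_status and raise an alert when quorum is False or the offline list is not empty.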

View the cluster's default configuration:

[root@nod1 ~]# crm
crm(live)# configure
crm(live)configure# show    # show prints the current cluster configuration; "show xml" prints it in XML format
node nod1.test.com
node nod2.test.com
property cib-bootstrap-options: \
 dc-version=1.1.11-97629de \
 cluster-infrastructure="classic openais (with plugin)" \
 expected-quorum-votes=2 \
 stonith-enabled=true \
 no-quorum-policy=stop \
 last-lrm-refresh=1436887216
crm(live)configure# verify          # verify checks the configuration for errors
   error: unpack_resources: Resource start-up disabled since no STONITH resources have been defined
   error: unpack_resources: Either configure some or disable STONITH with the stonith-enabled option
   error: unpack_resources: NOTE: Clusters with shared data need STONITH to ensure data integrity
Errors found during check: config not valid
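# the errors say that no STONITH device has been defined, which a corosync + pacemaker cluster does not accept by default; the check can be switched off, as shown next

Use the property subcommand to set cluster-wide properties: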

[root@nod1 ~]# crm
crm(live)configure# property    # crmsh supports tab completion; type property and press Tab twice to list the configurable parameters
batch-limit=                   maintenance-mode=              remove-after-stop=
cluster-delay=                 migration-limit=               shutdown-escalation=
cluster-recheck-interval=      no-quorum-policy=              start-failure-is-fatal=
crmd-transition-delay=         node-action-limit=             startup-fencing=
dc-deadtime=                   node-health-green=             stonith-action=
default-action-timeout=        node-health-red=               stonith-enabled=
default-resource-stickiness=   node-health-strategy=          stonith-timeout=
election-timeout=              node-health-yellow=            stop-all-resources=
enable-acl=                    pe-error-series-max=           stop-orphan-actions=
enable-startup-probes=         pe-input-series-max=           stop-orphan-resources=
is-managed-default=            pe-warn-series-max=            symmetric-cluster=
load-threshold=                placement-strategy=            
crm(live)configure# property stonith-enabled=false   # disable STONITH support; otherwise a STONITH device has to be defined before the cluster will run any resources
crm(live)configure# show
node nod1.test.com
node nod2.test.com
property cib-bootstrap-options: \
 dc-version=1.1.11-97629de \
 cluster-infrastructure="classic openais (with plugin)" \
 expected-quorum-votes=2 \
 stonith-enabled=false \      # now set to false
 no-quorum-policy=stop \
 last-lrm-refresh=1436887216
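crm(live)configure# verify   # verification now passes without errors
crm(live)configure# commit   # commit the configuration

Configuring cluster resources

Details about a resource come from the ra (resource agent) level. For example, to define a virtual IP resource: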

[root@nod1 ~]# crm
crm(live)# ra
crm(live)ra# classes    # list the resource agent classes
lsb
ocf / heartbeat pacemaker
service
stonith
crm(live)ra# list ocf    # list the resource agents in the ocf class; IPaddr, the agent for configuring an IP address, is among them
CTDB               ClusterMon         Delay              Dummy              Filesystem         HealthCPU          HealthSMART
IPaddr             IPaddr2            IPsrcaddr          LVM                MailTo             Route              SendArp
Squid              Stateful           SysInfo            SystemHealth       VirtualDomain      Xinetd             apache
conntrackd         controld           db2                dhcpd              ethmonitor         exportfs           iSCSILogicalUnit
mysql              named              nfsnotify          nfsserver          pgsql              ping               pingd
postfix            remote             rsyncd             symlink            tomcat 
crm(live)ra# meta ocf:IPaddr   # meta prints the details of a resource agent, i.e. its usage help

Primitive resources are defined with the primitive command:

[root@nod1 ~]# crm
crm(live)# configure
crm(live)configure# primitive webip ocf:IPaddr  params ip=192.168.0.100
crm(live)configure# verify
crm(live)configure# commit   # once the commit succeeds, the resource takes effect
crm(live)configure# cd ..
crm(live)# status
Last updated: Tue Jul 21 22:14:43 2015
Last change: Tue Jul 21 22:12:44 2015
Stack: classic openais (with plugin)
Current DC: nod1.test.com - partition with quorum
Version: 1.1.11-97629de
2 Nodes configured, 2 expected votes
1 Resources configured
Online: [ nod1.test.com nod2.test.com ]
 webip(ocf::heartbeat:IPaddr):Started nod1.test.com    # the resource we just defined, started on nod1.test.com
[root@nod1 ~]# ip add show
1: lo: <LOOPBACK,UP,LOWER_UP> mtu 16436 qdisc noqueue state UNKNOWN
    link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00
    inet 127.0.0.1/8 scope host lo
    inet6 ::1/128 scope host
       valid_lft forever preferred_lft forever
2: eth0: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc pfifo_fast state UNKNOWN qlen 1000
    link/ether 00:0c:29:07:89:fe brd ff:ff:ff:ff:ff:ff
    inet 192.168.0.201/24 brd 192.168.0.255 scope global eth0
    inet 192.168.0.100/24 brd 192.168.0.255 scope global secondary eth0
    inet6 fe80::20c:29ff:fe07:89fe/64 scope link
       valid_lft forever preferred_lft forever
# the IP address we defined is now active

Define nginx as a service resource:

[root@nod1 ~]# crm
crm(live)# configure
crm(live)configure# primitive nginx lsb:nginx   # the nginx service is provided by a resource agent in the lsb class; the first nginx after primitive is the name given to the cluster resource
crm(live)configure# verify
crm(live)configure# commit
crm(live)configure# cd ..
crm(live)# status
Last updated: Tue Jul 21 22:25:00 2015
Last change: Tue Jul 21 22:24:58 2015
Stack: classic openais (with plugin)
Current DC: nod1.test.com - partition with quorum
Version: 1.1.11-97629de
2 Nodes configured, 2 expected votes
2 Resources configured
Online: [ nod1.test.com nod2.test.com ]
 webip(ocf::heartbeat:IPaddr):Started nod1.test.com
 nginx(lsb:nginx):Started nod2.test.com     
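# nginx was started on nod2.test.com, which shows that a high-availability cluster spreads resources across its nodes where it can; in practice we want webip and nginx to run on the same node.

To keep several resources on the same node, either put them in a group or define a colocation constraint: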

[root@nod1 ~]# crm
crm(live)# configure
crm(live)configure# group webservice webip nginx
crm(live)configure# verify
crm(live)configure# commit
crm(live)configure# cd ..
crm(live)# status
Last updated: Tue Jul 21 22:30:19 2015
Last change: Tue Jul 21 22:30:17 2015
Stack: classic openais (with plugin)
Current DC: nod1.test.com - partition with quorum
Version: 1.1.11-97629de
2 Nodes configured, 2 expected votes
2 Resources configured
Online: [ nod1.test.com nod2.test.com ]
 Resource Group: webservice
     webip(ocf::heartbeat:IPaddr):Started nod1.test.com
     nginx(lsb:nginx):Started nod1.test.com
# both resources now run on nod1.test.com

Next, verify that the resources can move to the other node:

[root@nod1 ~]# crm node standby   # put the current node into standby
[root@nod1 ~]# crm status
Last updated: Tue Jul 21 22:37:14 2015
Last change: Tue Jul 21 22:37:09 2015
Stack: classic openais (with plugin)
Current DC: nod1.test.com - partition with quorum
Version: 1.1.11-97629de
2 Nodes configured, 2 expected votes
2 Resources configured
Node nod1.test.com: standby
Online: [ nod2.test.com ]
 Resource Group: webservice
     webip(ocf::heartbeat:IPaddr):Started nod2.test.com
     nginx(lsb:nginx):Started nod2.test.com
# the resources in the webservice group have moved to nod2.test.com

Bring nod1.test.com back online and see whether the resources move back:

[root@nod1 ~]# crm node online  # bring the current node back online
You have new mail in /var/spool/mail/root
[root@nod1 ~]# crm status
Last updated: Tue Jul 21 22:38:37 2015
Last change: Tue Jul 21 22:38:33 2015
Stack: classic openais (with plugin)
Current DC: nod1.test.com - partition with quorum
Version: 1.1.11-97629de
2 Nodes configured, 2 expected votes
2 Resources configured
Online: [ nod1.test.com nod2.test.com ]
 Resource Group: webservice
     webip(ocf::heartbeat:IPaddr):Started nod2.test.com
     nginx(lsb:nginx):Started nod2.test.com
# the webservice group did not move back to nod1.test.com because no node preference has been defined for the group

If the corosync service on nod2.test.com is stopped now, will the resources in the webservice group move to nod1.test.com? Test it:

[root@nod2 ~]# service corosync stop
Signaling Corosync Cluster Engine (corosync) to terminate: [  OK  ]
Waiting for corosync services to unload:.                  [  OK  ]
You have new mail in /var/spool/mail/root
Check the cluster status on nod1.test.com:
[root@nod1 ~]# crm status
Last updated: Tue Jul 21 22:43:27 2015
Last change: Tue Jul 21 22:38:33 2015
Stack: classic openais (with plugin)
Current DC: nod1.test.com - partition WITHOUT quorum
Version: 1.1.11-97629de
2 Nodes configured, 2 expected votes
2 Resources configured
Online: [ nod1.test.com ]
OFFLINE: [ nod2.test.com ]

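The output shows that the resources did not move. Why? Look closely at "Current DC: nod1.test.com - partition WITHOUT quorum": the current partition does not hold a quorum of votes, so the node will not run resources and nothing is taken over. There are several ways to solve this: add a ping node, add a quorum disk, use an odd number of cluster nodes, or simply tell the cluster to ignore the loss of quorum. The fourth option is the simplest:

[root@nod2 ~]# service corosync start       # first start corosync on nod2.test.com again
Starting Corosync Cluster Engine (corosync):               [  OK  ]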
[root@nod1 ~]# crm
crm(live)# status
Last updated: Tue Jul 21 22:50:08 2015
Last change: Tue Jul 21 22:38:33 2015
Stack: classic openais (with plugin)
Current DC: nod1.test.com - partition with quorum
Version: 1.1.11-97629de
2 Nodes configured, 2 expected votes
2 Resources configured
Online: [ nod1.test.com nod2.test.com ]
 Resource Group: webservice
     webip(ocf::heartbeat:IPaddr):Started nod1.test.com
     nginx(lsb:nginx):Started nod1.test.com
crm(live)configure# property no   # press Tab twice to list the configurable parameters that start with no
no-quorum-policy=      node-health-green=     node-health-strategy=
node-action-limit=     node-health-red=       node-health-yellow=
crm(live)configure# property no-quorum-policy=       # type "no-quorum-policy=" and press Tab twice to show its help text
no-quorum-policy (enum, [stop]): What to do when the cluster does not have quorum
    What to do when the cluster does not have quorum  Allowed values: stop, freeze, ignore, suicide
crm(live)configure# property no-quorum-policy=ignore   # set the value to "ignore"
crm(live)configure# verify
crm(live)configure# commit
crm(live)configure# show    # show the current configuration
node nod1.test.com \
 attributes standby=off
node nod2.test.com
primitive nginx lsb:nginx
primitive webip IPaddr \
 params ip=192.168.0.100
group webservice webip nginx
property cib-bootstrap-options: \
 dc-version=1.1.11-97629de \
 cluster-infrastructure="classic openais (with plugin)" \
 expected-quorum-votes=2 \
 stonith-enabled=false \
 no-quorum-policy=ignore \
 last-lrm-refresh=1436887216
crm(live)# status
Last updated: Tue Jul 21 22:54:00 2015
Last change: Tue Jul 21 22:51:10 2015
Stack: classic openais (with plugin)
Current DC: nod1.test.com - partition with quorum
Version: 1.1.11-97629de
2 Nodes configured, 2 expected votes
2 Resources configured
Online: [ nod1.test.com nod2.test.com ]
 Resource Group: webservice
     webip(ocf::heartbeat:IPaddr):Started nod1.test.com
     nginx(lsb:nginx):Started nod1.test.com
# the resources are currently running on nod1.test.com

Stop corosync on nod1.test.com and see whether the resources move to nod2.test.com:

[root@nod1 ~]# service corosync stop
Signaling Corosync Cluster Engine (corosync) to terminate: [  OK  ]
Waiting for corosync services to unload:.                  [  OK  ]
[root@nod2 ~]# crm   # enter the crm shell on nod2.test.com
crm(live)# status
Last updated: Tue Jul 21 22:56:52 2015
Last change: Tue Jul 21 22:52:25 2015
Stack: classic openais (with plugin)
Current DC: nod2.test.com - partition WITHOUT quorum
Version: 1.1.11-97629de
2 Nodes configured, 2 expected votes
2 Resources configured
Online: [ nod2.test.com ]
OFFLINE: [ nod1.test.com ]
 Resource Group: webservice
     webip(ocf::heartbeat:IPaddr):Started nod2.test.com
     nginx(lsb:nginx):Started nod2.test.com
# the resources moved to nod2.test.com, so a two-node high-availability cluster needs "no-quorum-policy=ignore" to keep working when a partition holds no more than half of the votes
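The quorum arithmetic behind these two tests is easy to reproduce. The sketch below is a simplified model, not pacemaker's implementation: it assumes one vote per node and only answers whether a partition may take over resources, and the function names are made up for the illustration.

```python
def has_quorum(live_votes, expected_votes):
    """A partition has quorum when it holds more than half of the expected votes."""
    return live_votes > expected_votes / 2


def may_take_over(live_votes, expected_votes, no_quorum_policy="stop"):
    """Simplified rule: without quorum, only the 'ignore' policy lets the
    partition start resources it was not already running."""
    if has_quorum(live_votes, expected_votes):
        return True
    return no_quorum_policy == "ignore"


# Two-node cluster with one node down: 1 vote out of 2 is not a majority.
print(has_quorum(1, 2))               # False
print(may_take_over(1, 2, "stop"))    # False
print(may_take_over(1, 2, "ignore"))  # True
# Three-node cluster with one node down keeps quorum without any workaround.
print(has_quorum(2, 3))               # True
```

This is also why an odd number of nodes was listed as one of the options above: losing a single node then still leaves a majority.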

What if we kill the nginx process on nod2.test.com: will the cluster move the resources to nod1.test.com? Test it:

[root@nod1 ~]# service corosync start   # first start corosync on nod1.test.com again
Starting Corosync Cluster Engine (corosync):               [  OK  ]
[root@nod1 ~]# crm status
Last updated: Wed Jul 22 22:22:56 2015
Last change: Wed Jul 22 22:19:55 2015
Stack: classic openais (with plugin)
Current DC: nod2.test.com - partition with quorum
Version: 1.1.11-97629de
2 Nodes configured, 2 expected votes
2 Resources configured
Online: [ nod1.test.com nod2.test.com ]
 Resource Group: webservice
     webip(ocf::heartbeat:IPaddr):Started nod2.test.com
     nginx(lsb:nginx):Started nod2.test.com
Switch to nod2.test.com and kill the nginx process:
[root@nod2 ~]# pgrep nginx
1798
1799
[root@nod2 ~]# killall nginx   # kill the nginx processes
[root@nod2 ~]# pgrep nginx  # check that nginx is gone; no output means no nginx process is left
[root@nod2 ~]# crm status
Last updated: Wed Jul 22 22:26:09 2015
Last change: Wed Jul 22 22:19:55 2015
Stack: classic openais (with plugin)
Current DC: nod2.test.com - partition with quorum
Version: 1.1.11-97629de
2 Nodes configured, 2 expected votes
2 Resources configured
Online: [ nod1.test.com nod2.test.com ]
 Resource Group: webservice
     webip(ocf::heartbeat:IPaddr):Started nod2.test.com
     nginx(lsb:nginx):Started nod2.test.com

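The status output still shows the resources as started on nod2.test.com even though nginx is dead, which is unacceptable in production. The cluster has to monitor the resources we define: when it finds that a resource is gone, it tries to restart it, and if the restart fails it moves the resource elsewhere. The following shows how to define monitoring.

[root@nod2 ~]# service nginx start    # first start the nginx we killed above
Starting nginx:                                            [  OK  ]

Monitoring is defined together with the resource, in the primitive command. So we first delete the resources defined earlier and then define them again: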

[root@nod1 ~]# crm
crm(live)# resource
crm(live)resource# show
 Resource Group: webservice
     webip(ocf::heartbeat:IPaddr):Started
     nginx(lsb:nginx):Started 
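# the resource level shows the resources configured in the cluster; both are in the Started state
crm(live)resource# stop webservice   # stop every resource in the webservice group; a resource must be in the Stopped state before it can be deleted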
crm(live)resource# show
 Resource Group: webservice
     webip(ocf::heartbeat:IPaddr):Stopped
     nginx(lsb:nginx):Stopped 
crm(live)resource# cd ..
crm(live)# configure
crm(live)configure# edit           # edit opens the resource configuration in the vi editor, as shown below
node nod1.test.com \
        attributes standby=on
node nod2.test.com
primitive nginx lsb:nginx      # a resource we defined; delete it
primitive webip IPaddr \      # a resource we defined; delete it
        params ip=192.168.0.100
group webservice webip nginx \     # a resource we defined; delete it
        meta target-role=Stopped
property cib-bootstrap-options: \
        dc-version=1.1.11-97629de \
        cluster-infrastructure="classic openais (with plugin)" \
        expected-quorum-votes=2 \
        stonith-enabled=false \
        no-quorum-policy=ignore \
        last-lrm-refresh=1436887216
#vim:set syntax=pcmk

Delete the resources we defined in the editor window, then save and quit. What remains is:

node nod1.test.com \
        attributes standby=on
node nod2.test.com
property cib-bootstrap-options: \
        dc-version=1.1.11-97629de \
        cluster-infrastructure="classic openais (with plugin)" \
        expected-quorum-votes=2 \
        stonith-enabled=false \
        no-quorum-policy=ignore \
        last-lrm-refresh=1436887216
#vim:set syntax=pcmk
crm(live)configure# verify     # check the syntax
crm(live)configure# commit   # commit the configuration
crm(live)configure# cd    # return to the top level
crm(live)# status   # show the cluster status
Last updated: Wed Jul 22 21:33:07 2015
Last change: Wed Jul 22 21:31:45 2015
Stack: classic openais (with plugin)
Current DC: nod2.test.com - partition with quorum
Version: 1.1.11-97629de
2 Nodes configured, 2 expected votes
0 Resources configured
Online: [ nod1.test.com nod2.test.com ]

The status output shows that our resources are gone. Now define them again, this time with monitoring:

crm(live)configure# primitive webip ocf:IPaddr params ip=192.168.0.100 op monitor timeout=20s interval=60s
crm(live)configure# primitive webserver lsb:nginx op monitor timeout=20s interval=60s
crm(live)configure# group webservice webip webserver
crm(live)configure# verify
crm(live)configure# commit
crm(live)configure# cd
crm(live)# status
Last updated: Wed Jul 22 22:29:59 2015
Last change: Wed Jul 22 22:28:01 2015
Stack: classic openais (with plugin)
Current DC: nod2.test.com - partition with quorum
Version: 1.1.11-97629de
2 Nodes configured, 2 expected votes
2 Resources configured
Online: [ nod1.test.com nod2.test.com ]
 Resource Group: webservice
     webip(ocf::heartbeat:IPaddr):Started nod1.test.com
     webserver(lsb:nginx):Started nod1.test.com

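The monitored resources are now defined. The meaning of the monitoring parameters used above can be looked up with a command such as "crm(live)ra# meta ocf:IPaddr". Now kill nginx on nod1.test.com and watch what happens: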

[root@nod1 ~]# pgrep nginx
3056
3063
[root@nod1 ~]# killall nginx
[root@nod1 ~]# pgrep nginx
[root@nod1 ~]# pgrep nginx
[root@nod1 ~]# pgrep nginx   # after a few tens of seconds nginx has been started again
3337
3338

Look at the cluster status again:

[root@nod1 ~]# crm status
Last updated: Wed Jul 22 22:33:29 2015
Last change: Wed Jul 22 22:28:01 2015
Stack: classic openais (with plugin)
Current DC: nod2.test.com - partition with quorum
Version: 1.1.11-97629de
2 Nodes configured, 2 expected votes
2 Resources configured
Online: [ nod1.test.com nod2.test.com ]
 Resource Group: webservice
     webip(ocf::heartbeat:IPaddr):Started nod1.test.com
     webserver(lsb:nginx):Started nod1.test.com
Failed actions:
    webserver_monitor_60000 on nod1.test.com 'not running' (7): call=23, status=complete, last-rc-change='Wed Jul 22 22:32:02 2015', queued=0ms, exec=0ms    # the monitor found that the webserver resource was not running

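What happens if nginx cannot be started again after it is killed? To test this, kill nginx and immediately edit its configuration file, appending a junk line so that it no longer passes the syntax check and nginx cannot start: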

[root@nod1 ~]# killall nginx
[root@nod1 ~]# echo "test" >> /etc/nginx/nginx.conf
[root@nod1 ~]# nginx -t
nginx: [emerg] unexpected end of file, expecting ";" or "}" in /etc/nginx/nginx.conf:44
nginx: configuration file /etc/nginx/nginx.conf test failed
[root@nod1 ~]# crm status
Last updated: Wed Jul 22 22:37:42 2015
Last change: Wed Jul 22 22:28:01 2015
Stack: classic openais (with plugin)
Current DC: nod2.test.com - partition with quorum
Version: 1.1.11-97629de
2 Nodes configured, 2 expected votes
2 Resources configured
Online: [ nod1.test.com nod2.test.com ]
 Resource Group: webservice
     webip(ocf::heartbeat:IPaddr):Started nod2.test.com    # the resources have moved to nod2.test.com
     webserver(lsb:nginx):Started nod2.test.com
Failed actions:
    webserver_start_0 on nod1.test.com 'unknown error' (1): call=30, status=complete, last-rc-change='Wed Jul 22 22:37:02 2015', queued=0ms, exec=70ms   # the failed start on nod1.test.com is reported as an unknown error

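These two tests show that the cluster monitors its resources, tries to restart a resource when it becomes unavailable, and moves it to another node if the restart fails. Remember to restore the nginx configuration on nod1.test.com when you are done.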
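The "Failed actions" entries in the two status outputs carry the resource name, the operation, its interval in milliseconds, the node and the return code. A small parser makes that structure explicit. This is a sketch that assumes the line format printed by this pacemaker version; parse_failed_action is a hypothetical helper, not part of crmsh.

```python
import re

# <resource>_<operation>_<interval in ms> on <node> '<reason>' (<return code>)
FAILED_ACTION = re.compile(
    r"(?P<resource>\S+)_(?P<op>[a-z]+)_(?P<interval_ms>\d+) on (?P<node>\S+) "
    r"'(?P<reason>[^']*)' \((?P<rc>\d+)\)"
)


def parse_failed_action(line):
    """Return the fields of one 'Failed actions' line, or None if it does not match."""
    m = FAILED_ACTION.search(line)
    if m is None:
        return None
    fields = m.groupdict()
    fields["interval_ms"] = int(fields["interval_ms"])
    fields["rc"] = int(fields["rc"])
    return fields


monitor_failure = parse_failed_action(
    "    webserver_monitor_60000 on nod1.test.com 'not running' (7): call=23, status=complete"
)
start_failure = parse_failed_action(
    "    webserver_start_0 on nod1.test.com 'unknown error' (1): call=30, status=complete"
)
print(monitor_failure)
print(start_failure)
```

In the first test the failing operation is the 60-second monitor (interval 60000 ms), after which nginx was restarted in place. In the second it is the start itself, which is what led the cluster to move the resources to the other node.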

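4. Resource constraints

Resource constraints express that a resource should run on a particular node, or that certain resources should stay together, without using a group.

Continuing the experiment above, we want webip and webserver to always stay together without defining the webservice group. Proceed as follows: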

[root@nod1 ~]# crm
crm(live)# status
Last updated: Wed Jul 22 22:46:26 2015
Last change: Wed Jul 22 22:28:01 2015
Stack: classic openais (with plugin)
Current DC: nod2.test.com - partition with quorum
Version: 1.1.11-97629de
2 Nodes configured, 2 expected votes
2 Resources configured
Online: [ nod1.test.com nod2.test.com ]
 Resource Group: webservice
     webip(ocf::heartbeat:IPaddr):Started nod2.test.com
     webserver(lsb:nginx):Started nod2.test.com
Failed actions:
    webserver_start_0 on nod1.test.com 'unknown error' (1): call=30, status=complete, last-rc-change='Wed Jul 22 22:37:02 2015', queued=0ms, exec=70ms

First clean up the error reported for the resource above:

[root@nod1 ~]# crm 
crm(live)# resource
crm(live)resource# cleanup webserver   # clean up the recorded state of the resource
Cleaning up webserver on nod1.test.com
Cleaning up webserver on nod2.test.com
Waiting for 2 replies from the CRMd.. OK
crm(live)resource# cd
crm(live)# status
Last updated: Wed Jul 22 22:47:53 2015
Last change: Wed Jul 22 22:47:47 2015
Stack: classic openais (with plugin)
Current DC: nod2.test.com - partition with quorum
Version: 1.1.11-97629de
2 Nodes configured, 2 expected votes
2 Resources configured
Online: [ nod1.test.com nod2.test.com ]
 Resource Group: webservice
     webip(ocf::heartbeat:IPaddr):Started nod2.test.com
     webserver(lsb:nginx):Started nod2.test.com

Next, delete the webservice group:

[root@nod1 ~]# crm 
crm(live)# resource
crm(live)resource# status
 Resource Group: webservice
     webip(ocf::heartbeat:IPaddr):Started
     webserver(lsb:nginx):Started
crm(live)configure# delete webservice   # delete the group
crm(live)configure# verify
crm(live)configure# commit
crm(live)# status
Last updated: Wed Jul 22 23:00:13 2015
Last change: Wed Jul 22 23:00:09 2015
Stack: classic openais (with plugin)
Current DC: nod2.test.com - partition with quorum
Version: 1.1.11-97629de
2 Nodes configured, 2 expected votes
2 Resources configured
Online: [ nod1.test.com nod2.test.com ]
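 webip(ocf::heartbeat:IPaddr):Started nod1.test.com   # with the group gone, the cluster spreads the two resources across the nodes
 webserver(lsb:nginx):Started nod2.test.com    # webserver runs on nod2.test.com

4.1. Defining a colocation constraint (colocation)

A colocation constraint defines whether two resources must run together: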

[root@nod1 ~]# crm 
crm(live)# configure
crm(live)configure# help colocation   # show the help for colocation
crm(live)configure# colocation webserver_with_webip inf: webserver webip  # the score for keeping webserver with webip is positive infinity, i.e. the two resources must run together
crm(live)configure# show xml   # view the constraint we defined
crm(live)configure# verify
crm(live)configure# commit
crm(live)configure# cd ..
crm(live)# status
Last updated: Wed Jul 22 23:09:11 2015
Last change: Wed Jul 22 23:09:08 2015
Stack: classic openais (with plugin)
Current DC: nod2.test.com - partition with quorum
Version: 1.1.11-97629de
2 Nodes configured, 2 expected votes
2 Resources configured
Online: [ nod1.test.com nod2.test.com ]
 webip(ocf::heartbeat:IPaddr):Started nod1.test.com   # both resources run on nod1.test.com again
 webserver(lsb:nginx):Started nod1.test.com

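4.2. Defining an order constraint (order)

An order constraint makes resources start in a given order and stop in the reverse order:

[root@nod1 ~]# crm
crm(live)configure# help order   # show the help
crm(live)configure# order webip_before_webserver mandatory: webip webserver  # webip starts before webserver; see the help for details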
crm(live)configure# verify
crm(live)configure# commit
crm(live)configure# show xml  # view the details of the definition
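Taken together, the colocation and order constraints reproduce what the webservice group did earlier: members stay on one node and start in the listed order. The sketch below generates the equivalent crm configure lines for a chain of resources. The function name and the generated constraint ids are illustrative choices, and the syntax simply mirrors the two commands used above.

```python
def constraints_for_chain(resources):
    """Build colocation and order lines for resources listed in start order."""
    lines = []
    for first, second in zip(resources, resources[1:]):
        # The later resource must run where the earlier one runs...
        lines.append(f"colocation {second}_with_{first} inf: {second} {first}")
        # ...and the earlier one must be started first.
        lines.append(f"order {first}_before_{second} mandatory: {first} {second}")
    return lines


for line in constraints_for_chain(["webip", "webserver"]):
    print(line)
```

For the two resources in this post the output matches the colocation and order commands entered above. For longer chains a group is usually the shorter way to write the same thing.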

4.3. Defining a location constraint (location)

A location constraint expresses which node a resource prefers to run on.

[root@nod1 ~]# crm
crm(live)# status
Last updated: Wed Jul 22 23:20:08 2015
Last change: Wed Jul 22 23:15:39 2015
Stack: classic openais (with plugin)
Current DC: nod2.test.com - partition with quorum
Version: 1.1.11-97629de
2 Nodes configured, 2 expected votes
2 Resources configured
Online: [ nod1.test.com nod2.test.com ]
 webip(ocf::heartbeat:IPaddr):Started nod1.test.com  # the resources currently run on nod1.test.com
 webserver(lsb:nginx):Started nod1.test.com

Define a location constraint so that the resource prefers nod2.test.com:

[root@nod1 ~]# crm
crm(live)# configure
crm(live)configure# help location  # show the help
crm(live)configure# location webip_on_nod2 webip inf: nod2.test.com  # the preference of webip for nod2.test.com is positive infinity
crm(live)configure# verify
crm(live)configure# commit
crm(live)configure# cd
crm(live)# status
Last updated: Wed Jul 22 23:23:21 2015
Last change: Wed Jul 22 23:22:50 2015
Stack: classic openais (with plugin)
Current DC: nod2.test.com - partition with quorum
Version: 1.1.11-97629de
2 Nodes configured, 2 expected votes
2 Resources configured
Online: [ nod1.test.com nod2.test.com ]
 webip(ocf::heartbeat:IPaddr):Started nod2.test.com   
 webserver(lsb:nginx):Started nod2.test.com

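Both webip and webserver have now moved to nod2.test.com. But we never defined a location constraint for webserver, so why did it move as well? Because we defined a colocation constraint between webip and webserver with a score of inf (positive infinity), webserver runs wherever webip runs.

A location constraint can also be written in another form, using a rule:

[root@nod1 ~]# crm
crm(live)# configure
crm(live)configure# delete webip_on_nod2   #first delete the location constraint defined above
crm(live)configure# verify
crm(live)configure# commit
crm(live)configure# location webip_on_nod1 webip rule inf: #uname eq nod1.test.com  #the preference of webip for the host named nod1.test.com is positive infinity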
crm(live)configure# verify
crm(live)configure# commit
crm(live)configure# cd
crm(live)# status
Last updated: Wed Jul 22 23:33:38 2015
Last change: Wed Jul 22 23:33:18 2015
Stack: classic openais (with plugin)
Current DC: nod2.test.com - partition with quorum
Version: 1.1.11-97629de
2 Nodes configured, 2 expected votes
2 Resources configured
Online: [ nod1.test.com nod2.test.com ]
 webip(ocf::heartbeat:IPaddr):Started nod1.test.com
 webserver(lsb:nginx):Started nod1.test.com
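#both resources have moved back to the nod1.test.com node

Next, define one more location constraint:

crm(live)configure# location webserver_not_on_nod1 webserver rule -inf: #uname eq nod1.test.com  #the score of webserver on nod1 is negative infinity, i.e. webserver must never run on nod1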
crm(live)configure# verify
crm(live)configure# commit
crm(live)configure# cd
crm(live)# status
Last updated: Wed Jul 22 23:41:25 2015
Last change: Wed Jul 22 23:41:19 2015
Stack: classic openais (with plugin)
Current DC: nod2.test.com - partition with quorum
Version: 1.1.11-97629de
2 Nodes configured, 2 expected votes
2 Resources configured
Online: [ nod1.test.com nod2.test.com ]
 webip(ocf::heartbeat:IPaddr):Started nod2.test.com
 webserver(lsb:nginx):Started nod2.test.com

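webip and webserver moved from nod1.test.com to nod2.test.com. Why? The score of webserver on nod1.test.com is negative infinity, while the preference of webip for nod1.test.com is positive infinity, and the colocation constraint forces the two resources onto the same node, so the two scores are combined. What does "inf+(-inf)" equal? The answer is "-inf", so the resources will never run on nod1.test.com.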
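
The score arithmetic can be made concrete with a small model. This is only a sketch of the rules Pacemaker documents for infinity (INFINITY is 1000000, anything plus -INFINITY is -INFINITY, and otherwise anything plus INFINITY is INFINITY); the function names `normalize` and `score_add` are made up for illustration and are not part of any cluster tool.

```shell
#!/usr/bin/env bash
# Toy model of Pacemaker score addition; not a cluster command.
INF=1000000

# Map the inf / -inf spellings used in crmsh to plain integers.
normalize() {
    case "$1" in
        inf|+inf|INFINITY|+INFINITY) echo "$INF" ;;
        -inf|-INFINITY)              echo "-$INF" ;;
        *)                           echo "$1" ;;
    esac
}

# Add two scores, applying the infinity rules.
score_add() {
    local a b sum
    a="$(normalize "$1")"
    b="$(normalize "$2")"
    # -INFINITY wins over everything, including +INFINITY.
    if [ "$a" -le "-$INF" ] || [ "$b" -le "-$INF" ]; then
        echo "-INFINITY"; return
    fi
    if [ "$a" -ge "$INF" ] || [ "$b" -ge "$INF" ]; then
        echo "INFINITY"; return
    fi
    sum=$((a + b))
    if [ "$sum" -ge "$INF" ]; then echo "INFINITY"
    elif [ "$sum" -le "-$INF" ]; then echo "-INFINITY"
    else echo "$sum"
    fi
}
```

With this model, `score_add inf -inf` prints `-INFINITY`, which matches the behaviour observed above: a single -inf on a node is enough to keep the colocated resources off it.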

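5、Case study

A high-availability cluster usually contains three kinds of resources: a virtual IP, a service, and shared storage. Below we add shared storage and walk through the complete setup. Because a new resource joins, the constraints change as well, so first delete the IP and service resources defined above and start again with all three resources. Deleting cluster resources is not repeated here; see the earlier steps.

Once the resources are deleted the cluster is clean again, as shown below: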

[root@nod1 ~]# crm
crm(live)# status
Last updated: Fri Jul 24 20:58:49 2015
Last change: Fri Jul 24 20:58:32 2015
Stack: classic openais (with plugin)
Current DC: nod1.test.com - partition with quorum
Version: 1.1.11-97629de
2 Nodes configured, 2 expected votes
0 Resources configured
Online: [ nod1.test.com nod2.test.com ]

Next, prepare the shared storage. Here the node nod0.test.com provides the NFS export:

[root@nod0 ~]# yum -y install nfs-utils
[root@nod0 ~]# vim /etc/exports
/web/htdocs     192.168.0.0/24(rw)
[root@nod0 ~]# mkdir -pv /web/htdocs
[root@nod0 ~]# vim /web/htdocs/index.html
[root@nod0 ~]# service rpcbind start
Starting rpcbind:                                          [  OK  ]
[root@nod0 ~]# service nfs start
Starting NFS services:                                     [  OK  ]
Starting NFS mountd:                                       [  OK  ]
Starting NFS daemon:                                       [  OK  ]
Starting RPC idmapd:                                       [  OK  ]
[root@nod0 ~]# vim /etc/exports
/web/htdocs     192.168.0.0/24(rw,no_root_squash)
[root@nod0 ~]# mkdir -pv /web/htdocs/
[root@nod0 ~]# echo "<h>NFS node</h>" > /web/htdocs/index.html    #this is the test page
[root@nod2 ~]# mount -t nfs 192.168.0.200:/web/htdocs /usr/share/nginx/html/    #the first NFS mount is slow, so mount it manually once beforehand

Then start nginx on nod2.test.com and check that the test page is reachable through that node's own IP, 192.168.0.202:

[root@nod2 ~]# service nginx start
Starting nginx:                                            [  OK  ]
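
The check itself can be scripted rather than done in a browser. A minimal sketch, assuming `curl` is installed on the machine running the test; `check_page` is a hypothetical helper, and the URL and expected text come from the test page created above.

```shell
#!/usr/bin/env bash
# Hypothetical helper: fetch a URL and confirm the body contains the expected text.
# Usage: check_page <url> <expected-text>
check_page() {
    local url="$1" expected="$2"
    # -f: fail on HTTP errors, -sS: quiet but still show errors, --max-time: give up after 5s
    if curl -fsS --max-time 5 "$url" | grep -q -- "$expected"; then
        echo "OK: $url serves the expected page"
    else
        echo "FAIL: $url did not return '$expected'"
        return 1
    fi
}
```

Running `check_page http://192.168.0.202/ "NFS node"` at this point should report OK if the NFS export is mounted under the nginx document root. The same helper can be pointed at the VIP later on.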

Once the test passes, stop nginx and unmount the shared storage:
[root@nod2 ~]# umount /usr/share/nginx/html/
You have new mail in /var/spool/mail/root
[root@nod2 ~]# service nginx stop
Stopping nginx:                                            [  OK  ]

Now define the resources of the high-availability cluster:

[root@nod1 ~]# crm
crm(live)# configure
crm(live)configure# primitive webip ocf:IPaddr params ip=192.168.0.100 op monitor timeout=10s interval=30s
crm(live)configure# primitive webserver lsb:nginx op monitor timeout=10s interval=30s
crm(live)configure# primitive webstore ocf:Filesystem params device="192.168.0.200:/web/htdocs" directory="/usr/share/nginx/html" fstype="nfs" op monitor timeout=30s interval=60s
crm(live)configure# verify  
WARNING: webip: specified timeout 10s for monitor is smaller than the advised 20s
WARNING: webserver: specified timeout 10s for monitor is smaller than the advised 15
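WARNING: webstore: default timeout 20s for start is smaller than the advised 60    #the NFS resource should define a timeout for start: the default is 20s, but 60s is advised
WARNING: webstore: default timeout 20s for stop is smaller than the advised 60  #likewise for stop: the default is 20s, but 60s is advised
WARNING: webstore: specified timeout 30s for monitor is smaller than the advised 40
verify reported the warnings above: the operation timeouts set on the resources are below the advised values. Adjust them as the messages suggest.
crm(live)configure# cd ..
There are changes pending. Do you want to commit them (y/n)? n    #do not commit here; alternatively, use the "edit" command to open the configuration in vi and fix it there
crm(live)# configure   #enter configure mode again and redefine the resources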
crm(live)configure# primitive webip ocf:IPaddr params ip=192.168.0.222 op monitor timeout=20s interval=30s
crm(live)configure# verify
crm(live)configure# primitive webserver lsb:nginx op monitor timeout=15s interval=30s
crm(live)configure# verify
crm(live)configure# primitive webstore ocf:Filesystem params device="192.168.0.200:/web/htdocs" directory="/usr/share/nginx/html" fstype="nfs" op monitor timeout=30s interval=60s op start timeout=60s op stop timeout=60s
crm(live)configure# verify
WARNING: webstore: specified timeout 30s for monitor is smaller than the advised 40   #one value is still too small
crm(live)configure# edit   #edit the configuration directly; after the change it looks like this
node nod1.test.com \
        attributes standby=off
node nod2.test.com \
        attributes standby=off
primitive webip IPaddr \
        params ip=192.168.0.222 \
        op monitor timeout=20s interval=30s
primitive webserver lsb:nginx \
        op monitor timeout=15s interval=30s
primitive webstore Filesystem \
        params device="192.168.0.200:/web/htdocs" directory="/usr/share/nginx/html" fstype=nfs \
        op monitor timeout=40s interval=60s \
        op start timeout=60s interval=0 \
        op stop timeout=60s interval=0
property cib-bootstrap-options: \
        dc-version=1.1.11-97629de \
        cluster-infrastructure="classic openais (with plugin)" \
        expected-quorum-votes=2 \
        stonith-enabled=false \
        no-quorum-policy=ignore \
        last-lrm-refresh=1437576541
#vim:set syntax=pcmk
#remember to save and quit
crm(live)configure# verify   #verification now passes without warnings
crm(live)configure# commit  #commit the configuration
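
The advised values that verify compares against come from each resource agent's metadata, so they can be looked up before a resource is defined. A sketch, assuming `crm ra info <agent>` prints a trailing section that starts with "Operations' defaults"; `advised_timeouts` is a hypothetical helper, and the exact output layout may differ between crmsh versions.

```shell
#!/usr/bin/env bash
# Hypothetical helper: show only the advised operation timeouts of a resource agent.
# Usage: advised_timeouts <class:provider:agent>
advised_timeouts() {
    # Print from the "Operations' defaults" heading to the end of the output.
    crm ra info "$1" | sed -n '/^Operations/,$p'
}
```

For example, `advised_timeouts ocf:heartbeat:Filesystem` would list the start, stop, and monitor timeouts that triggered the warnings above.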

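Next, define the constraints among the three resources. What constraints does a cluster with a VIP, a service, and shared storage need? First, in normal operation all three resources should run on the same node, and there are pairwise relationships among them: the VIP must be with the service (nginx), and the service (nginx) must be with the shared storage. These can be expressed with colocation constraints or with a group. Second, the start order: the VIP should start before the service, and the shared storage must be mounted before the service starts. Let's define these: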

crm(live)configure# group webservice webip webstore webserver   #define a group containing the three resources
crm(live)configure# order webip_before_webstore_before_webserver inf: webip webstore webserver  #order constraint: the three resources must (inf) start in the order webip, webstore, webserver, and stop in the reverse order
crm(live)configure# verify
crm(live)configure# show xml   #view the configuration as XML
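
For comparison, here is a sketch of the constraint-based alternative to the group. In Pacemaker a group already keeps its members on one node and starts them in the listed order, so the explicit constraints below are roughly what the group stands for. The helper `chain_constraints` is hypothetical; it only prints configure lines (it does not touch the cluster), and the constraint names simply follow the naming pattern used earlier.

```shell
#!/usr/bin/env bash
# Hypothetical helper: print colocation and order lines for resources
# given in start order. Paste the output at the crm configure prompt.
chain_constraints() {
    local prev="" rsc
    for rsc in "$@"; do
        if [ -n "$prev" ]; then
            # Each resource must run where its predecessor runs...
            echo "colocation ${rsc}_with_${prev} inf: ${rsc} ${prev}"
            # ...and may only start after its predecessor has started.
            echo "order ${prev}_before_${rsc} inf: ${prev} ${rsc}"
        fi
        prev="$rsc"
    done
}
```

Calling `chain_constraints webip webstore webserver` prints two colocation and two order lines covering the same relationships as the webservice group.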

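If the three resources have no preference for any particular node, you can commit right away. This is often the case today, when high-availability clusters are commonly deployed on virtualization platforms such as Xen, KVM, or OpenStack, where node preference matters much less.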

crm(live)configure# commit
crm(live)configure# cd ..
crm(live)# status
Last updated: Fri Jul 24 22:25:06 2015
Last change: Fri Jul 24 22:25:02 2015
Stack: classic openais (with plugin)
Current DC: nod1.test.com - partition with quorum
Version: 1.1.11-97629de
2 Nodes configured, 2 expected votes
3 Resources configured
Online: [ nod1.test.com nod2.test.com ]
 Resource Group: webservice
     webip(ocf::heartbeat:IPaddr):Started nod1.test.com
     webstore(ocf::heartbeat:Filesystem):Started nod1.test.com
     webserver(lsb:nginx):Started nod1.test.com
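#the output above shows that the resources are running on the nod1.test.com node

Now test the HTTP service by browsing to the VIP we defined (192.168.0.222); the NFS test page should be served.

Next, test whether the cluster resources fail over correctly: put nod1.test.com into standby and see whether the resources move to nod2.test.com: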

[root@nod1 ~]# crm node standby
[root@nod1 ~]# crm status
Last updated: Fri Jul 24 22:28:03 2015
Last change: Fri Jul 24 22:27:53 2015
Stack: classic openais (with plugin)
Current DC: nod1.test.com - partition with quorum
Version: 1.1.11-97629de
2 Nodes configured, 2 expected votes
3 Resources configured
Node nod1.test.com: standby
Online: [ nod2.test.com ]
 Resource Group: webservice
     webip(ocf::heartbeat:IPaddr):Started nod2.test.com
     webstore(ocf::heartbeat:Filesystem):Started nod2.test.com
     webserver(lsb:nginx):Started nod2.test.com
#the output above shows that all resources have moved to nod2.test.com
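
The later status output lists nod1.test.com as online again, so the node has to be returned to service after the drill. A sketch of the whole drill as one function, assuming `crm node standby <node>` and `crm node online <node>` behave as in crmsh; `failover_drill` and its wait parameter are illustrative.

```shell
#!/usr/bin/env bash
# Hypothetical helper: put a node in standby, show where resources went,
# then bring the node back online.
# Usage: failover_drill <node> [seconds-to-wait]
failover_drill() {
    local node="$1" wait_s="${2:-10}"
    crm node standby "$node"   # resources should migrate away from this node
    sleep "$wait_s"            # give the cluster time to move them
    crm status                 # inspect the new placement
    crm node online "$node"    # return the node to service
}
```

Because no location constraint is defined in this case study, the resources may well stay on nod2.test.com after nod1.test.com comes back online.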

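Refresh the page: it is still served correctly.

Failover works. Next, test whether the resource monitoring we defined takes effect: stop the nginx service or umount the shared storage, and once the monitor interval elapses the cluster will try to restart the service or remount the storage: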

[root@nod2 ~]# service nginx stop
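Stopping nginx:   [  OK  ]

After a short while the cluster detects the failure and restarts the service; the failure is recorded under "Failed actions", as shown below: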

[root@nod1 ~]# crm status
Last updated: Fri Jul 24 22:42:05 2015
Last change: Fri Jul 24 22:36:00 2015
Stack: classic openais (with plugin)
Current DC: nod1.test.com - partition with quorum
Version: 1.1.11-97629de
2 Nodes configured, 2 expected votes
3 Resources configured
Online: [ nod1.test.com nod2.test.com ]
 Resource Group: webservice
     webip(ocf::heartbeat:IPaddr):Started nod2.test.com
     webstore(ocf::heartbeat:Filesystem):Started nod2.test.com
     webserver(lsb:nginx):Started nod2.test.com
Failed actions:
    webserver_monitor_30000 on nod2.test.com 'not running' (7): call=41, status=complete, last-rc-change='Fri Jul 24 22:37:25 2015', queued=0ms, exec=0ms

Now test whether the shared storage is monitored and recovered as well:

[root@nod2 ~]# umount /usr/share/nginx/html/

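Accessing the site now shows the default nginx page.

Once the monitor interval elapses, the cluster detects the problem and tries to recover, as shown below: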

[root@nod1 ~]# crm status
Last updated: Fri Jul 24 22:44:03 2015
Last change: Fri Jul 24 22:36:00 2015
Stack: classic openais (with plugin)
Current DC: nod1.test.com - partition with quorum
Version: 1.1.11-97629de
2 Nodes configured, 2 expected votes
3 Resources configured
Online: [ nod1.test.com nod2.test.com ]
 Resource Group: webservice
     webip(ocf::heartbeat:IPaddr):Started nod2.test.com
     webstore(ocf::heartbeat:Filesystem):Started nod2.test.com
     webserver(lsb:nginx):Started nod2.test.com
Failed actions:
    webserver_monitor_30000 on nod2.test.com 'not running' (7): call=41, status=complete, last-rc-change='Fri Jul 24 22:37:25 2015', queued=0ms, exec=0ms
    webstore_monitor_60000 on nod2.test.com 'not running' (7): call=39, status=complete, last-rc-change='Fri Jul 24 22:42:55 2015', queued=0ms, exec=0ms
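
Even after the resources have recovered, the entries under "Failed actions" stay in the status output until they are cleared. A small sketch, assuming `crm resource cleanup <resource>` is available as in crmsh; `clear_failures` is a hypothetical wrapper.

```shell
#!/usr/bin/env bash
# Hypothetical helper: clear the recorded failures of one or more resources.
# Usage: clear_failures <resource> [<resource> ...]
clear_failures() {
    local rsc
    for rsc in "$@"; do
        crm resource cleanup "$rsc"
    done
}
```

Here `clear_failures webserver webstore` would clean up both resources that failed during the tests above.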

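The web page is now served from the shared storage again.

This completes the demonstration of high availability with corosync+pacemaker+crmsh.

6、Summary

For a Linux operations engineer, mastering high-availability architecture is an essential skill. When I first studied high availability the theory was hard to grasp, but after working through the experiments above I have a much better understanding of both the architecture and the theory covered in the previous post.

When building a two-node cluster with corosync+pacemaker, remember to set the global properties that disable STONITH and ignore loss of quorum (with two nodes, a single surviving node can never hold more than half of the votes), i.e.: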

crm(live)configure# property no-quorum-policy=ignore
crm(live)configure# property stonith-enabled=false
