hellojackyleon

Mysql集群架构MHA应用实战

MHA（Master High Availability）目前在MySQL高可用方面是一个相对成熟的解决方案，它由日本DeNA公司youshimaton（现就职于Facebook公司）开发，
是一套优秀的作为MySQL高可用性环境下故障切换和主从提升的高可用软件。在MySQL故障切换过程中，MHA能做到在0~30秒之内自动完成数据库的故障切换操作，
并且在进行故障切换的过程中，MHA能在最大程度上保证数据的一致性，以达到真正意义上的高可用。
该软件由两部分组成：MHA Manager（管理节点）和MHA Node（数据节点）。MHA Manager可以单独部署在一台独立的机器上管理多个master-slave集群，
也可以部署在一台slave节点上。MHA Node运行在每台MySQL服务器上，MHA Manager会定时探测集群中的master节点，当master出现故障时，
它可以自动将最新数据的slave提升为新的master，然后将所有其他的slave重新指向新的master。整个故障转移过程对应用程序完全透明。
在MHA自动故障切换过程中，MHA试图从宕机的主服务器上保存二进制日志，最大程度的保证数据的不丢失，但这并不总是可行的。
例如，如果主服务器硬件故障或无法通过ssh访问，MHA没法保存二进制日志，只进行故障转移而丢失了最新的数据。
使用MySQL 5.5的半同步复制，可以大大降低数据丢失的风险。MHA可以与半同步复制结合起来。如果只有一个slave已经收到了最新的二进制日志
，MHA可以将最新的二进制日志应用于其他所有的slave服务器上，因此可以保证所有节点的数据一致性。
目前MHA主要支持一主多从的架构，要搭建MHA,要求一个复制集群中必须最少有三台数据库服务器，一主二从，即一台充当master，一台充当备用master，
另外一台充当从库，因为至少需要三台服务器，出于机器成本的考虑，淘宝也在该基础上进行了改造，目前淘宝TMHA已经支持一主一从。
官方介绍：https://code.google.com/p/mysql-master-ha/
下图展示了如何通过MHA Manager管理多组主从复制。可以将MHA工作原理总结为如下

本次环境规划如下 (centos6.7)

1、配置三台服务器ssh互信
ssh-keygen -t rsa  一路回车即可
Generating public/private rsa key pair.
Enter file in which to save the key (/root/.ssh/id_rsa): 
Enter passphrase (empty for no passphrase): 
Enter same passphrase again: 
Your identification has been saved in /root/.ssh/id_rsa.
Your public key has been saved in /root/.ssh/id_rsa.pub.
The key fingerprint is:
c7:2e:ca:e2:c2:3b:30:63:97:b4:62:81:dd:27:e3:f9 root@centos02
The key's randomart p_w_picpath is:
+--[ RSA 2048]----+
|                 |
|                 |
|.. .             |
|....+ .  .       |
|  o.o=  S o      |
|++ +o    o       |
|o=o  .  . .      |
|  + ..E. .       |
|  .=..o          |
+-----------------+

[root@ansible mysql]#ssh-copy-id -i /root/.ssh/id_rsa.pub [email protected]
[root@ansible mysql]#ssh-copy-id  -i /root/.ssh/id_rsa.pub [email protected]
[root@ansible mysql]# ssh-copy-id -i /root/.ssh/id_rsa.pub 172.16.80.127
The authenticity of host '172.16.80.127 (172.16.80.127)' can't be established.
RSA key fingerprint is 05:89:5e:3d:2a:c1:ae:90:27:d9:a5:48:4a:ab:b9:79.
Are you sure you want to continue connecting (yes/no)? yes
Warning: Permanently added '172.16.80.127' (RSA) to the list of known hosts.
[email protected]'s password: 
Now try logging into the machine, with "ssh '172.16.80.127'", and check in:

  .ssh/authorized_keys

to make sure we haven't added extra keys that you weren't expecting.

测试
[root@ansible mysql]# ssh 172.16.80.117 ifconfig eth0 
eth0      Link encap:Ethernet  HWaddr 00:0C:29:45:FE:30  
          inet addr:172.16.80.117  Bcast:172.16.80.255  Mask:255.255.255.0
          inet6 addr: fe80::20c:29ff:fe45:fe30/64 Scope:Link
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
          RX packets:1220176 errors:0 dropped:0 overruns:0 frame:0
          TX packets:980887 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000 
          RX bytes:1198343068 (1.1 GiB)  TX bytes:1318688106 (1.2 GiB)

[root@ansible mysql]# ssh 172.16.80.127 ifconfig eth0 
eth0      Link encap:Ethernet  HWaddr 00:0C:29:FF:58:D9  
          inet addr:172.16.80.127  Bcast:172.16.80.255  Mask:255.255.255.0
          inet6 addr: fe80::20c:29ff:feff:58d9/64 Scope:Link
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
          RX packets:162129 errors:0 dropped:0 overruns:0 frame:0
          TX packets:27546 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000 
          RX bytes:225287420 (214.8 MiB)  TX bytes:1921228 (1.8 MiB)
          

2、三节点配置epel的yum源，安装相关依赖包
rpm -Uvh 
rpm --import/etc/pki/rpm-gpg/RPM-GPG-KEY-EPEL-6 
yum  -y install perl-DBD-MySQL  ncftp


三台mysql服务器的配置文件
master 172.16.80.117
server-id       = 1
read-only=1log-bin=mysql-bin
relay-log = mysql-relay-bin
replicate-wild-ignore-table=mysql.%
replicate-wild-ignore-table=test.%
replicate-wild-ignore-table=information_schema.%
salve 172.16.80.127

server-id       = 2
read-only=1
log-bin=mysql-bin
relay-log = mysql-relay-bin
replicate-wild-ignore-table=mysql.%
replicate-wild-ignore-table=test.%
replicate-wild-ignore-table=information_schema.%
salve-manager  172.16.80.128

server-id       = 3
read-only=1
log-bin=mysql-bin
relay-log = mysql-relay-bin
replicate-wild-ignore-table=mysql.%
replicate-wild-ignore-table=test.%
replicate-wild-ignore-table=information_schema.%


在3个mysql节点做授权配置
mysql>  grant replication slave  on *.* to 'martin'@'172.16.80.%' identified by '123456';
Query OK, 0 rows affected (0.05 sec)

mysql> grant all on *.* to 'root'@'172.16.80.%' identified by '123456';
Query OK, 0 rows affected (0.00 sec)

查看主节点上的日志状态mysql> show master status;
mysql> show master status;
+------------------+----------+--------------+------------------+
| File             | Position | Binlog_Do_DB | Binlog_Ignore_DB |
+------------------+----------+--------------+------------------+
| mysql-bin.000001 |      107 |              |                  |
+------------------+----------+--------------+------------------+
1 row in set (0.01 sec)

3、在两个从节点上面执行如下操作
change master to \
master_host='172.16.80.117',\
master_user='martin',\
master_password='123456',\
master_log_file='mysql-bin.000001',\
master_log_pos=107;

mysql> start slave;
Query OK, 0 rows affected (0.06 sec)

mysql> show slave status \G;   可以看到主从同步状态正常

4、安装MHA软件MHA提供了源码和rpm包两种安装方式，如果是rpm包安装，方式如下：
1）在三个节点依次安装MHA的node
[root@ansible tools]# rpm -ivh mha4mysql-node-0.56-0.el6.noarch.rpm 
Preparing...########################################### [100%]   
1:mha4mysql-node ########################################### [100%]
2）最后在Slave/MHA Manager节点安装mha4mysql-manage：
yum install perl-Parallel-ForkManager perl-Time-HiRes \ 
 perl-DBD-MySQL perl-Config-Tiny perl-Log-Dispatch \ 
 perl-Parallel-ForkManagerperl-Config-IniFilesperl-Time-HiRes
[root@ansible tools]# rpm -ivh mha4mysql-manager-0.56-0.el6.noarch.rpm
Preparing...########################################### [100%]  
 1:mha4mysql-manager ########################################### [100%]
[root@ansible tools]# mkdir -p /etc/mha/scripts

MHA 配置文件如下
[root@ansible etc]# cat masterha_default.cnf 
[server default]
user=root
password=123456
ssh_user=root
repl_user=martin
repl_password=123456
ping_interval=1
secondary_check_script = masterha_secondary_check -s 172.16.80.117 -s 172.16.80.127  --user=repl_user --master_host=centos02 --master_ip=172.16.80.117 --master_port=3306
master_ip_failover_script="/etc/mha/scripts/master_ip_failover"
report_script="/etc/mha/scripts/send_report"

[root@ansible mha]# cat app1.cnf 
[server default]
manager_log=/var/log/mha/app1/manager.log
manager_workdir=/var/log/mha/app1
[server1]
candidate_master=1
hostname=172.16.80.117
master_binlog_dir="/application/mysql/data"
[server2]
candidate_master=1
hostname=172.16.80.127
master_binlog_dir="/application/mysql/data"
check_repl_delay=0
[server3]
hostname=172.16.80.128
master_binlog_dir="/application/mysql/data"
no_master=1

1、通过masterha_check_ssh验证ssh信任登录是否成功，
[root@ansible scripts]# masterha_check_ssh  --conf=/etc/mha/app1.cnf 
Thu Aug 11 19:29:03 2016 - [info] Reading default configuration from /etc/masterha_default.cnf..
Thu Aug 11 19:29:03 2016 - [info] Reading application default configuration from /etc/mha/app1.cnf..
Thu Aug 11 19:29:03 2016 - [info] Reading server configuration from /etc/mha/app1.cnf..
Thu Aug 11 19:29:03 2016 - [info] Starting SSH connection tests..
Thu Aug 11 19:29:04 2016 - [debug] 
Thu Aug 11 19:29:03 2016 - [debug]  Connecting via SSH from [email protected](172.16.80.117:22) to [email protected](172.16.80.127:22)..
Thu Aug 11 19:29:03 2016 - [debug]   ok.
Thu Aug 11 19:29:03 2016 - [debug]  Connecting via SSH from [email protected](172.16.80.117:22) to [email protected](172.16.80.128:22)..
Thu Aug 11 19:29:04 2016 - [debug]   ok.
Thu Aug 11 19:29:04 2016 - [debug] 
Thu Aug 11 19:29:03 2016 - [debug]  Connecting via SSH from [email protected](172.16.80.127:22) to [email protected](172.16.80.117:22)..
Thu Aug 11 19:29:04 2016 - [debug]   ok.
Thu Aug 11 19:29:04 2016 - [debug]  Connecting via SSH from [email protected](172.16.80.127:22) to [email protected](172.16.80.128:22)..
Thu Aug 11 19:29:04 2016 - [debug]   ok.
Thu Aug 11 19:29:04 2016 - [debug] 
Thu Aug 11 19:29:04 2016 - [debug]  Connecting via SSH from [email protected](172.16.80.128:22) to [email protected](172.16.80.117:22)..
Thu Aug 11 19:29:04 2016 - [debug]   ok.
Thu Aug 11 19:29:04 2016 - [debug]  Connecting via SSH from [email protected](172.16.80.128:22) to [email protected](172.16.80.127:22)..
Thu Aug 11 19:29:04 2016 - [debug]   ok.
Thu Aug 11 19:29:04 2016 - [info] All SSH connection tests passed successfully.



2、masterha_check_repl验证mysql复制是否成功

masterha_check_repl --conf=/etc/mha/app1.cnf
[root@ansible scripts]# masterha_check_repl --conf=/etc/mha/app1.cnf
Thu Aug 11 19:31:53 2016 - [info] Reading default configuration from /etc/masterha_default.cnf..
Thu Aug 11 19:31:53 2016 - [info] Reading application default configuration from /etc/mha/app1.cnf..
Thu Aug 11 19:31:53 2016 - [info] Reading server configuration from /etc/mha/app1.cnf..
Thu Aug 11 19:31:53 2016 - [info] MHA::MasterMonitor version 0.56.
Thu Aug 11 19:31:54 2016 - [info] GTID failover mode = 0
Thu Aug 11 19:31:54 2016 - [info] Dead Servers:
Thu Aug 11 19:31:54 2016 - [info] Alive Servers:
Thu Aug 11 19:31:54 2016 - [info]   172.16.80.117(172.16.80.117:3306)
Thu Aug 11 19:31:54 2016 - [info]   172.16.80.127(172.16.80.127:3306)
Thu Aug 11 19:31:54 2016 - [info]   172.16.80.128(172.16.80.128:3306)
Thu Aug 11 19:31:54 2016 - [info] Alive Slaves:
Thu Aug 11 19:31:54 2016 - [info]   172.16.80.127(172.16.80.127:3306)  Version=5.5.49-log (oldest major version between slaves) log-bin:enabled
Thu Aug 11 19:31:54 2016 - [info]     Replicating from 172.16.80.117(172.16.80.117:3306)
Thu Aug 11 19:31:54 2016 - [info]     Primary candidate for the new Master (candidate_master is set)
Thu Aug 11 19:31:54 2016 - [info]   172.16.80.128(172.16.80.128:3306)  Version=5.5.49-log (oldest major version between slaves) log-bin:enabled
Thu Aug 11 19:31:54 2016 - [info]     Replicating from 172.16.80.117(172.16.80.117:3306)
Thu Aug 11 19:31:54 2016 - [info]     Not candidate for the new Master (no_master is set)
Thu Aug 11 19:31:54 2016 - [info] Current Alive Master: 172.16.80.117(172.16.80.117:3306)
Thu Aug 11 19:31:54 2016 - [info] Checking slave configurations..
Thu Aug 11 19:31:54 2016 - [warning]  relay_log_purge=0 is not set on slave 172.16.80.127(172.16.80.127:3306).
Thu Aug 11 19:31:54 2016 - [warning]  relay_log_purge=0 is not set on slave 172.16.80.128(172.16.80.128:3306).
Thu Aug 11 19:31:54 2016 - [info] Checking replication filtering settings..
Thu Aug 11 19:31:54 2016 - [info]  binlog_do_db= , binlog_ignore_db= 
Thu Aug 11 19:31:54 2016 - [info]  Replication filtering check ok.
Thu Aug 11 19:31:54 2016 - [info] GTID (with auto-pos) is not supported
Thu Aug 11 19:31:54 2016 - [info] Starting SSH connection tests..
Thu Aug 11 19:31:56 2016 - [info] All SSH connection tests passed successfully.
Thu Aug 11 19:31:56 2016 - [info] Checking MHA Node version..
Thu Aug 11 19:31:56 2016 - [info]  Version check ok.
Thu Aug 11 19:31:56 2016 - [info] Checking SSH publickey authentication settings on the current master..
Thu Aug 11 19:31:57 2016 - [info] HealthCheck: SSH to 172.16.80.117 is reachable.
Thu Aug 11 19:31:57 2016 - [info] Master MHA Node version is 0.56.
Thu Aug 11 19:31:57 2016 - [info] Checking recovery script configurations on 172.16.80.117(172.16.80.117:3306)..
Thu Aug 11 19:31:57 2016 - [info]   Executing command: save_binary_logs --command=test --start_pos=4 --binlog_dir=/application/mysql/data --output_file=/var/tmp/save_binary_logs_test --manager_version=0.56 --start_file=mysql-bin.000001 
Thu Aug 11 19:31:57 2016 - [info]   Connecting to [email protected](172.16.80.117:22).. 
  Creating /var/tmp if not exists..    ok.
  Checking output directory is accessible or not..
   ok.
  Binlog found at /application/mysql/data, up to mysql-bin.000001
Thu Aug 11 19:31:57 2016 - [info] Binlog setting check done.
Thu Aug 11 19:31:57 2016 - [info] Checking SSH publickey authentication and checking recovery script configurations on all alive slave servers..
Thu Aug 11 19:31:57 2016 - [info]   Executing command : apply_diff_relay_logs --command=test --slave_user='root' --slave_host=172.16.80.127 --slave_ip=172.16.80.127 --slave_port=3306 --workdir=/var/tmp --target_version=5.5.49-log --manager_version=0.56 --relay_log_info=/application/mysql/data/relay-log.info  --relay_dir=/application/mysql/data/  --slave_pass=xxx
Thu Aug 11 19:31:57 2016 - [info]   Connecting to [email protected](172.16.80.127:22).. 
  Checking slave recovery environment settings..
    Opening /application/mysql/data/relay-log.info ... ok.
    Relay log found at /application/mysql/data, up to mysql-relay-bin.000002
    Temporary relay log file is /application/mysql/data/mysql-relay-bin.000002
    Testing mysql connection and privileges.. done.
    Testing mysqlbinlog output.. done.
    Cleaning up test file(s).. done.
Thu Aug 11 19:31:58 2016 - [info]   Executing command : apply_diff_relay_logs --command=test --slave_user='root' --slave_host=172.16.80.128 --slave_ip=172.16.80.128 --slave_port=3306 --workdir=/var/tmp --target_version=5.5.49-log --manager_version=0.56 --relay_log_info=/application/mysql/data/relay-log.info  --relay_dir=/application/mysql/data/  --slave_pass=xxx
Thu Aug 11 19:31:58 2016 - [info]   Connecting to [email protected](172.16.80.128:22).. 
  Checking slave recovery environment settings..
    Opening /application/mysql/data/relay-log.info ... ok.
    Relay log found at /application/mysql/data, up to mysql-relay-bin.000002
    Temporary relay log file is /application/mysql/data/mysql-relay-bin.000002
    Testing mysql connection and privileges.. done.
    Testing mysqlbinlog output.. done.
    Cleaning up test file(s).. done.
Thu Aug 11 19:31:58 2016 - [info] Slaves settings check done.
Thu Aug 11 19:31:58 2016 - [info] 
172.16.80.117(172.16.80.117:3306) (current master)
 +--172.16.80.127(172.16.80.127:3306)
 +--172.16.80.128(172.16.80.128:3306)

Thu Aug 11 19:31:58 2016 - [info] Checking replication health on 172.16.80.127..
Thu Aug 11 19:31:58 2016 - [info]  ok.
Thu Aug 11 19:31:58 2016 - [info] Checking replication health on 172.16.80.128..
Thu Aug 11 19:31:58 2016 - [info]  ok.
Thu Aug 11 19:31:58 2016 - [info] Checking master_ip_failover_script status:
Thu Aug 11 19:31:58 2016 - [info]   /etc/mha/scripts/master_ip_failover --command=status --ssh_user=root --orig_master_host=172.16.80.117 --orig_master_ip=172.16.80.117 --orig_master_port=3306 


IN SCRIPT TEST====/sbin/ifconfig eth0:1 down==/sbin/ifconfig eth0:1 172.16.80.200/24===

Checking the Status of the script.. OK 
Thu Aug 11 19:31:58 2016 - [info]  OK.
Thu Aug 11 19:31:58 2016 - [warning] shutdown_script is not defined.
Thu Aug 11 19:31:58 2016 - [info] Got exit code 0 (Not master dead).

MySQL Replication Health is OK.


准备failover脚本用于vip切换

[root@ansible ~]# cat /etc/mha/scripts/master_ip_failover
#!/usr/bin/env perl

use strict;
use warnings FATAL => 'all';

use Getopt::Long;

my (
    $command,          $ssh_user,        $orig_master_host, $orig_master_ip,
    $orig_master_port, $new_master_host, $new_master_ip,    $new_master_port
);

my $vip = '172.16.80.200/24';
my $key = '1';
my $ssh_start_vip = "/sbin/ifconfig eth0:$key $vip";
my $ssh_stop_vip = "/sbin/ifconfig eth0:$key down";

GetOptions(
    'command=s'          => \$command,
    'ssh_user=s'         => \$ssh_user,
    'orig_master_host=s' => \$orig_master_host,
    'orig_master_ip=s'   => \$orig_master_ip,
    'orig_master_port=i' => \$orig_master_port,
    'new_master_host=s'  => \$new_master_host,
    'new_master_ip=s'    => \$new_master_ip,
    'new_master_port=i'  => \$new_master_port,
);

exit &main();

sub main {

    print "\n\nIN SCRIPT TEST====$ssh_stop_vip==$ssh_start_vip===\n\n";

    if ( $command eq "stop" || $command eq "stopssh" ) {

        my $exit_code = 1;
        eval {
            print "Disabling the VIP on old master: $orig_master_host \n";
            &stop_vip();
            $exit_code = 0;
        };
        if ($@) {
            warn "Got Error: $@\n";
            exit $exit_code;
        }
        exit $exit_code;
    }
    elsif ( $command eq "start" ) {

        my $exit_code = 10;
        eval {
            print "Enabling the VIP - $vip on the new master - $new_master_host \n";
            &start_vip();
            $exit_code = 0;
        };
        if ($@) {
            warn $@;
            exit $exit_code;
        }
        exit $exit_code;
    }
    elsif ( $command eq "status" ) {
        print "Checking the Status of the script.. OK \n";
        exit 0;
    }
    else {
        &usage();
        exit 1;
    }
}

sub start_vip() {
    `ssh $ssh_user\@$new_master_host \" $ssh_start_vip \"`;
}
sub stop_vip() {
     return 0  unless  ($ssh_user);
    `ssh $ssh_user\@$orig_master_host \" $ssh_stop_vip \"`;
}

sub usage {
    print
    "Usage: master_ip_failover --command=start|stop|stopssh|status --orig_master_host=host --orig_master_ip=ip --orig_master_port=port --new_master_host=host --new_master_ip=ip --new_master_port=port\n";
}



启动MHA
先执行如下命令：
/sbin/ifconfig eth0:1 172.16.80.200（只需第一次添加）
将vip绑定到目前的master上。

然后通过masterha_manager启动MHA监控：
[root@ansible scripts]# mkdir /var/log//masterha/app1 -p
[root@ansible scripts]# touch /var/log/masterha/app1/manager.log

[root@ansible scripts]# nohup masterha_manager --conf=/etc/mha/app1.cnf  \
--remove_dead_master_conf --ignore_last_failover< /dev/null > \ 
/var/log/masterha/app1/manager.log 2>&1 &


然后通过masterha_check_status查看MHA状态
[root@ansible scripts]# masterha_check_status --conf=/etc/mha/app1.cnf 
app1 (pid:58184) is running(0:PING_OK), master:172.16.80.117

模拟主库 172.16.80.117 数据库挂掉
[root@centos02 .ssh]# /etc/init.d/mysqld stop
Shutting down MySQL................               [  OK  ]

看看failover过程中的日志记录情况
Checking the Status of the script.. OK 
Thu Aug 11 19:40:45 2016 - [info]  OK.
Thu Aug 11 19:40:45 2016 - [warning] shutdown_script is not defined.
Thu Aug 11 19:40:45 2016 - [info] Set master ping interval 1 seconds.
Thu Aug 11 19:40:45 2016 - [info] Set secondary check script: masterha_secondary_check -s 172.16.80.117 -s 172.16.80.127  --user=repl_user --master_host=centos02 --master_ip=172.16.80.117 --master_port=3306
Thu Aug 11 19:40:45 2016 - [info] Starting ping health check on 172.16.80.117(172.16.80.117:3306)..
Thu Aug 11 19:40:45 2016 - [info] Ping(SELECT) succeeded, waiting until MySQL doesn't respond..
Thu Aug 11 19:42:36 2016 - [warning] Got error on MySQL select ping: 2006 (MySQL server has gone away)
Thu Aug 11 19:42:36 2016 - [info] Executing secondary network check script: masterha_secondary_check -s 172.16.80.117 -s 172.16.80.127  --user=repl_user --master_host=centos02 --master_ip=172.16.80.117 --master_port=3306  --user=root  --master_host=172.16.80.117  --master_ip=172.16.80.117  --master_port=3306 --master_user=root --master_password=123456 --ping_type=SELECT
Thu Aug 11 19:42:36 2016 - [info] Executing SSH check script: save_binary_logs --command=test --start_pos=4 --binlog_dir=/application/mysql/data --output_file=/var/tmp/save_binary_logs_test --manager_version=0.56 --binlog_prefix=mysql-bin
Thu Aug 11 19:42:37 2016 - [info] HealthCheck: SSH to 172.16.80.117 is reachable.
Monitoring server 172.16.80.117 is reachable, Master is not reachable from 172.16.80.117. OK.
Thu Aug 11 19:42:37 2016 - [warning] Got error on MySQL connect: 2013 (Lost connection to MySQL server at 'reading initial communication packet', system error: 111)
Thu Aug 11 19:42:37 2016 - [warning] Connection failed 2 time(s)..
Monitoring server 172.16.80.127 is reachable, Master is not reachable from 172.16.80.127. OK.
Thu Aug 11 19:42:38 2016 - [info] Master is not reachable from all other monitoring servers. Failover should start.
Thu Aug 11 19:42:38 2016 - [warning] Got error on MySQL connect: 2013 (Lost connection to MySQL server at 'reading initial communication packet', system error: 111)
Thu Aug 11 19:42:38 2016 - [warning] Connection failed 3 time(s)..
Thu Aug 11 19:42:39 2016 - [warning] Got error on MySQL connect: 2013 (Lost connection to MySQL server at 'reading initial communication packet', system error: 111)
Thu Aug 11 19:42:39 2016 - [warning] Connection failed 4 time(s)..
Thu Aug 11 19:42:39 2016 - [warning] Master is not reachable from health checker!
Thu Aug 11 19:42:39 2016 - [warning] Master 172.16.80.117(172.16.80.117:3306) is not reachable!
Thu Aug 11 19:42:39 2016 - [warning] SSH is reachable.
Thu Aug 11 19:42:39 2016 - [info] Connecting to a master server failed. Reading configuration file /etc/masterha_default.cnf and /etc/mha/app1.cnf again, and trying to connect to all servers to check server status..
Thu Aug 11 19:42:39 2016 - [info] Reading default configuration from /etc/masterha_default.cnf..
Thu Aug 11 19:42:39 2016 - [info] Reading application default configuration from /etc/mha/app1.cnf..
Thu Aug 11 19:42:39 2016 - [info] Reading server configuration from /etc/mha/app1.cnf..
Thu Aug 11 19:42:39 2016 - [info] GTID failover mode = 0
Thu Aug 11 19:42:39 2016 - [info] Dead Servers:
Thu Aug 11 19:42:39 2016 - [info]   172.16.80.117(172.16.80.117:3306)
Thu Aug 11 19:42:39 2016 - [info] Alive Servers:
Thu Aug 11 19:42:39 2016 - [info]   172.16.80.127(172.16.80.127:3306)
Thu Aug 11 19:42:39 2016 - [info]   172.16.80.128(172.16.80.128:3306)
Thu Aug 11 19:42:39 2016 - [info] Alive Slaves:
Thu Aug 11 19:42:39 2016 - [info]   172.16.80.127(172.16.80.127:3306)  Version=5.5.49-log (oldest major version between slaves) log-bin:enabled
Thu Aug 11 19:42:39 2016 - [info]     Replicating from 172.16.80.117(172.16.80.117:3306)
Thu Aug 11 19:42:39 2016 - [info]     Primary candidate for the new Master (candidate_master is set)
Thu Aug 11 19:42:39 2016 - [info]   172.16.80.128(172.16.80.128:3306)  Version=5.5.49-log (oldest major version between slaves) log-bin:enabled
Thu Aug 11 19:42:39 2016 - [info]     Replicating from 172.16.80.117(172.16.80.117:3306)
Thu Aug 11 19:42:39 2016 - [info]     Not candidate for the new Master (no_master is set)
Thu Aug 11 19:42:39 2016 - [info] Checking slave configurations..
Thu Aug 11 19:42:39 2016 - [warning]  relay_log_purge=0 is not set on slave 172.16.80.127(172.16.80.127:3306).
Thu Aug 11 19:42:39 2016 - [warning]  relay_log_purge=0 is not set on slave 172.16.80.128(172.16.80.128:3306).
Thu Aug 11 19:42:39 2016 - [info] Checking replication filtering settings..
Thu Aug 11 19:42:39 2016 - [info]  Replication filtering check ok.
Thu Aug 11 19:42:39 2016 - [info] Master is down!
Thu Aug 11 19:42:39 2016 - [info] Terminating monitoring script.
Thu Aug 11 19:42:39 2016 - [info] Got exit code 20 (Master dead).
Thu Aug 11 19:42:39 2016 - [info] MHA::MasterFailover version 0.56.
Thu Aug 11 19:42:39 2016 - [info] Starting master failover.
Thu Aug 11 19:42:39 2016 - [info] 
Thu Aug 11 19:42:39 2016 - [info] * Phase 1: Configuration Check Phase..
Thu Aug 11 19:42:39 2016 - [info] 
Thu Aug 11 19:42:40 2016 - [info] GTID failover mode = 0
Thu Aug 11 19:42:40 2016 - [info] Dead Servers:
Thu Aug 11 19:42:40 2016 - [info]   172.16.80.117(172.16.80.117:3306)
Thu Aug 11 19:42:40 2016 - [info] Checking master reachability via MySQL(double check)...
Thu Aug 11 19:42:40 2016 - [info]  ok.
Thu Aug 11 19:42:40 2016 - [info] Alive Servers:
Thu Aug 11 19:42:40 2016 - [info]   172.16.80.127(172.16.80.127:3306)
Thu Aug 11 19:42:40 2016 - [info]   172.16.80.128(172.16.80.128:3306)
Thu Aug 11 19:42:40 2016 - [info] Alive Slaves:
Thu Aug 11 19:42:40 2016 - [info]   172.16.80.127(172.16.80.127:3306)  Version=5.5.49-log (oldest major version between slaves) log-bin:enabled
Thu Aug 11 19:42:40 2016 - [info]     Replicating from 172.16.80.117(172.16.80.117:3306)
Thu Aug 11 19:42:40 2016 - [info]     Primary candidate for the new Master (candidate_master is set)
Thu Aug 11 19:42:40 2016 - [info]   172.16.80.128(172.16.80.128:3306)  Version=5.5.49-log (oldest major version between slaves) log-bin:enabled
Thu Aug 11 19:42:40 2016 - [info]     Replicating from 172.16.80.117(172.16.80.117:3306)
Thu Aug 11 19:42:40 2016 - [info]     Not candidate for the new Master (no_master is set)
Thu Aug 11 19:42:40 2016 - [info] Starting Non-GTID based failover.
Thu Aug 11 19:42:40 2016 - [info] 
Thu Aug 11 19:42:40 2016 - [info] ** Phase 1: Configuration Check Phase completed.
Thu Aug 11 19:42:40 2016 - [info] 
Thu Aug 11 19:42:40 2016 - [info] * Phase 2: Dead Master Shutdown Phase..
Thu Aug 11 19:42:40 2016 - [info] 
Thu Aug 11 19:42:40 2016 - [info] Forcing shutdown so that applications never connect to the current master..
Thu Aug 11 19:42:40 2016 - [info] Executing master IP deactivation script:
Thu Aug 11 19:42:40 2016 - [info]   /etc/mha/scripts/master_ip_failover --orig_master_host=172.16.80.117 --orig_master_ip=172.16.80.117 --orig_master_port=3306 --command=stopssh --ssh_user=root  


IN SCRIPT TEST====/sbin/ifconfig eth0:1 down==/sbin/ifconfig eth0:1 172.16.80.200/24===

Disabling the VIP on old master: 172.16.80.117 
Thu Aug 11 19:42:40 2016 - [info]  done.
Thu Aug 11 19:42:40 2016 - [warning] shutdown_script is not set. Skipping explicit shutting down of the dead master.
Thu Aug 11 19:42:40 2016 - [info] * Phase 2: Dead Master Shutdown Phase completed.
Thu Aug 11 19:42:40 2016 - [info] 
Thu Aug 11 19:42:40 2016 - [info] * Phase 3: Master Recovery Phase..
Thu Aug 11 19:42:40 2016 - [info] 
Thu Aug 11 19:42:40 2016 - [info] * Phase 3.1: Getting Latest Slaves Phase..
Thu Aug 11 19:42:40 2016 - [info] 
Thu Aug 11 19:42:40 2016 - [info] The latest binary log file/position on all slaves is mysql-bin.000001:107
Thu Aug 11 19:42:40 2016 - [info] Latest slaves (Slaves that received relay log files to the latest):
Thu Aug 11 19:42:40 2016 - [info]   172.16.80.127(172.16.80.127:3306)  Version=5.5.49-log (oldest major version between slaves) log-bin:enabled
Thu Aug 11 19:42:40 2016 - [info]     Replicating from 172.16.80.117(172.16.80.117:3306)
Thu Aug 11 19:42:40 2016 - [info]     Primary candidate for the new Master (candidate_master is set)
Thu Aug 11 19:42:40 2016 - [info]   172.16.80.128(172.16.80.128:3306)  Version=5.5.49-log (oldest major version between slaves) log-bin:enabled
Thu Aug 11 19:42:40 2016 - [info]     Replicating from 172.16.80.117(172.16.80.117:3306)
Thu Aug 11 19:42:40 2016 - [info]     Not candidate for the new Master (no_master is set)
Thu Aug 11 19:42:40 2016 - [info] The oldest binary log file/position on all slaves is mysql-bin.000001:107
Thu Aug 11 19:42:40 2016 - [info] Oldest slaves:
Thu Aug 11 19:42:40 2016 - [info]   172.16.80.127(172.16.80.127:3306)  Version=5.5.49-log (oldest major version between slaves) log-bin:enabled
Thu Aug 11 19:42:40 2016 - [info]     Replicating from 172.16.80.117(172.16.80.117:3306)
Thu Aug 11 19:42:40 2016 - [info]     Primary candidate for the new Master (candidate_master is set)
Thu Aug 11 19:42:40 2016 - [info]   172.16.80.128(172.16.80.128:3306)  Version=5.5.49-log (oldest major version between slaves) log-bin:enabled
Thu Aug 11 19:42:40 2016 - [info]     Replicating from 172.16.80.117(172.16.80.117:3306)
Thu Aug 11 19:42:40 2016 - [info]     Not candidate for the new Master (no_master is set)
Thu Aug 11 19:42:40 2016 - [info] 
Thu Aug 11 19:42:40 2016 - [info] * Phase 3.2: Saving Dead Master's Binlog Phase..
Thu Aug 11 19:42:40 2016 - [info] 
Thu Aug 11 19:42:41 2016 - [info] Fetching dead master's binary logs..
Thu Aug 11 19:42:41 2016 - [info] Executing command on the dead master 172.16.80.117(172.16.80.117:3306): save_binary_logs --command=save --start_file=mysql-bin.000001  --start_pos=107 --binlog_dir=/application/mysql/data --output_file=/var/tmp/saved_master_binlog_from_172.16.80.117_3306_20160811194239.binlog --handle_raw_binlog=1 --disable_log_bin=0 --manager_version=0.56
  Creating /var/tmp if not exists..    ok.
 Concat binary/relay logs from mysql-bin.000001 pos 107 to mysql-bin.000001 EOF into /var/tmp/saved_master_binlog_from_172.16.80.117_3306_20160811194239.binlog ..
  Dumping binlog format description event, from position 0 to 107.. ok.
  Dumping effective binlog data from /application/mysql/data/mysql-bin.000001 position 107 to tail(126).. ok.
 Concat succeeded.
Thu Aug 11 19:42:42 2016 - [info] scp from [email protected]:/var/tmp/saved_master_binlog_from_172.16.80.117_3306_20160811194239.binlog to local:/var/log/mha/app1/saved_master_binlog_from_172.16.80.117_3306_20160811194239.binlog succeeded.
Thu Aug 11 19:42:43 2016 - [info] HealthCheck: SSH to 172.16.80.127 is reachable.
Thu Aug 11 19:42:43 2016 - [info] HealthCheck: SSH to 172.16.80.128 is reachable.
Thu Aug 11 19:42:43 2016 - [info] 
Thu Aug 11 19:42:43 2016 - [info] * Phase 3.3: Determining New Master Phase..
Thu Aug 11 19:42:43 2016 - [info] 
Thu Aug 11 19:42:43 2016 - [info] Finding the latest slave that has all relay logs for recovering other slaves..
Thu Aug 11 19:42:43 2016 - [info] All slaves received relay logs to the same position. No need to resync each other.
Thu Aug 11 19:42:43 2016 - [info] Searching new master from slaves..
Thu Aug 11 19:42:43 2016 - [info]  Candidate masters from the configuration file:
Thu Aug 11 19:42:43 2016 - [info]   172.16.80.127(172.16.80.127:3306)  Version=5.5.49-log (oldest major version between slaves) log-bin:enabled
Thu Aug 11 19:42:43 2016 - [info]     Replicating from 172.16.80.117(172.16.80.117:3306)
Thu Aug 11 19:42:43 2016 - [info]     Primary candidate for the new Master (candidate_master is set)
Thu Aug 11 19:42:43 2016 - [info]  Non-candidate masters:
Thu Aug 11 19:42:43 2016 - [info]   172.16.80.128(172.16.80.128:3306)  Version=5.5.49-log (oldest major version between slaves) log-bin:enabled
Thu Aug 11 19:42:43 2016 - [info]     Replicating from 172.16.80.117(172.16.80.117:3306)
Thu Aug 11 19:42:43 2016 - [info]     Not candidate for the new Master (no_master is set)
Thu Aug 11 19:42:43 2016 - [info]  Searching from candidate_master slaves which have received the latest relay log events..
Thu Aug 11 19:42:43 2016 - [info] New master is 172.16.80.127(172.16.80.127:3306)
Thu Aug 11 19:42:43 2016 - [info] Starting master failover..
Thu Aug 11 19:42:43 2016 - [info] 
From:
172.16.80.117(172.16.80.117:3306) (current master)
 +--172.16.80.127(172.16.80.127:3306)
 +--172.16.80.128(172.16.80.128:3306)

To:
172.16.80.127(172.16.80.127:3306) (new master)
 +--172.16.80.128(172.16.80.128:3306)
Thu Aug 11 19:42:43 2016 - [info] 
Thu Aug 11 19:42:43 2016 - [info] * Phase 3.3: New Master Diff Log Generation Phase..
Thu Aug 11 19:42:43 2016 - [info] 
Thu Aug 11 19:42:43 2016 - [info]  This server has all relay logs. No need to generate diff files from the latest slave.
Thu Aug 11 19:42:43 2016 - [info] Sending binlog..
Thu Aug 11 19:42:44 2016 - [info] scp from local:/var/log/mha/app1/saved_master_binlog_from_172.16.80.117_3306_20160811194239.binlog to [email protected]:/var/tmp/saved_master_binlog_from_172.16.80.117_3306_20160811194239.binlog succeeded.
Thu Aug 11 19:42:44 2016 - [info] 
Thu Aug 11 19:42:44 2016 - [info] * Phase 3.4: Master Log Apply Phase..
Thu Aug 11 19:42:44 2016 - [info] 
Thu Aug 11 19:42:44 2016 - [info] *NOTICE: If any error happens from this phase, manual recovery is needed.
Thu Aug 11 19:42:44 2016 - [info] Starting recovery on 172.16.80.127(172.16.80.127:3306)..
Thu Aug 11 19:42:44 2016 - [info]  Generating diffs succeeded.
Thu Aug 11 19:42:44 2016 - [info] Waiting until all relay logs are applied.
Thu Aug 11 19:42:44 2016 - [info]  done.
Thu Aug 11 19:42:44 2016 - [info] Getting slave status..
Thu Aug 11 19:42:44 2016 - [info] This slave(172.16.80.127)'s Exec_Master_Log_Pos equals to Read_Master_Log_Pos(mysql-bin.000001:107). No need to recover from Exec_Master_Log_Pos.
Thu Aug 11 19:42:44 2016 - [info] Connecting to the target slave host 172.16.80.127, running recover script..
Thu Aug 11 19:42:44 2016 - [info] Executing command: apply_diff_relay_logs --command=apply --slave_user='root' --slave_host=172.16.80.127 --slave_ip=172.16.80.127  --slave_port=3306 --apply_files=/var/tmp/saved_master_binlog_from_172.16.80.117_3306_20160811194239.binlog --workdir=/var/tmp --target_version=5.5.49-log --timestamp=20160811194239 --handle_raw_binlog=1 --disable_log_bin=0 --manager_version=0.56 --slave_pass=xxx
Thu Aug 11 19:42:46 2016 - [info] 
Applying differential binary/relay log files /var/tmp/saved_master_binlog_from_172.16.80.117_3306_20160811194239.binlog on 172.16.80.127:3306. This may take long time...
Applying log files succeeded.
Thu Aug 11 19:42:46 2016 - [info]  All relay logs were successfully applied.
Thu Aug 11 19:42:46 2016 - [info] Getting new master's binlog name and position..
Thu Aug 11 19:42:46 2016 - [info]  mysql-bin.000001:245
Thu Aug 11 19:42:46 2016 - [info]  All other slaves should start replication from here. Statement should be: CHANGE MASTER TO MASTER_HOST='172.16.80.127', MASTER_PORT=3306, MASTER_LOG_FILE='mysql-bin.000001', MASTER_LOG_POS=245, MASTER_USER='martin', MASTER_PASSWORD='xxx';
Thu Aug 11 19:42:46 2016 - [info] Executing master IP activate script:
Thu Aug 11 19:42:46 2016 - [info]   /etc/mha/scripts/master_ip_failover --command=start --ssh_user=root --orig_master_host=172.16.80.117 --orig_master_ip=172.16.80.117 --orig_master_port=3306 --new_master_host=172.16.80.127 --new_master_ip=172.16.80.127 --new_master_port=3306 --new_master_user='root' --new_master_password='123456'  
Unknown option: new_master_user
Unknown option: new_master_password


IN SCRIPT TEST====/sbin/ifconfig eth0:1 down==/sbin/ifconfig eth0:1 172.16.80.200/24===

Enabling the VIP - 172.16.80.200/24 on the new master - 172.16.80.127 
Thu Aug 11 19:42:47 2016 - [info]  OK.
Thu Aug 11 19:42:47 2016 - [info] Setting read_only=0 on 172.16.80.127(172.16.80.127:3306)..
Thu Aug 11 19:42:47 2016 - [info]  ok.
Thu Aug 11 19:42:47 2016 - [info] ** Finished master recovery successfully.
Thu Aug 11 19:42:47 2016 - [info] * Phase 3: Master Recovery Phase completed.
Thu Aug 11 19:42:47 2016 - [info] 
Thu Aug 11 19:42:47 2016 - [info] * Phase 4: Slaves Recovery Phase..
Thu Aug 11 19:42:47 2016 - [info] 
Thu Aug 11 19:42:47 2016 - [info] * Phase 4.1: Starting Parallel Slave Diff Log Generation Phase..
Thu Aug 11 19:42:47 2016 - [info] 
Thu Aug 11 19:42:47 2016 - [info] -- Slave diff file generation on host 172.16.80.128(172.16.80.128:3306) started, pid: 58855. Check tmp log /var/log/mha/app1/172.16.80.128_3306_20160811194239.log if it takes time..
Thu Aug 11 19:42:47 2016 - [info] 
Thu Aug 11 19:42:47 2016 - [info] Log messages from 172.16.80.128 ...
Thu Aug 11 19:42:47 2016 - [info] 
Thu Aug 11 19:42:47 2016 - [info]  This server has all relay logs. No need to generate diff files from the latest slave.
Thu Aug 11 19:42:47 2016 - [info] End of log messages from 172.16.80.128.
Thu Aug 11 19:42:47 2016 - [info] -- 172.16.80.128(172.16.80.128:3306) has the latest relay log events.
Thu Aug 11 19:42:47 2016 - [info] Generating relay diff files from the latest slave succeeded.
Thu Aug 11 19:42:47 2016 - [info] 
Thu Aug 11 19:42:47 2016 - [info] * Phase 4.2: Starting Parallel Slave Log Apply Phase..
Thu Aug 11 19:42:47 2016 - [info] 
Thu Aug 11 19:42:47 2016 - [info] -- Slave recovery on host 172.16.80.128(172.16.80.128:3306) started, pid: 58857. Check tmp log /var/log/mha/app1/172.16.80.128_3306_20160811194239.log if it takes time..
Thu Aug 11 19:42:48 2016 - [info] 
Thu Aug 11 19:42:48 2016 - [info] Log messages from 172.16.80.128 ...
Thu Aug 11 19:42:48 2016 - [info] 
Thu Aug 11 19:42:47 2016 - [info] Sending binlog..
Thu Aug 11 19:42:47 2016 - [info] scp from local:/var/log/mha/app1/saved_master_binlog_from_172.16.80.117_3306_20160811194239.binlog to [email protected]:/var/tmp/saved_master_binlog_from_172.16.80.117_3306_20160811194239.binlog succeeded.
Thu Aug 11 19:42:47 2016 - [info] Starting recovery on 172.16.80.128(172.16.80.128:3306)..
Thu Aug 11 19:42:47 2016 - [info]  Generating diffs succeeded.
Thu Aug 11 19:42:47 2016 - [info] Waiting until all relay logs are applied.
Thu Aug 11 19:42:47 2016 - [info]  done.
Thu Aug 11 19:42:47 2016 - [info] Getting slave status..
Thu Aug 11 19:42:47 2016 - [info] This slave(172.16.80.128)'s Exec_Master_Log_Pos equals to Read_Master_Log_Pos(mysql-bin.000001:107). No need to recover from Exec_Master_Log_Pos.
Thu Aug 11 19:42:47 2016 - [info] Connecting to the target slave host 172.16.80.128, running recover script..
Thu Aug 11 19:42:47 2016 - [info] Executing command: apply_diff_relay_logs --command=apply --slave_user='root' --slave_host=172.16.80.128 --slave_ip=172.16.80.128  --slave_port=3306 --apply_files=/var/tmp/saved_master_binlog_from_172.16.80.117_3306_20160811194239.binlog --workdir=/var/tmp --target_version=5.5.49-log --timestamp=20160811194239 --handle_raw_binlog=1 --disable_log_bin=0 --manager_version=0.56 --slave_pass=xxx
Thu Aug 11 19:42:48 2016 - [info] 
Applying differential binary/relay log files /var/tmp/saved_master_binlog_from_172.16.80.117_3306_20160811194239.binlog on 172.16.80.128:3306. This may take long time...
Applying log files succeeded.
Thu Aug 11 19:42:48 2016 - [info]  All relay logs were successfully applied.
Thu Aug 11 19:42:48 2016 - [info]  Resetting slave 172.16.80.128(172.16.80.128:3306) and starting replication from the new master 172.16.80.127(172.16.80.127:3306)..
Thu Aug 11 19:42:48 2016 - [info]  Executed CHANGE MASTER.
Thu Aug 11 19:42:48 2016 - [info]  Slave started.
Thu Aug 11 19:42:48 2016 - [info] End of log messages from 172.16.80.128.
Thu Aug 11 19:42:48 2016 - [info] -- Slave recovery on host 172.16.80.128(172.16.80.128:3306) succeeded.
Thu Aug 11 19:42:48 2016 - [info] All new slave servers recovered successfully.
Thu Aug 11 19:42:48 2016 - [info] 
Thu Aug 11 19:42:48 2016 - [info] * Phase 5: New master cleanup phase..
Thu Aug 11 19:42:48 2016 - [info] 
Thu Aug 11 19:42:48 2016 - [info] Resetting slave info on the new master..
Thu Aug 11 19:42:49 2016 - [info]  172.16.80.127: Resetting slave info succeeded.
Thu Aug 11 19:42:49 2016 - [info] Master failover to 172.16.80.127(172.16.80.127:3306) completed successfully.
Thu Aug 11 19:42:49 2016 - [info] 

----- Failover Report -----

app1: MySQL Master failover 172.16.80.117(172.16.80.117:3306) to 172.16.80.127(172.16.80.127:3306) succeeded

Master 172.16.80.117(172.16.80.117:3306) is down!

Check MHA Manager logs at ansible:/var/log/mha/app1/manager.log for details.

Started automated(non-interactive) failover.
Invalidated master IP address on 172.16.80.117(172.16.80.117:3306)
The latest slave 172.16.80.127(172.16.80.127:3306) has all relay logs for recovery.
Selected 172.16.80.127(172.16.80.127:3306) as a new master.
172.16.80.127(172.16.80.127:3306): OK: Applying all logs succeeded.
172.16.80.127(172.16.80.127:3306): OK: Activated master IP address.
172.16.80.128(172.16.80.128:3306): This host has the latest relay log events.
Generating relay diff files from the latest slave succeeded.
172.16.80.128(172.16.80.128:3306): OK: Applying all logs succeeded. Slave started, replicating from 172.16.80.127(172.16.80.127:3306)
172.16.80.127(172.16.80.127:3306): Resetting slave info succeeded.
Master failover to 172.16.80.127(172.16.80.127:3306) completed successfully.

可以看到这个从库自动连接到了新的主库 172.16.80.127上面

切换完成后，关注如下变化：
1、    vip自动从原来的master切换到新的master，同时，manager节点的监控进程自动退出。
2、    在日志目录（/var/log/masterha/app1）产生一个app1.failover.complete文件
3、    /etc/mha/app1.cnf配置文件中原来老的master配置被删除。

再截图之前ssh及mysql主从检查的过程

修复老的主master 172.16.80.117
[root@centos02 .ssh]# /etc/init.d/mysqld startStarting MySQL................                             [  OK  ]
此时在管理节点 172.16.80.128上检查同步情况

[root@ansible ~]# masterha_check_repl --conf=/etc/mha/app1.cnf
Fri Aug 12 14:03:15 2016 - [info] Reading default configuration from /etc/masterha_default.cnf..
Fri Aug 12 14:03:15 2016 - [info] Reading application default configuration from /etc/mha/app1.cnf..
Fri Aug 12 14:03:15 2016 - [info] Reading server configuration from /etc/mha/app1.cnf..
Fri Aug 12 14:03:15 2016 - [info] MHA::MasterMonitor version 0.56.
Fri Aug 12 14:03:19 2016 - [error][/usr/share/perl5/vendor_perl/MHA/ServerManager.pm, ln653] There are 2 non-slave servers! MHA manages at most one non-slave server. Check configurations.
Fri Aug 12 14:03:19 2016 - [error][/usr/share/perl5/vendor_perl/MHA/MasterMonitor.pm, ln424] Error happened on checking configurations.  at /usr/share/perl5/vendor_perl/MHA/MasterMonitor.pm line 326
Fri Aug 12 14:03:19 2016 - [error][/usr/share/perl5/vendor_perl/MHA/MasterMonitor.pm, ln523] Error happened on monitoring servers.
Fri Aug 12 14:03:19 2016 - [info] Got exit code 1 (Not master dead).

MySQL Replication Health is NOT OK!

在老的master执行如下命令：
mysql>reset slave
然后查看目前新的master状态
mysql> show master status;
+------------------+----------+--------------+------------------+
| File             | Position | Binlog_Do_DB | Binlog_Ignore_DB |
+------------------+----------+--------------+------------------+
| mysql-bin.000001 |      245 |              |                  |
+------------------+----------+--------------+------------------+
1 row in set (0.00 sec)
，找到binlog日志信息和pos id，然后在老master上执行如下命令：

mysql> reset slave;
Query OK, 0 rows affected (0.03 sec)

mysql> change master to \
    -> master_host='172.16.80.127',\
    -> master_user='martin',\
    -> master_password='123456',\
    -> master_log_file='mysql-bin.000001',\
    -> master_log_pos=245;
Query OK, 0 rows affected (0.09 sec)

mysql> start slave;
Query OK, 0 rows affected (0.00 sec)

在老的主节点上面
mysql> grant all on *.* to root@'centos02' identified by 123456;
管理节点启动manage进程

[root@ansible ~]# nohup masterha_manager --conf=/etc/mha/app1.cnf  --remove_dead_master_conf--ignore_last_failover< /dev/null > /var/log/masterha/app1/manager.log 2>&1 &

[root@ansible ~]# masterha_check_status --conf=/etc/mha/app1.cnf 
app1 (pid:62074) is running(0:PING_OK), master:172.16.80.127

[root@ansible ~]# masterha_check_repl --conf=/etc/mha/app1.cnf
Fri Aug 12 14:22:29 2016 - [info] Reading default configuration from /etc/masterha_default.cnf..
Fri Aug 12 14:22:29 2016 - [info] Reading application default configuration from /etc/mha/app1.cnf..
Fri Aug 12 14:22:29 2016 - [info] Reading server configuration from /etc/mha/app1.cnf..
Fri Aug 12 14:22:29 2016 - [info] MHA::MasterMonitor version 0.56.
Fri Aug 12 14:22:30 2016 - [info] GTID failover mode = 0
Fri Aug 12 14:22:30 2016 - [info] Dead Servers:
Fri Aug 12 14:22:30 2016 - [info] Alive Servers:
Fri Aug 12 14:22:30 2016 - [info]   172.16.80.117(172.16.80.117:3306)
Fri Aug 12 14:22:30 2016 - [info]   172.16.80.127(172.16.80.127:3306)
Fri Aug 12 14:22:30 2016 - [info]   172.16.80.128(172.16.80.128:3306)
Fri Aug 12 14:22:30 2016 - [info] Alive Slaves:
Fri Aug 12 14:22:30 2016 - [info]   172.16.80.117(172.16.80.117:3306)  Version=5.5.49-log (oldest major version between slaves) log-bin:enabled
Fri Aug 12 14:22:30 2016 - [info]     Replicating from 172.16.80.127(172.16.80.127:3306)
Fri Aug 12 14:22:30 2016 - [info]     Primary candidate for the new Master (candidate_master is set)
Fri Aug 12 14:22:30 2016 - [info]   172.16.80.128(172.16.80.128:3306)  Version=5.5.49-log (oldest major version between slaves) log-bin:enabled
Fri Aug 12 14:22:30 2016 - [info]     Replicating from 172.16.80.127(172.16.80.127:3306)
Fri Aug 12 14:22:30 2016 - [info]     Not candidate for the new Master (no_master is set)
Fri Aug 12 14:22:30 2016 - [info] Current Alive Master: 172.16.80.127(172.16.80.127:3306)
Fri Aug 12 14:22:30 2016 - [info] Checking slave configurations..
Fri Aug 12 14:22:30 2016 - [warning]  relay_log_purge=0 is not set on slave 172.16.80.117(172.16.80.117:3306).
Fri Aug 12 14:22:30 2016 - [warning]  relay_log_purge=0 is not set on slave 172.16.80.128(172.16.80.128:3306).
Fri Aug 12 14:22:30 2016 - [info] Checking replication filtering settings..
Fri Aug 12 14:22:30 2016 - [info]  binlog_do_db= , binlog_ignore_db= 
Fri Aug 12 14:22:30 2016 - [info]  Replication filtering check ok.
Fri Aug 12 14:22:30 2016 - [info] GTID (with auto-pos) is not supported
Fri Aug 12 14:22:30 2016 - [info] Starting SSH connection tests..
Fri Aug 12 14:22:36 2016 - [info] All SSH connection tests passed successfully.
Fri Aug 12 14:22:36 2016 - [info] Checking MHA Node version..
Fri Aug 12 14:22:37 2016 - [info]  Version check ok.
Fri Aug 12 14:22:37 2016 - [info] Checking SSH publickey authentication settings on the current master..
Fri Aug 12 14:22:37 2016 - [info] HealthCheck: SSH to 172.16.80.127 is reachable.
Fri Aug 12 14:22:38 2016 - [info] Master MHA Node version is 0.56.
Fri Aug 12 14:22:38 2016 - [info] Checking recovery script configurations on 172.16.80.127(172.16.80.127:3306)..
Fri Aug 12 14:22:38 2016 - [info]   Executing command: save_binary_logs --command=test --start_pos=4 --binlog_dir=/application/mysql/data --output_file=/var/tmp/save_binary_logs_test --manager_version=0.56 --start_file=mysql-bin.000001 
Fri Aug 12 14:22:38 2016 - [info]   Connecting to [email protected](172.16.80.127:22).. 
  Creating /var/tmp if not exists..    ok.
  Checking output directory is accessible or not..
   ok.
  Binlog found at /application/mysql/data, up to mysql-bin.000001
Fri Aug 12 14:22:38 2016 - [info] Binlog setting check done.
Fri Aug 12 14:22:38 2016 - [info] Checking SSH publickey authentication and checking recovery script configurations on all alive slave servers..
Fri Aug 12 14:22:38 2016 - [info]   Executing command : apply_diff_relay_logs --command=test --slave_user='root' --slave_host=172.16.80.117 --slave_ip=172.16.80.117 --slave_port=3306 --workdir=/var/tmp --target_version=5.5.49-log --manager_version=0.56 --relay_log_info=/application/mysql/data/relay-log.info  --relay_dir=/application/mysql/data/  --slave_pass=xxx
Fri Aug 12 14:22:38 2016 - [info]   Connecting to [email protected](172.16.80.117:22).. 
  Checking slave recovery environment settings..
    Opening /application/mysql/data/relay-log.info ... ok.
    Relay log found at /application/mysql/data, up to mysql-relay-bin.000002
    Temporary relay log file is /application/mysql/data/mysql-relay-bin.000002
    Testing mysql connection and privileges.. done.
    Testing mysqlbinlog output.. done.
    Cleaning up test file(s).. done.
Fri Aug 12 14:22:39 2016 - [info]   Executing command : apply_diff_relay_logs --command=test --slave_user='root' --slave_host=172.16.80.128 --slave_ip=172.16.80.128 --slave_port=3306 --workdir=/var/tmp --target_version=5.5.49-log --manager_version=0.56 --relay_log_info=/application/mysql/data/relay-log.info  --relay_dir=/application/mysql/data/  --slave_pass=xxx
Fri Aug 12 14:22:39 2016 - [info]   Connecting to [email protected](172.16.80.128:22).. 
  Checking slave recovery environment settings..
    Opening /application/mysql/data/relay-log.info ... ok.
    Relay log found at /application/mysql/data, up to mysql-relay-bin.000002
    Temporary relay log file is /application/mysql/data/mysql-relay-bin.000002
    Testing mysql connection and privileges.. done.
    Testing mysqlbinlog output.. done.
    Cleaning up test file(s).. done.
Fri Aug 12 14:22:39 2016 - [info] Slaves settings check done.
Fri Aug 12 14:22:39 2016 - [info] 
172.16.80.127(172.16.80.127:3306) (current master)
 +--172.16.80.117(172.16.80.117:3306)
 +--172.16.80.128(172.16.80.128:3306)

Fri Aug 12 14:22:39 2016 - [info] Checking replication health on 172.16.80.117..
Fri Aug 12 14:22:39 2016 - [info]  ok.
Fri Aug 12 14:22:39 2016 - [info] Checking replication health on 172.16.80.128..
Fri Aug 12 14:22:39 2016 - [info]  ok.
Fri Aug 12 14:22:39 2016 - [info] Checking master_ip_failover_script status:
Fri Aug 12 14:22:39 2016 - [info]   /etc/mha/scripts/master_ip_failover --command=status --ssh_user=root --orig_master_host=172.16.80.127 --orig_master_ip=172.16.80.127 --orig_master_port=3306 


IN SCRIPT TEST====/sbin/ifconfig eth0:1 down==/sbin/ifconfig eth0:1 172.16.80.200/24===

Checking the Status of the script.. OK 
Fri Aug 12 14:22:40 2016 - [info]  OK.
Fri Aug 12 14:22:40 2016 - [warning] shutdown_script is not defined.
Fri Aug 12 14:22:40 2016 - [info] Got exit code 0 (Not master dead).

MySQL Replication Health is OK.

你可能感兴趣的:(mysql,主从复制,数据库)

25年大数据开发省赛样题第一套，离线数据处理答案 Tometor 大数据 spark scala
省赛样题一，数据抽取模块这一模块的作用是从mysql抽取数据到ods层进行指标计算，在题目中要求进行全量抽取，并新增etl-date字段进行分区，日期为比赛前一天importorg.apache.spark.sql.SparkSessionimportjava.util.PropertiesobjectTask1{defmain(args:Array[String]):Unit={valspark
Mybatis的基本使用学c真好玩 mybatis
MyBatis简介MyBatis用于持久层框架,持久层是对数据库操作的部分，前版本iBatis由Apache软件基金组织进行更名并维护。特点:简化数据库的操作SQL映射灵活(半ORM框架)支持高级映射易于集成维护配置动态SQL缓存机制功能：替代JDBC,JDBC是java中提供的用于操作数据库的技术及方案数据库的连接控制难。连接池SQL语句硬编码。将sql语句存放到xml配置文件中参数传递问题。提
html5使用本地sqlite数据库小祁爱编程 sqlite html5 big data
html5使用本地sqlite数据库本地数据库概述在HTML5中，大大丰富了客户端本地可以存储的内容，添加了很多功能将原本必须要保存在服务器上的数据转为保存在客户端本地，从而大大提高了Web应用程序性能，减轻了服务器的负担，使用Web时代重新回到了“客户端为重、服务器端为轻”的时代。HTML5中内置了两种本地数据库，一种是SQLite,一种是indexedDBSQLite数据库使用操作本地数据库的
VSCode 2025最新后端开发必备插件汇总（必备插件合集，Python、Java、Go等语言） Code_流苏实用软件与高效工具 vscode python java 后端开发必备插件合集
前言:作为微软推出的轻量级跨平台编辑器，VSCode凭借智能代码补全、远程开发、Git集成等核心功能，已成为后端开发者首选工具。其强大的插件生态更是覆盖了主流后端语言支持、代码质量优化、性能分析等全场景需求。名人说：博观而约取，厚积而薄发。——苏轼《稼说送张琥》创作者：Code_流苏(CSDN)（一个喜欢古诗词和编程的Coder）目录一、语言支持类插件二、代码质量和格式化工具三、数据库工具四、AP
MySQL 事务的隔离级别重生之我在成电转码 java mysql 事务
MySQL事务的隔离级别定义了多个事务并发执行时，如何防止相互影响。隔离级别越高，数据一致性越强，但并发性能可能降低。四种事务隔离级别MySQL提供4种事务隔离级别（从低到高）：隔离级别脏读（DirtyRead）不可重复读（Non-repeatableRead）幻读（PhantomRead）1.读未提交（ReadUncommitted）❌可能发生❌可能发生❌可能发生2.读已提交（ReadCommi
主流架构模式全景解析：微服务 vs SOA vs 单体架构的终极抉择指南 Eqwaak00 分布式系统设计实战科技微服务架构
一、架构演进史：从巨石到微粒的进化之路（图示：1970s单体→2000sSOA→2010s微服务→2020s云原生）二、三大架构模式深度拆解2.1单体架构（MonolithicArchitecture）核心特征graphTDA[单体应用]-->B[用户界面]A-->C[业务逻辑]A-->D[数据访问]B-->E[Web/移动端]C-->F[订单处理]C-->G[支付处理]D-->H[MySQL]D
MySQL主从复制架构原理及部署（work）只想按时下班 Mysql mysql 数据库 memcached
文章目录一、原理1、什么是MySQL主从复制2、MySQL主从复制应用场景3、MySQL主从复制架构及原理4、MySQLbinlog日志三种模式二、主从复制配置搭建1、MySQL8二进制安装2、主从复制配置3、测试主从复制三、二进制日志管理说明四、MySQL主从复制常见问题1、从库binlog落后主库binlog？2、主库update，从库迟迟没有更新3、主从复制延时配置（从库配置）4、主从复制故
Mysql 主从复制架构百里自来卷 mysql 架构数据库
MySQL主从复制（Master-SlaveReplication）是一种常见的数据库架构，广泛用于提高数据库的可扩展性、读写分离以及数据备份和容灾恢复。主从复制架构中，一个MySQL实例作为主库（Master），负责处理所有的写操作，而一个或多个从库（Slave）从主库复制数据，并负责处理读操作。主库（Master）：主库负责处理数据库的所有写操作（如INSERT、UPDATE和DELETE），
mysqldump踩坑！！！忽略Warning 导致主备不同步喝醉酒的小白 MySQL 数据库 mysql 服务器
Warning:ApartialdumpfromaserverthathasGTIDswillbydefaultincludetheGTIDsofalltransactions,eventhosethatchangedsuppressedpartsofthedatabase.Ifyoudon’twanttorestoreGTIDs,pass--set-gtid-purged=OFF.Tomakea
ERROR 2061 (HY000): Authentication plugin ‘caching_sha2_password‘ reported error: Authentication 喝醉酒的小白 MySQL mysql java 数据库
错误信息“ERROR2061(HY000):Authenticationplugin‘caching_sha2_password’reportederror:Authenticationrequiressecureconnection.”表示MySQL数据库配置了caching_sha2_password认证插件，并要求使用安全连接来进行身份验证。该错误通常出现在以下情况下：使用的MySQL客户端
如何进行OceanBase 运维工具的部署和表性能优化 oceanbase
随着OceanBase数据库应用的日益深入，数据量不断攀升，单个表中存储数百万乃至数千万条数据的情况变得愈发普遍。因此，部署专门的运维工具、实施针对性的表性能优化策略，以及加强指标监测工作，都变得更为重要。以下为基于我们的使用场景，所采取的一些部署和优化措施分享。一、OCP部署升级1．OCP升级（1）4.2.1BP1升级到4.2.2，本来以为毫无波澜但是下载完毕一键包并完成前期准备工作启动后发现无
2025年2月中国数据库排行榜：OceanBase迎来开门红，金仓、GBASE排名节节高
2025年2月，中国数据库流行度排行榜正式发布。在春节之际，DeepSeek凭借突破性的技术成功出圈，而在此前，各大数据库厂商便已开始探索AI与数据库的深度融合，并陆续推出了相关产品和功能。相信在这股技术革新的浪潮下，将涌现越来越多的新产品和解决方案。接下来，我们将逐一盘点各大数据库的最新动态，探索未来的潜力与挑战。一、金仓、GBASE排名再攀升，TDSQL升第九与上月相比，榜单前十的位次出现了细
Mysql高频八股——SQL语句的执行过程钢板兽高频八股 mysql sql 数据库面试后端
大家好，我是钢板兽！今天这篇文章本来想把SQL语句的执行过程和事务与undolog、redolog的联系放在一起写的。SQL语句的执行过程中会涉及到undolog、redolog，而undolog、redolog更深入的原理也是面试中经常会问到的，所以把它们放在一起再合适不过了，但是写着写着发现内容太多，于是拆成了两篇。这篇文章会带你理解SQL语句的执行过程，在探究SQL语句的执行过程前，我们要先
【MySQL基础-3】SQL语言详解：定义、分类、注意事项与注释 AllenBright #MySQL mysql sql
SQL（StructuredQueryLanguage，结构化查询语言）是用于管理和操作关系型数据库的标准编程语言。无论是查询数据、插入新记录、更新数据还是删除数据，SQL都是与数据库交互的核心工具。本文将深入探讨SQL语言的定义、分类、注意事项以及注释的使用，帮助你全面掌握这一强大的数据库操作语言。1.什么是SQL语言？SQL是一种专门用于管理关系型数据库的编程语言。它允许用户执行以下操作：查询
【赵渝强老师】达梦数据库的目录结构数据库关系型数据库
达梦数据库安装成功后，通过使用Linux的tree命令可以非常方便地查看DM8的目录结构。tree-L1-d/home/dmdba/dmdbms#输出的信息如下：/home/dmdba/dmdbms├──bin存放DM数据库的可执行文件，例如disql命令等。├──bin2├──data数据库实例目录，该目录存放各个实例的文件。├──desktop存放DM数据库各个工具的桌面图标。├──doc存放
MySQL Buffer Pool、Undo Log、脏页详解学堂在线 Mysql 数据库 mysql 数据库
文章目录1.BufferPool2.UndoLog3.脏页（DirtyPage）三者的协同工作常见问题总结MySQL中的BufferPool、UndoLog和脏页是InnoDB存储引擎的核心组件，共同保障了事务处理的高效性、一致性与持久性。以下是它们的详细解释及关联：1.BufferPool作用：BufferPool是InnoDB的内存缓存区域，用于缓存数据页和索引页，减少直接访问磁盘的开销，显著
【MYSQL学习】MySQL索引：删除索引的5大绝招你GET到了吗？墨瑾轩 MySql入门~精通 mysql 学习数据库
关注墨瑾轩，带你探索编程的奥秘！超萌技术攻略，轻松晋级编程高手技术宝库已备好，就等你来挖掘订阅墨瑾轩，智趣学习不孤单即刻启航，编程之旅更有趣MySQL索引：删除索引的5大绝招你GET到了吗？引言❓在数据库操作中，索引是一个非常重要的概念。合理的索引设计可以显著提高查询性能，而不合理的索引则可能导致性能下降。但你知道如何有效地删除索引吗？今天，我们就来一场深入浅出的探索之旅，带你了解删除索引的5大绝
Mysql-InnoDB索引：普通索引、主键索引、唯一索引、组合索引豪大大ya mysql 数据库 java
InnoDB和MyISAM的区别事务方面InnoDB支持事务，MyISAM不支持事务。这是Mysql将默认存储引擎从MyISAM变成InnoDB的重要原因之一外键方面InnoDB支持外键，而MyISAM不支持。对一个包含外键的InnoDB表转为MyISAM会失败索引层面InnoDB是聚集（聚簇）索引，MyISAM是非聚集（非聚簇）索引。MyISAM支持FULLTEXT类型的全文索引。InnoDB不
3-002： MySQL 中使用索引一定有效吗？如何排查索引效果？盖盖衍上_染染熊_代码集 00-刷题 mysql 数据库
1.索引失效的常见原因虽然索引可以加速查询，但在某些情况下，MySQL可能不会使用索引，甚至使用索引反而更慢。以下是一些常见导致索引失效的原因：①查询条件使用了!=或30时仍能利用索引。2.如何排查索引效果？可以使用EXPLAIN命令分析SQL是否走索引，以及索引的效率。①使用EXPLAIN分析SQL执行计划EXPLAINSELECT*FROMusersWHEREage=30;返回示例：idsel
4-002：如何使用 MySQL 的 EXPLAIN 语句进行查询分析？盖盖衍上_染染熊_代码集 00-刷题 mysql 数据库
EXPLAIN是MySQL中用于分析查询性能的工具，能够帮助你理解查询的执行计划。通过EXPLAIN，你可以查看MySQL如何执行查询，包括使用的索引、表连接顺序等信息。基本用法在查询前加上EXPLAIN即可：EXPLAINSELECT*FROMyour_tableWHEREyour_column='value';输出字段说明EXPLAIN的输出包含多个字段，以下是主要字段及其含义：id:查询标识
【QT教程】QT6硬件数据库编程 QT硬件数据库 QT性能优化QT原理源码QT界面美化 qt qt6.3 qt5 c++QT教程
QT6硬件数据库编程使用AI技术辅助生成QT界面美化视频课程QT性能优化视频课程QT原理与源码分析视频课程QTQMLC++扩展开发视频课程免费QT视频课程您可以看免费1000+个QT技术视频免费QT视频课程QT统计图和QT数据可视化视频免费看免费QT视频课程QT性能优化视频免费看免费QT视频课程QT界面美化视频免费看1QT6硬件数据库编程基础1.1QT6数据库引擎概述1.1.1QT6数据库引擎概述
MySQL 的索引数量是否越多越好 Zero_pl Mysql基础知识面试题 mysql 数据库
MySQL的索引并不是越多越好，索引数量需要根据查询需求合理设置。虽然索引可以提高查询效率，但过多的索引也会带来额外的开销，影响数据库的性能。✅索引的优点提高查询速度索引类似于书籍的目录，可以快速查找数据，减少查询时间。如SELECT*FROMusersWHEREemail='[email protected]';，如果email字段有索引，MySQL可以直接找到匹配数据，否则需要全表扫描。加速排序（
4-001：MySQL 中的索引数量是否越多越好？为什么？盖盖衍上_染染熊_代码集 00-刷题 mysql 数据库
MySQL中的索引并不是越多越好，索引数量要合理控制！过多索引的影响增加存储开销每个索引都会占用额外的磁盘空间，索引多了，存储成本增加。降低INSERT、UPDATE、DELETE性能任何涉及数据修改的操作，都需要同时更新索引，影响性能。示例：INSERTINTOusers(id,name)VALUES(1,'Tom');，如果users表有多个索引，则插入时每个索引都需要更新，影响插入速度。可能
MyBatis底层原理深度解析：动态代理与注解如何实现ORM映射 rider189 java 开发语言 mybatis
一、引言MyBatis作为一款优秀的ORM框架，其核心设计思想是通过动态代理和注解将接口方法与SQL操作解耦。开发者只需定义Mapper接口并添加注解，便能实现数据库操作，这背后隐藏着精妙的动态代理机制与源码设计。本文将从源码层解析MyBatis如何实现这一过程。二、动态代理机制：从接口到实现类关键点：MyBatis通过JDK动态代理为Mapper接口生成代理对象，拦截所有方法调用，将其路由到SQ
SQLMesh 系列教程：解锁SQLMesh的宏与变量魔法梦想画家 #python 数据分析工程 sqlmesh 数据工程分析工程
在数据库流水线开发中，代码复用与动态配置是提升效率的核心诉求。SQLMesh以其独特的宏系统与用户定义变量机制，重新定义了SQL生成的灵活性。与传统模板引擎不同，SQLMesh的宏并非简单的字符串替换，而是基于语义理解的智能代码重构——通过sqlglot库解析SQL结构，结合Python逻辑处理能力，让用户能够以声明式语法实现复杂查询的动态组装。引言无论是全局配置、网关级参数还是模型内局部变量，S
从零实现OSS阿里云图片上传：前端采用的vue3+element-plus，后端采用javaspingboot，实现上传图片到云，然后存储数据库链接能够回显的效果绝顶少年阿里云前端数据库
后端（JavaSpringBoot）1.添加依赖在pom.xml中添加必要的依赖，包括阿里云OSSSDK、SpringBootWeb、MyBatis-Plus等：org.springframework.bootspring-boot-starter-webcom.baomidoumybatis-plus-boot-starter3.4.3.4com.aliyun.ossaliyun-sdk-oss
Redis五种用途 egekm_sefg 面试学习路线阿里巴巴 redis 数据库缓存
简介Redis是一个高性能的key-value数据库。Redis与其他key-value缓存产品有以下三个特点：-Redis支持数据的持久化，可以将内存中的数据保存在磁盘中，重启的时候可以再次加载进行使用。-Redis不仅仅支持简单的key-value类型的数据，同时还提供list，set，zset，hash等数据结构的存储。-Redis支持数据的备份，即master-slave模式的数据备份。五
软件架构师--Redis常见问题一蓑烟雨*任平生软件架构师 redis 数据库缓存
一、缓存雪崩产生原因：大部分缓存失效—>数据库崩溃解决方案1.使用锁或队列保证不会有大量的线程对数据库一次性进行读写，从而避免失效时大量的并发请求落到底层存储系统上（对数据库限流）。2.为key设置不同的缓存失效时间在固定的一个缓存时间的基础上+随机一个时间作为缓存失效时间，避免大量数据同时失效。3.二级缓存设置一个有时间限制的缓存+一个无时间限制的缓存，避免大规模访问数据库。二、缓存穿透产生原因
软件架构师--数据库系统一蓑烟雨*任平生软件架构师数据库 1024程序员节
一、分布式数据库1.分片透明性分片透明性：分不分片，用户感受不到（不关心如何分片存储）。位置透明性：数据存放在哪里，用户不用管（用户无需知道数据存放的物理位置）复制透明性：不关心结点的复制情况。局部数据模型透明性（逻辑透明）：用户或应用程序无需知道局部场地使用的是哪种数据模型。2.两阶段提交协议2PC2PC事务提交的两个阶段①表决阶段，目的是形成一个共同的决定②执行阶段，目的是实现这个协调者的决定
ES 使用geo point 查询离目标地址最近的数据 DavidSoCool elasticsearch Mysql elasticsearch 搜索引擎 mysql
需求描述：项目中需要通过经纬度坐标查询目标地所在的行政区。解决思路大致有种，使用es和mysql分别查询。1、使用es进行查询将带有经纬度坐标的省市区数据存入es中，mappings字段使用geopoint类型，索引及查询dsl如下。geopoint文档地址：Geo-distancequery|ElasticsearchGuide[8.6]|ElasticSortsearchresults|Ela
对于规范和实现，你会混淆吗？ yangshangchuan HotSpot
昨晚和朋友聊天，喝了点咖啡，由于我经常喝茶，很长时间没喝咖啡了，所以失眠了，于是起床读JVM规范，读完后在朋友圈发了一条信息： JVM Run-Time Data Areas：The Java Virtual Machine defines various run-time data areas that are used during execution of a program. So
android 网络百合不是茶网络
android的网络编程和java的一样没什么好分析的都是一些死的照着写就可以了,所以记录下来方便查找 , 服务器使用的是TomCat 服务器代码; servlet的使用需要在xml中注册 package servlet; import java.io.IOException; import java.util.Arr
[读书笔记]读法拉第传 comsci 读书笔记
1831年的时候,一年可以赚到1000英镑的人..应该很少的... 要成为一个科学家,没有足够的资金支持,很多实验都无法完成但是当钱赚够了以后....就不能够一直在商业和市场中徘徊......
随机数的产生沐刃青蛟随机数
c++中阐述随机数的方法有两种：一是产生假随机数（不管操作多少次，所产生的数都不会改变）这类随机数是使用了默认的种子值产生的，所以每次都是一样的。 //默认种子 for (int i = 0; i < 5; i++) { cout<<
PHP检测函数所在的文件名 IT独行者 PHP 函数
很简单的功能，用到PHP中的反射机制，具体使用的是ReflectionFunction类，可以获取指定函数所在PHP脚本中的具体位置。创建引用脚本。代码： [php] view plain copy // Filename: functions.php <?php&nbs
银行各系统功能简介文强chu 金融
银行各系统功能简介　业务系统核心业务系统业务功能包括：总账管理、卡系统管理、客户信息管理、额度控管、存款、贷款、资金业务、国际结算、支付结算、对外接口等清分清算系统以清算日期为准，将账务类交易、非账务类交易的手续费、代理费、网络服务费等相关费用，按费用类型计算应收、应付金额，经过清算人员确认后上送核心系统完成结算的过程国际结算系
Python学习1(pip django 安装以及第一个project) 小桔子 python django pip
最近开始学习python,要安装个pip的工具。听说这个工具很强大，安装了它，在安装第三方工具的话so easy!然后也下载了，按照别人给的教程开始安装，奶奶的怎么也安装不上！第一步：官方下载pip-1.5.6.tar.gz, https://pypi.python.org/pypi/pip easy! 第二部：解压这个压缩文件，会看到一个setup.p
php 数组 aichenglong PHP 排序数组循环多维数组
1 php中的创建数组 $product = array('tires','oil','spark');//array()实际上是语言结构而不是函数 2 如果需要创建一个升序的排列的数字保存在一个数组中，可以使用range()函数来自动创建数组 $numbers=range(1,10)//1 2 3 4 5 6 7 8 9 10 $numbers=range(1,10,
安装python2.7 AILIKES python
安装python2.7 1、下载可从 http://www.python.org/进行下载#wget https://www.python.org/ftp/python/2.7.10/Python-2.7.10.tgz 2、复制解压 #mkdir -p /opt/usr/python #cp /opt/soft/Python-2
java异常的处理探讨百合不是茶 JAVA异常
//java异常 /* 1，了解java 中的异常处理机制，有三种操作 a,声明异常 b,抛出异常 c,捕获异常 2，学会使用try-catch-finally来处理异常 3，学会如何声明异常和抛出异常 4，学会创建自己的异常 */ //2，学会使用try-catch-finally来处理异常
getElementsByName实例 bijian1013 element
实例1： <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd"> <html xmlns="http://www.w3.org/1999/x
探索JUnit4扩展：Runner bijian1013 java 单元测试 JUnit
参加敏捷培训时，教练提到Junit4的Runner和Rule，于是特上网查一下，发现很多都讲的太理论，或者是举的例子实在是太牵强。多搜索了几下，搜索到两篇我觉得写的非常好的文章。文章地址：http://www.blogjava.net/jiangshachina/archive/20
[MongoDB学习笔记二]MongoDB副本集 bit1129 mongodb
1. 副本集的特性 1)一台主服务器(Primary),多台从服务器(Secondary) 2)Primary挂了之后，从服务器自动完成从它们之中选举一台服务器作为主服务器，继续工作，这就解决了单点故障，因此，在这种情况下，MongoDB集群能够继续工作 3)挂了的主服务器恢复到集群中只能以Secondary服务器的角色加入进来 2
【Spark八十一】Hive in the spark assembly bit1129 assembly
Spark SQL supports most commonly used features of HiveQL. However, different HiveQL statements are executed in different manners: 1. DDL statements (e.g. CREATE TABLE, DROP TABLE, etc.)
Nginx问题定位之监控进程异常退出 ronin47
nginx在运行过程中是否稳定，是否有异常退出过？这里总结几项平时会用到的小技巧。 1. 在error.log中查看是否有signal项，如果有，看看signal是多少。比如，这是一个异常退出的情况： $grep signal error.log 2012/12/24 16:39:56 [alert] 13661#0: worker process 13666 exited on s
No grammar constraints (DTD or XML schema).....两种解决方法 byalias xml
方法一：常用方法关闭XML验证工具栏：windows => preferences => xml => xml files => validation => Indicate when no grammar is specified:选择Ignore即可。方法二：（个人推荐）添加内容如下 <?xml version=
Netty源码学习-DefaultChannelPipeline bylijinnan netty
package com.ljn.channel; /** * ChannelPipeline采用的是Intercepting Filter 模式 * 但由于用到两个双向链表和内部类，这个模式看起来不是那么明显，需要仔细查看调用过程才发现 * * 下面对ChannelPipeline作一个模拟，只模拟关键代码： */ public class Pipeline {
MYSQL数据库常用备份及恢复语句 chicony mysql
备份MySQL数据库的命令，可以加选不同的参数选项来实现不同格式的要求。 mysqldump -h主机 -u用户名 -p密码数据库名 > 文件备份MySQL数据库为带删除表的格式，能够让该备份覆盖已有数据库而不需要手动删除原有数据库。 mysqldump -–add-drop-table -uusername -ppassword databasename > ba
小白谈谈云计算--基于Google三大论文 CrazyMizzz Google 云计算 GFS
之前在没有接触到云计算之前，只是对云计算有一点点模糊的概念，觉得这是一个很高大上的东西，似乎离我们大一的还很远。后来有机会上了一节云计算的普及课程吧，并且在之前的一周里拜读了谷歌三大论文。不敢说理解，至少囫囵吞枣啃下了一大堆看不明白的理论。现在就简单聊聊我对于云计算的了解。我先说说GFS &n
hadoop 平衡空间设置方法 daizj hadoop balancer
在hdfs-site.xml中增加设置balance的带宽，默认只有1M： <property> <name>dfs.balance.bandwidthPerSec</name> <value>10485760</value> <description&g
Eclipse程序员要掌握的常用快捷键 dcj3sjt126com 编程
判断一个人的编程水平，就看他用键盘多，还是鼠标多。用键盘一是为了输入代码（当然了，也包括注释），再有就是熟练使用快捷键。曾有人在豆瓣评《卓有成效的程序员》：“人有多大懒，才有多大闲”。之前我整理了一个程序员图书列表，目的也就是通过读书，让程序员变懒。程序员作为特殊的群体，有的人可以这么懒，懒到事情都交给机器去做，而有的人又可以那么勤奋，每天都孜孜不倦得
Android学习之路 dcj3sjt126com Android学习
转自：http://blog.csdn.net/ryantang03/article/details/6901459 以前有J2EE基础，接触JAVA也有两三年的时间了，上手Android并不困难，思维上稍微转变一下就可以很快适应。以前做的都是WEB项目，现今体验移动终端项目，让我越来越觉得移动互联网应用是未来的主宰。下面说说我学习Android的感受，我学Android首先是看MARS的视
java 遍历Map的四种方法 eksliang java HashMap java 遍历Map的四种方法
转载请出自出处： http://eksliang.iteye.com/blog/2059996 package com.ickes; import java.util.HashMap; import java.util.Iterator; import java.util.Map; import java.util.Map.Entry; /** * 遍历Map的四种方式
【精典】数据库相关相关 gengzg 数据库
package C3P0; import java.sql.Connection; import java.sql.SQLException; import java.beans.PropertyVetoException; import com.mchange.v2.c3p0.ComboPooledDataSource; public class DBPool{
自动补全 huyana_town 自动补全
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd"><html xmlns="http://www.w3.org/1999/xhtml&quo
jquery在线预览PDF文件，打开PDF文件天梯梦 jquery
最主要的是使用到了一个jquery的插件jquery.media.js，使用这个插件就很容易实现了。核心代码 <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.
ViewPager刷新单个页面的方法 lovelease android viewpager tag 刷新
使用ViewPager做滑动切换图片的效果时，如果图片是从网络下载的，那么再子线程中下载完图片时我们会使用handler通知UI线程，然后UI线程就可以调用mViewPager.getAdapter().notifyDataSetChanged()进行页面的刷新，但是viewpager不同于listview，你会发现单纯的调用notifyDataSetChanged()并不能刷新页面
利用按位取反（~）从复合枚举值里清除枚举值草料场 enum
以 C# 中的 System.Drawing.FontStyle 为例。如果需要同时有多种效果，如：“粗体”和“下划线”的效果，可以用按位或（|） FontStyle style = FontStyle.Bold | FontStyle.Underline; 如果需要去除 style 里的某一种效果，
Linux系统新手学习的11点建议刘星宇编程工作 linux 脚本
　　随着Linux应用的扩展许多朋友开始接触Linux，根据学习Windwos的经验往往有一些茫然的感觉：不知从何处开始学起。这里介绍学习Linux的一些建议。　　一、从基础开始：常常有些朋友在Linux论坛问一些问题，不过，其中大多数的问题都是很基础的。例如：为什么我使用一个命令的时候，系统告诉我找不到该目录，我要如何限制使用者的权限等问题，这些问题其实都不是很难的，只要了解了 Linu
hibernate dao层应用之HibernateDaoSupport二次封装 wangzhezichuan DAO Hibernate
/** * 方法描述:sql语句查询返回List<Class> * 方法备注: Class 只能是自定义类 * @param calzz * @param sql * @return * 创建人：王川 * 创建时间：Jul