mysql-operator容器化部署mysql8

基础组件容器化

前一段时间容器化了容器的s3和pika。由于已经有开源方案,本次mysql直接以operator容器化。使用的是[presslabs的mysql-operator]https://github.com/presslabs/mysql-operator。

主要特征

  1. presslabs/mysql-operator自动化搭建主从集群。
  2. 使用XtraBackup 对mysql数据进行备份和恢复
  3. 使用orchestrator保证集群高可用,自动发现MySQL的复制拓扑,自动选主。
  4. 可以将数据备份到s3,并支持从s3读取数据恢复集群。

安装

刚提的pr修复myql-operator 的mysql-8分支,需要下载源码进行安装

  1. 下载myql-operator源码
git clone -b mysql-8 https://github.com/presslabs/mysql-operator.git
  1. myql-operator支持mysql8的bug最近刚刚修复,目前不在master分支,也没有正式发布。所以需要自己编译。镜像然后修chart的value.yaml文件
# Default values for mysql-operator.
# This is a YAML-formatted file.
# Declare variables to be passed into your templates.

replicas: 1
image: 自己编译的mysql-operator镜像
sidecarImage: 自己编译的mysql-operator-sidecar镜像
imagePullPolicy: IfNotPresent

imagePullSecrets:
# - name: "image-pull-secret"

# whether or not to install CRDs
installCRDs: true
# in which namespace to watch for resource, leave empty to watch in all namespaces
watchNamespace:

# The operator will install a ServiceMonitor if you have prometheus-operator installed.
serviceMonitor:

  1. 使用本地chart安装mysql operator
[root@rg1-ceph101 /home/zhangzhifei/mysql-operator/charts/mysql-operator]# helm install .
  1. 编辑cluster.yaml
apiVersion: mysql.presslabs.org/v1alpha1
kind: MysqlCluster
metadata:
  name: mysql-cluster-test
spec:
  replicas: 3
  secretName: my-secret
  mysqlVersion: 8.0.20  # 
  volumeSpec:
    persistentVolumeClaim:
      accessModes: ["ReadWriteOnce"]
      resources:
        requests:
          storage: 20Gi
  #mysqlConf:
  #  query-cache-type: "0"
  #  query-cache-size: "0"
  podSpec:
    resources:
      requests:
        cpu: "8"
        memory: 16Gi
      limits:
        cpu: "8"
        memory: 16Gi
  mysqlConf:
    innodb-buffer-pool-size: 6Gi
---
apiVersion: v1
kind: Secret
metadata:
  name: my-secret
type: Opaque
data:
  # root password is required to be specified (root/mypass)
  ROOT_PASSWORD: bXlwYXNz
  ## application credentials that will be created at cluster bootstrap
  # DATABASE:
  # USER:
  # PASSWORD:
  1. 创建mysql集群
kubectl apply -f cluster.yaml

工作流程

operator工作流程
  1. myql cluster cr 创建后operator会创建mysql statefulset,由statefulset负责创建并管理pod。
  2. pod里的init容器负责从健康的节点备份数据,并执行prepare backup。如果是master节点第一次启动不需要备份数据,但是可能是从s3获取备份数据恢复mysql集群。
  3. heartbeat容器负责发送心跳
  4. sidecar容器是一个mysql数据备份和传输的容器,第一个从节点向master pod的sidecar容器发从backup请求,master pod的sidecar容器先执行“xtrabackup --backup”,然后把备份数据传给从pod。
  5. mysql容器启动后,node controller会感知到,然后进行change master等操作。到此集群已经跑起来了
  6. orch controller会将mysql节点加入到orchestrator中管理,并且通过orchestrator服务获取集群的健康状态,更新mysqlcluster资源对象
  7. orchestrator服务负责故障恢复(master 挂了之后,从新选主)。

问题

当前的xtrabackup-8.0可能有些bug,通过--tables-exclude参数过滤掉sys_operator.status表,但是数据仍然有残留导致从节点启动时报如下错误:

2019-07-04T18:07:07.992788Z 1 [Warning] [MY-012351] [InnoDB] Tablespace 2, name 'sys_operator/status', file './sys_operator/status.ibd' is missing!
2019-07-04T18:07:08.563943Z 6 [ERROR] [MY-012592] [InnoDB] Operating system error number 2 in a file operation.
2019-07-04T18:07:08.563966Z 6 [ERROR] [MY-012593] [InnoDB] The error means the system cannot find the path specified.
2019-07-04T18:07:08.563974Z 6 [ERROR] [MY-012216] [InnoDB] Cannot open datafile for read-only: './sys_operator/status.ibd' OS error: 71
2019-07-04T18:07:08.564165Z 6 [ERROR] [MY-000061] [Server] 1812  Tablespace is missing for table `sys_operator`.`status`..

最终通过修改mysql初始化脚本修复这个问题:判断如果不是master节点则先清理status表,然后在创建。(status表是operator检测mysql是否启动成功用的)

你可能感兴趣的:(mysql-operator容器化部署mysql8)