Fixing errors during Ceph cluster installation

1. Error when gathering keys

[root@node1 ~]#  ceph-deploy gatherkeys node1

[node1][DEBUG ] get remote short hostname

[node1][DEBUG ] fetch remote file

[node1][INFO  ] Running command: /usr/bin/ceph --connect-timeout=25 --cluster=ceph --admin-daemon=/var/run/ceph/ceph-mon.node1.asok mon_status

[node1][ERROR ] Not yet quorum for 'node1'

[ceph_deploy.gatherkeys][ERROR ] Failed to connect to host:node1

[ceph_deploy.gatherkeys][INFO  ] Destroy temp directory /tmp/tmpMfWp77

[ceph_deploy][ERROR ] RuntimeError: Failed to connect any mon


Solution: stop and disable firewalld, then re-run the initial monitor deployment:

[root@node1 ~]# systemctl  stop firewalld

[root@node1 ~]# systemctl   disable firewalld

Removed symlink /etc/systemd/system/multi-user.target.wants/firewalld.service.

Removed symlink /etc/systemd/system/dbus-org.fedoraproject.FirewallD1.service.

[root@node1 ~]# ceph-deploy --overwrite-conf  mon create-initial
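
Disabling firewalld entirely is the quickest fix, but the same error can usually be cleared by opening only the Ceph ports. The sketch below assumes that firewalld ships the `ceph-mon` and `ceph` service definitions (monitor port 6789, plus 3300 for msgr2 on newer releases, and the 6800-7300 daemon range) and that `public` is the active zone. The function name `open_ceph_ports` and the `RUN` dry-run variable are illustrative and not part of any Ceph tooling.

```shell
#!/bin/sh
# Sketch: open the Ceph ports in firewalld instead of disabling the firewall.
# Assumption: the "ceph-mon" and "ceph" firewalld services exist on this host.

# Set RUN=echo to print the commands instead of executing them.
RUN="${RUN:-}"

# open_ceph_ports [ZONE]: allow monitor and OSD/MGR traffic in ZONE (default: public).
open_ceph_ports() {
    zone="${1:-public}"
    $RUN firewall-cmd --zone="$zone" --permanent --add-service=ceph-mon
    $RUN firewall-cmd --zone="$zone" --permanent --add-service=ceph
    # Permanent rules only take effect after a reload.
    $RUN firewall-cmd --reload
}
```

Run `open_ceph_ports` on every node, then repeat `ceph-deploy --overwrite-conf mon create-initial`. Setting `RUN=echo` first prints the commands without changing the firewall.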

2. Authentication (keyring) error when running ceph commands

[root@node1 ~]# /bin/ceph --cluster=ceph osd stat --format=json

2019-04-25 09:16:31.404 7f5a501ac700 -1 auth: unable to find a keyring on /etc/ceph/ceph.client.admin.keyring,/etc/ceph/ceph.keyring,/etc/ceph/keyring,/etc/ceph/keyring.bin,: (2) No such file or directory

2019-04-25 09:16:31.404 7f5a501ac700 -1 monclient: ERROR: missing keyring, cannot use cephx for authentication

[errno 2] error connecting to the cluster


Solution: locate the admin keyring and copy it into /etc/ceph/:

[root@node1 ~]# find / -name ceph.client.admin.keyring

/root/ceph.client.admin.keyring

[root@node1 ~]# cp -a  /root/ceph.client.admin.keyring  /etc/ceph/

[root@node1 ~]# chmod +r /etc/ceph/ceph.client.admin.keyring

[root@node1 ~]# /bin/ceph --cluster=ceph osd stat --format=json

{"epoch":7,"num_osds":1,"num_up_osds":1,"num_in_osds":1,"full":false,"nearfull":false,"num_remapped_pgs":0}

[root@node1 ~]# ceph health

HEALTH_WARN no active mgr

[root@node1 ~]#
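
The copy and chmod steps above can be folded into one small helper. This is a minimal sketch: the function name `install_admin_keyring` is hypothetical, and mode 0644 simply mirrors the `chmod +r` used above (a stricter mode may be preferable if only root runs ceph commands).

```shell
#!/bin/sh
# Sketch: place the admin keyring where the ceph CLI looks for it.

# install_admin_keyring SRC [DEST_DIR]
# Copies SRC to DEST_DIR/ceph.client.admin.keyring (DEST_DIR defaults to /etc/ceph).
install_admin_keyring() {
    src="$1"
    dest_dir="${2:-/etc/ceph}"
    if [ ! -f "$src" ]; then
        echo "keyring not found: $src" >&2
        return 1
    fi
    mkdir -p "$dest_dir" || return 1
    # install(1) copies the file and sets its mode in one step.
    install -m 0644 "$src" "$dest_dir/ceph.client.admin.keyring"
}
```

For example, `install_admin_keyring /root/ceph.client.admin.keyring` reproduces the fix shown above. On clusters managed with ceph-deploy, `ceph-deploy admin node1` is another way to push ceph.conf and the admin keyring to /etc/ceph on a node.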


3. ceph health reports no active mgr

[root@node1 ~]# ceph health

HEALTH_WARN no active mgr

[root@node1 ~]#


Solution: create the mgr daemons with ceph-deploy:

[root@node1 ~]# ceph-deploy mgr create node1 node2 node3    # create the mgr daemons

[root@node1 ~]# ceph health

HEALTH_OK

[root@node1 ~]# ceph mgr metadata
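
Since the Luminous release a running ceph-mgr daemon is required, so a freshly deployed cluster shows this warning until one is created. The check can be scripted as below. This is only a sketch: the helper name `needs_mgr` is hypothetical, and it matches the exact warning text shown above, which may differ in other Ceph versions.

```shell
#!/bin/sh
# Sketch: decide from `ceph health` output whether a mgr daemon still has to be created.

# needs_mgr HEALTH_TEXT: succeeds (exit 0) if the text contains "no active mgr".
needs_mgr() {
    case "$1" in
        *"no active mgr"*) return 0 ;;
        *) return 1 ;;
    esac
}

# Example usage on a deploy node (left commented out so the sketch has no side effects):
# if needs_mgr "$(ceph health)"; then
#     ceph-deploy mgr create node1 node2 node3
# fi
```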


4. System fails to boot after installing Ceph (cannot reach a login shell)

Restarting network (Via systemctl): Error getting authority: Error initializing authority: Could not connect: No such file or directory (g-io-error-quark,1)

Welcome to emergency mode! After logging in, type "journalctl -xb" to view

system logs, "systemctl reboot" to reboot, "systemctl default" or ^D to

try again to boot into default mode.

Give root password for maintenance

(or type Control-D to continue):

login incorrect


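Solution:

Comment out the OSD partition entry in /etc/fstab. The OSD partition must not be mounted from fstab, or the system will fail to boot. Alternatively, keep the entry and append the nofail option after defaults, separated by a comma (defaults,nofail).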

[root@node2 ~]# cat /etc/fstab 

#

# /etc/fstab

# Created by anaconda on Wed Apr 24 20:17:43 2019

#

# Accessible filesystems, by reference, are maintained under '/dev/disk'

# See man pages fstab(5), findfs(8), mount(8) and/or blkid(8) for more info

#

UUID=2096e465-9251-4dd7-93c7-a163adba6209    /                       xfs     defaults        0 0

UUID=76236dfd-510d-4d33-9071-3f2ecfeee3f7     /boot                   xfs     defaults        0 0

# UUID=zDLnfa-iAwr-yp1k-C19m-8oOC-iLmS-agJarY   /data            ext4   defaults        0 0     # OSD partition entry commented out; do not mount it from fstab

UUID=e1f86a97-a02f-47aa-bd1e-6aaae4db60f7 swap                     swap    defaults      0 0

[root@node2 ~]# 
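
For the nofail alternative mentioned above, the edit can be previewed with a small awk filter before touching the real file. This is a sketch under stated assumptions: the function name `add_nofail` is hypothetical, the /data mount point is taken from the fstab shown above, and the rewritten line has its column spacing collapsed to single spaces.

```shell
#!/bin/sh
# Sketch: print an fstab with ",nofail" appended to the options of one mount point.

# add_nofail FILE MOUNTPOINT
# Writes the modified fstab to stdout; FILE itself is not changed.
add_nofail() {
    awk -v mp="$2" '
        # Skip comment lines; only touch the entry whose second field is the mount point
        # and whose options do not already contain nofail.
        $0 !~ /^[ \t]*#/ && NF >= 4 && $2 == mp && $4 !~ /(^|,)nofail(,|$)/ {
            $4 = $4 ",nofail"
        }
        { print }
    ' "$1"
}
```

A cautious workflow is `add_nofail /etc/fstab /data > /tmp/fstab.new`, review the result, copy it over /etc/fstab, and run `mount -a` to catch mistakes before rebooting.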


Source: ITPUB blog, http://blog.itpub.net/25854343/viewspace-2642445/. Please credit the source when reposting; otherwise legal liability will be pursued.
