首先遇到的第一个问题,fdisk 格式化lvm的底层盘,导致主机无法启动
解决方法:
1 在进行fdisk之前查看lvdisplay和pvdisplay查看lvm信息,避免格式化错磁盘
fdisk需要谨慎,
2 进入救援模式。编辑fstab文件,取消对错误vg的挂载。重启系统。
使用vsphere自带的web图形管理界面进行图形安装,发现直接使用./runInstaller
在一些包含确认按钮的步骤中,会出现小竖不能点开,无法继续安装。
处理方式如下:
./runInstaller -jreLoc /usr/lib/jvm/java-1.7.0-openjdk-1.7.0.141-2.6.10.5.el7.x86_64/jre
使用系统自带的java程序进行调用安装程序,在使用setup过程中,报java drive错误。只能手工使用脚本配置ssh同步
./sshUserSetup.sh -user grid -hosts "bpptdb01 bpptdb02" -advanced -noPromptPassphrase
但是使用手工ssh同步后,test还是不能通过,发现如果在图形环境中进行ssh等级配置,会出现如下报错:
sign_and_send_pubkey:signing failed :agent refused operation;
处理方式如下:
eval "$(ssh-agent -s)"
ssd-add
在master 节点执行root.sh过程中出现hang现象,
[client(14360)]CRS-2101:The OLR was formatted using version 3.
重新执行root.sh,当再次遇到:adding clusterware entries to inittab时,打开另一个窗口用:
root执行:
dd if=/var/tmp/.oracle/npohasd of=/dev/null bs=1024 count=1
重启主机之后发现只有进程ohasd.bin
解决方法:
[root@bprac01 ~]# cd /var/tmp/.oracle/
[root@bprac01 ~]# rm -rf npohasd
[root@bprac01 ~]# touch npohasd
[root@bprac01~]# chmod 755 npohasd
在对集群修正补丁过程中发现,在执行rootcrs.pl -patch过程中,启动crs失败。重启主机出现多个
ohasd.bin reboot进程。且has启动失败,acfs驱动加载失败。
解决方案:
使用linux strace 命令进行跟踪如下
stace -p 2141
strace -p pid
access("/var/tmp/.oracle/npohasd2", F_OK) = -1 ENOENT (No such file or directory)
access("/var/tmp/.oracle/npohasd2", F_OK) = -1 ENOENT (No such file or directory)
access("/var/tmp/.oracle/npohasd2", F_OK) = -1 ENOENT (No such file or directory)
access("/var/tmp/.oracle/npohasd2", F_OK) = -1 ENOENT (No such file or directory)
access("/var/tmp/.oracle/npohasd2", F_OK) = -1 ENOENT (No such file or directory)
access("/var/tmp/.oracle/npohasd2", F_OK) = -1 ENOENT (No such file or directory)
access("/var/tmp/.oracle/npohasd2", F_OK) = -1 ENOENT (No such file or directory)
access("/var/tmp/.oracle/npohasd2", F_OK) = -1 ENOENT (No such file or directory)
access("/var/tmp/.oracle/npohasd2", F_OK) = -1 ENOENT (No such file or directory)
access("/var/tmp/.oracle/npohasd2", F_OK) = -1 ENOENT (No such file or directory)
access("/var/tmp/.oracle/npohasd2", F_OK) = -1 ENOENT (No such file or directory)
access("/var/tmp/.oracle/npohasd2", F_OK) = -1 ENOENT (No such file or directory)
access("/var/tmp/.oracle/npohasd2", F_OK) = -1 ENOENT (No such file or directory)
access("/var/tmp/.oracle/npohasd2", F_OK) = -1 ENOENT (No such file or directory)
access("/var/tmp/.oracle/npohasd2", F_OK) = -1 ENOENT (No such file or directory)
access("/var/tmp/.oracle/npohasd2", F_OK) = -1 ENOENT (No such file or directory)
access("/var/tmp/.oracle/npohasd2", F_OK) = -1 ENOENT (No such file or directory)
access("/var/tmp/.oracle/npohasd2", F_OK) = -1 ENOENT (No such file or directory)
access("/var/tmp/.oracle/npohasd2", F_OK) = -1 ENOENT (No such file or directory)
access("/var/tmp/.oracle/npohasd2", F_OK) = -1 ENOENT (No such file or directory)
access("/var/tmp/.oracle/npohasd2", F_OK) = -1 ENOENT (No such file or directory)
access("/var/tmp/.oracle/npohasd2", F_OK) = -1 ENOENT (No such file or directory)
会出现在查找/var/tmp/.oracle/npohasd2的信息。
echo "">npohasd2
/bin/dd if=/var/tmp/.oracle/npohasd2 of=/dev/null bs=1024 count=1
想要在主机启动时,执行rc.local内容,无法执行。
处理方案:
chmod 775 rc.local
dbca创建数据库无法识别到节点。显示为空。
处理方法:
vi /u01/app/oraInventory/ContentsXML/inventory.xml
查询文件内容,
11.2.0.4.0
2.1.0.6.0
CRS="true">
————————————————
确保 crs="true”关键字在文件中。