AIX系统下RAC故障之-VIPCA 错误

环境:

操作系统: AIX 5300-09

集群软件: CRS 10.2.0.1

数据库:   Oracle 10.2.0.1

故障现象:

运行完,root.sh 后在node 上运行vipca 出现以下错误

[root@aix209 bin]#./vipca

PRKR-1062 : Failed to find configuration for node aix209

PRKR-1062 : Failed to find configuration for node aix209


解决方法:

1、删除该节点CRS 配置信息,并重新执行root.sh

[root@aix209 install]#./rootdelete.sh

Shutting down Oracle Cluster Ready Services (CRS):

Stopping resources.

Error while stopping resources. Possible cause: CRSD is down.

Stopping CSSD.

Shutting down CSS daemon.

Shutdown request successfully issued.

Shutdown has begun. The daemons should exit soon.

Checking to see if Oracle CRS stack is down...

Oracle CRS stack is not running.

Oracle CRS stack is down now.

Removing script for Oracle Cluster Ready services

Updating ocr file for downgrade

Cleaning up SCR settings in '/etc/oracle/scls_scr'


[root@aix209 install]#/u01/crs_1/root.sh

WARNING: directory '/u01' is not owned by root

Checking to see if Oracle CRS stack is already configured


Setting the permissions on OCR backup directory

Setting up NS directories

Oracle Cluster Registry configuration upgraded successfully

WARNING: directory '/u01' is not owned by root

clscfg: EXISTING configuration version 3 detected.

clscfg: version 3 is 10G Release 2.

Successfully accumulated necessary OCR keys.

Using ports: CSS=49895 CRS=49896 EVMC=49898 and EVMR=49897.

node :

node 1: aix209 aix209-priv aix209

node 2: aix210 aix210-priv aix210

clscfg: Arguments check out successfully.


NO KEYS WERE WRITTEN. Supply -force parameter to override.

-force is destructive and will destroy any previous cluster

configuration.

Oracle Cluster Registry for cluster has already been initialized

Startup will be queued to init within 30 seconds.

Adding daemons to inittab

Adding daemons to inittab

Expecting the CRS daemons to be up within 600 seconds.

CSS is active on these nodes.

       aix209

       aix210

CSS is active on all nodes.

Waiting for the Oracle CRSD and EVMD to start

Oracle CRS stack installed and running under init(1M)

Running vipca(silent) for configuring nodeapps

IP address "aix209-vip" has already been used. Enter an unused IP address.

[root@aix209 install]#

2、重新执行vipca

AIX系统下RAC故障之-VIPCA 错误_第1张图片

vipca 可以执行!


3、但在,执行过程中报以下错误

AIX系统下RAC故障之-VIPCA 错误_第2张图片

忽略该错误后,出现以下问题:

[root@aix209 /]#crs_stat -t

Name           Type           Target    State     Host        

------------------------------------------------------------

ora.aix209.gsd application    ONLINE    ONLINE    aix209      

ora.aix209.ons application    ONLINE    ONLINE    aix209      

ora.aix210.gsd application    ONLINE    ONLINE    aix210      

ora.aix210.ons application    ONLINE    ONLINE    aix210      

ora.aix210.vip application    ONLINE    ONLINE    aix210

在crs 资源里,缺少ora.aix209.vip的资源!


4、手工创建该资源


[u@h W]$ srvctl add nodeapps -n aix209 -o /u01/app/oracle/product/10.2.0/db_1 -A 192.168.8.143/255.255.255.0

CRS-0210: Could not find resource 'ora.aix209.vip'.


[oracle@aix209 bin]$./crs_profile -create ora.aix209.vip -t application -a ora.aix209.vip.src

[oracle@aix209 bin]$ls -l /u01/crs_1/crs/script/

total 0

[oracle@aix209 bin]$ls -l /u01/crs_1/crs/public/

total 16

-rw-rw----    1 oracle   oinstall       3396 Aug  3 2004  action_scr.scr

-rw-r--r--    1 oracle   oinstall        752 Apr  8 16:26 ora.aix209.vip.cap

[oracle@aix209 bin]$cd /u01/crs_1/crs/public/

[oracle@aix209 public]$cp action_scr.scr ../script/ora.aix209.vip.src

可以注册ora.aix209.vip资源:

[root@aix209 /]#crs_register ora.aix209.vip

[root@aix209 /]#srvctl add nodeapps -n aix209 -o /u01/app/oracle/product/10.2.0/db_1 -A 192.168.8.143/255.255.255.0


[root@aix209 /]#crs_stat -t

Name           Type           Target    State     Host        

------------------------------------------------------------

ora.aix209.gsd application    ONLINE    ONLINE    aix209      

ora.aix209.ons application    ONLINE    ONLINE    aix209      

ora.aix209.vip application    OFFLINE   OFFLINE  

ora.aix210.gsd application    ONLINE    ONLINE    aix210      

ora.aix210.ons application    ONLINE    ONLINE    aix210      

ora.aix210.vip application    ONLINE    ONLINE    aix210  

但是启动失败!  

[root@aix209 /]#crs_start -f ora.aix209.vip

Attempting to start `ora.aix209.vip` on member `aix209`

`ora.aix209.vip` on member `aix209` has experienced an unrecoverable failure.

Human intervention required to resume its availability.

CRS-0215: Could not start resource 'ora.aix209.vip'.


五、最终解决方法:

重新清理OCR和VOTE磁盘,重新执行root.sh(在两个节点上)

[root@aix210 /]#dd if=/dev/zero of=/dev/rrac_ocr bs=8192 count=2560

2560+0 records in

2560+0 records out

[root@aix210 /]#dd if=/dev/zero of=/dev/rrac_vote bs=8192 count=2560

2560+0 records in

2560+0 records out

[root@aix210 /]#/u01/crs_1/root.sh

[root@aix209 /]#crs_stat -t

Name           Type           Target    State     Host        

------------------------------------------------------------

ora.aix209.vip application    OFFLINE   OFFLINE        

重新运行vipca:

AIX系统下RAC故障之-VIPCA 错误_第3张图片

[root@aix209 /]#crs_stat -t

Name           Type           Target    State     Host        

------------------------------------------------------------

ora.aix209.gsd application    ONLINE    ONLINE    aix209      

ora.aix209.ons application    ONLINE    ONLINE    aix209      

ora.aix209.vip application    ONLINE    ONLINE    aix209      

ora.aix210.gsd application    ONLINE    ONLINE    aix210      

ora.aix210.ons application    ONLINE    ONLINE    aix210      

ora.aix210.vip application    ONLINE    ONLINE    aix210      

[root@aix209 /]#

@至此,问题解决!