http://www.leadcom.com.cn/weibaozhishi/255.html
URL:
http://www.aixchina.net/blogs/mxin/archive/2006/8/5/1460.aspx
摘要:
IBM HACMP 自从出了5.2 版本后, 到了5.205比较稳定,并已经过充分的测试(见测试篇)和实践证明(已有多个系统成功自动切换)。这里我整理了一下,以比较复杂的双机互备情况为例,其他类似设置可适当简化。
样例为AIX5304+HACMP5205.
第一章 规划篇
第二章 安装篇
1 安装前提
1) 操作系统版本要求:
AIX5.203以上
2) 环境要求
Ø 用户确认
HA要求所有切换需要用到的用户必须一一对应,userid完全相同。
如tlg1d0上oracle用户为orarun,tlg1a0上的orarun必须为切换保留,sid均为209
平时用的oracle用户就改为oraclt
[tlg1a0][root][/]lsuser -a id ALL
root id=0
daemon id=1
bin id=2
sys id=3
adm id=4
uucp id=5
nobody id=-2
lpd id=9
lp id=11
invscout id=6
snapp id=200
ipsec id=201
nuucp id=7
radiusd id=202
ldap id=203
oradev id=204
tuxrun id=205
oraclt id=208
orarun id=209
sshd id=211
bsx1 id=210
[tlg1d0][root][/]>lsuser -a id ALL
root id=0
daemon id=1
bin id=2
sys id=3
adm id=4
uucp id=5
nobody id=-2
lpd id=9
lp id=11
invscout id=6
snapp id=200
ipsec id=201
nuucp id=7
radiusd id=202
ldap id=203
oradev id=204
orarun id=209
tuxrun id=205
oraclt id=208
bsx1 id=210
Ø 文件系统确认
HA要求切换相关的文件系统不能重名,如tlg1d0上oracle软件目录为/ora10msrun,tlg1a0上的/ora10msrun必须为切换保留,改为/ora10msclt
[tlg1d0][root][/]>df -k
Filesystem 1024-blocks Free %Used Iused %Iused Mounted on
/dev/hd4 524288 487820 7% 3276 3% /
/dev/hd2 7077888 1868516 74% 91290 18% /usr
/dev/hd9var 524288 458364 13% 991 1% /var
/dev/hd3 917504 826700 10% 120 1% /tmp
/dev/hd1 655360 524856 20% 291 1% /home
/proc - - - - - /proc
/dev/hd10opt 1179648 589072 51% 11370 8% /opt
/dev/lv00 131072 126912 4% 18 1% /var/adm/csd
/dev/fslv01 131072 130724 1% 4 1% /tftpboot
/dev/fslv00 6684672 3719036 45% 6 1% /export/eznim
/dev/ora10msrunlvbak 8912896 5165508 43% 24814 3% /ora10msrunbak
/dev/msrundatalvbak 13107200 2895980 78% 32 1% /msrundatabak
/dev/msrunloglvbak 5242880 3094480 41% 8185 2% /msrunlogbak
/dev/mksysblv 13107200 6238868 53% 5 1% /mksysb
/dev/tlg1d0_baklv 26214400 17495848 34% 1111 1% /tlg1d0_bak
/dev/msrundatalv 13107200 2896000 78% 31 1% /msrundata
/dev/msrunloglv 5242880 3112216 41% 7996 2% /msrunlog
/dev/ora10msrunlv 8912896 5173252 42% 23406 2% /ora10msrun
[tlg1a0][root][/]>df -k
Filesystem 1024-blocks Free %Used Iused %Iused Mounted on
/dev/hd4 655360 93968 86% 3450 14% /
/dev/hd2 8650752 3779000 57% 90347 10% /usr
/dev/hd9var 1179648 1001248 16% 1164 1% /var
/dev/hd3 1179648 1136192 4% 143 1% /tmp
/dev/hd1 655360 587592 11% 489 1% /home
/proc - - - - - /proc
/dev/hd10opt 1835008 1244528 33% 11374 4% /opt
/dev/lv00 131072 126912 4% 18 1% /var/adm/csd
/dev/fslv01 131072 118800 10% 7 1% /tftpboot
/dev/fslv00 7733248 960696 88% 27298 12% /export/eznim
/dev/fslv02 15335424 1143804 93% 4525 2% /soft_ins
/dev/tlg1a0_baklv 26214400 20751500 21% 16 1% /tlg1a0_bak
/dev/ora10mscltlvbak 13107200 9386904 29% 16852 1% /ora10mscltbak
/dev/tux81msrunlvbak 655360 407932 38% 2685 3% /tux81msrunbak
/dev/meslvbak 8912896 4815640 46% 6349 1% /BSMesWare/BSX1bak
/dev/xcomlvbak 5242880 5186260 2% 4178 1% /xcombak
/dev/ora10mscltlv 13107200 9386748 29% 16852 1% /ora10msclt
/dev/tux81msrunlv 786432 538952 32% 2685 3% /tux81msrun
/dev/meslv 14155776 10057876 29% 6343 1% /BSMesWare/BSX1
/dev/xcomlv 5242880 5186280 2% 4178 1% /xcom
3) 安装包要求:
RSCT 2.3.6.0 或更高版本。lslpp –l|grep rsct
4) 以下的包也是必须要安装的:(脚本可直接拷贝运行)
lslpp -l rsct.*
lslpp -l bos.adt.lib
lslpp -l bos.adt.libm
lslpp -l bos.adt.syscalls
lslpp -l bos.net.tcp.client
lslpp -l bos.net.tcp.server
lslpp -l bos.rte.SRC
lslpp -l bos.rte.libc
lslpp -l bos.rte.libcfg
lslpp -l bos.rte.libcur
lslpp -l bos.rte.libpthreads
lslpp -l bos.rte.odm
5) tlg1a0结果:
[tlg1a0][root][/]>lslpp -l rsct.*
lslpp -l bos.adt.lib
Fileset Level State Description
----------------------------------------------------------------------------
Path: /usr/lib/objrepos
rsct.basic.hacmp 2.4.4.0 COMMITTED RSCT Basic Function (HACMP/ES
Support)
rsct.basic.rte 2.4.4.0 COMMITTED RSCT Basic Function
rsct.basic.sp 2.4.4.0 COMMITTED RSCT Basic Function (PSSP
Support)
rsct.clients.rte 99.99.999.999 COMMITTED Supersede Entry - Not really
installed
rsct.compat.basic.hacmp 2.4.4.0 COMMITTED RSCT Event Management Basic
Function (HACMP/ES Support)
rsct.compat.basic.rte 2.4.4.0 COMMITTED RSCT Event Management Basic
Function
rsct.compat.basic.sp 2.4.4.0 COMMITTED RSCT Event Management Basic
Function (PSSP Support)
rsct.compat.clients.hacmp 2.4.4.0 COMMITTED RSCT Event Management Client
Function (HACMP/ES Support)
rsct.compat.clients.rte 2.4.4.0 COMMITTED RSCT Event Management Client
Function
rsct.compat.clients.sp 2.4.4.0 COMMITTED RSCT Event Management Client
Function (PSSP Support)
rsct.core.auditrm 2.4.4.0 COMMITTED RSCT Audit Log Resource
Manager
rsct.core.errm 2.4.4.0 COMMITTED RSCT Event Response Resource
Manager
rsct.core.fsrm 2.4.4.0 COMMITTED RSCT File System Resource
Manager
rsct.core.gui 2.4.4.0 COMMITTED RSCT Graphical User Interface
rsct.core.hostrm 2.4.4.0 COMMITTED RSCT Host Resource Manager
rsct.core.lprm 2.4.4.0 COMMITTED RSCT Least Privilege Resource
Manager
rsct.core.rmc 2.4.4.0 COMMITTED RSCT Resource Monitoring and
Control
rsct.core.sec 2.4.4.0 COMMITTED RSCT Security
rsct.core.sensorrm 2.4.4.0 COMMITTED RSCT Sensor Resource Manager
rsct.core.sr 2.4.4.0 COMMITTED RSCT Registry
rsct.core.utils 2.4.4.0 COMMITTED RSCT Utilities
rsct.lapi.msg.EN_US.rte 2.4.0.0 COMMITTED RSCT LAPI Msgs - U.S. English
(UTF)
rsct.lapi.msg.en_US.rte 2.4.0.0 COMMITTED RSCT LAPI Msgs - U.S. English
rsct.lapi.nam 2.4.2.0 COMMITTED RSCT LAPI NAM Files
rsct.lapi.rte 2.4.2.0 COMMITTED RSCT LAPI Runtime
rsct.lapi.samp 2.4.2.0 COMMITTED RSCT LAPI Samples
rsct.msg.EN_US.core.auditrm
2.4.0.0 COMMITTED RSCT Audit Log RM Msgs - U.S.
English (UTF)
rsct.msg.EN_US.core.errm 2.4.0.0 COMMITTED RSCT Event Response RM Msgs -
U.S. English (UTF)
rsct.msg.EN_US.core.fsrm 2.4.0.0 COMMITTED RSCT File System RM Msgs -
U.S. English (UTF)
rsct.msg.EN_US.core.gui 2.4.0.0 COMMITTED RSCT GUI Msgs - U.S. English
(UTF)
rsct.msg.EN_US.core.hostrm
2.4.0.0 COMMITTED RSCT Host RM Msgs - U.S.
English (UTF)
rsct.msg.EN_US.core.lprm 2.4.0.0 COMMITTED RSCT LPRM Msgs - U.S. English
(UTF)
rsct.msg.EN_US.core.rmc 2.4.0.0 COMMITTED RSCT RMC Msgs - U.S. English
(UTF)
rsct.msg.EN_US.core.sec 2.4.0.0 COMMITTED RSCT Security Msgs - U.S.
English (UTF)
rsct.msg.EN_US.core.sensorrm
2.4.0.0 COMMITTED RSCT Sensor RM Msgs - U.S.
English (UTF)
rsct.msg.EN_US.core.sr 2.4.0.0 COMMITTED RSCT Registry Msgs - U.S.
English (UTF)
rsct.msg.EN_US.core.utils 2.4.0.0 COMMITTED RSCT Utilities Msgs - U.S.
English (UTF)
rsct.msg.en_US.basic.rte 2.4.0.0 COMMITTED RSCT Basic Msgs - U.S. English
rsct.msg.en_US.core.auditrm
2.4.0.0 COMMITTED RSCT Audit Log RM Msgs - U.S.
English
rsct.msg.en_US.core.errm 2.4.0.0 COMMITTED RSCT Event Response RM Msgs -
U.S. English
rsct.msg.en_US.core.fsrm 2.4.0.0 COMMITTED RSCT File System RM Msgs -
U.S. English
rsct.msg.en_US.core.gui 2.4.0.0 COMMITTED RSCT GUI Msgs - U.S. English
rsct.msg.en_US.core.gui.com
2.4.0.0 COMMITTED RSCT GUI JAVA Msgs - U.S.
English
rsct.msg.en_US.core.hostrm
2.4.0.0 COMMITTED RSCT Host RM Msgs - U.S.
English
rsct.msg.en_US.core.lprm 2.4.0.0 COMMITTED RSCT LPRM Msgs - U.S. English
rsct.msg.en_US.core.rmc 2.4.0.0 COMMITTED RSCT RMC Msgs - U.S. English
rsct.msg.en_US.core.rmc.com
2.4.0.0 COMMITTED RSCT RMC JAVA Msgs - U.S.
English
rsct.msg.en_US.core.sec 2.4.0.0 COMMITTED RSCT Security Msgs - U.S.
English
rsct.msg.en_US.core.sensorrm
2.4.0.0 COMMITTED RSCT Sensor RM Msgs - U.S.
English
rsct.msg.en_US.core.sr 2.4.0.0 COMMITTED RSCT Registry Msgs - U.S.
English
rsct.msg.en_US.core.utils 2.4.0.0 COMMITTED RSCT Utilities Msgs - U.S.
English
rsct.vsd.cmds 4.1.0.0 COMMITTED VSD Commands
rsct.vsd.rvsd 4.1.0.0 COMMITTED Recoverable VSD
rsct.vsd.vsdd 4.1.0.0 COMMITTED VSD Device Driver
rsct.vsd.vsdrm 4.1.0.0 COMMITTED VSD Resource Manager
Path: /etc/objrepos
rsct.basic.rte 2.4.4.0 COMMITTED RSCT Basic Function
rsct.compat.basic.rte 2.4.1.0 COMMITTED RSCT Event Management Basic
Function
rsct.core.auditrm 2.4.4.0 COMMITTED RSCT Audit Log Resource
Manager
rsct.core.errm 2.4.4.0 COMMITTED RSCT Event Response Resource
Manager
rsct.core.fsrm 2.4.4.0 COMMITTED RSCT File System Resource
Manager
rsct.core.hostrm 2.4.4.0 COMMITTED RSCT Host Resource Manager
rsct.core.lprm 2.4.4.0 COMMITTED RSCT Least Privilege Resource
Manager
rsct.core.rmc 2.4.4.0 COMMITTED RSCT Resource Monitoring and
Control
rsct.core.sec 2.4.4.0 COMMITTED RSCT Security
rsct.core.sensorrm 2.4.4.0 COMMITTED RSCT Sensor Resource Manager
rsct.core.sr 2.4.4.0 COMMITTED RSCT Registry
rsct.core.utils 2.4.4.0 COMMITTED RSCT Utilities
rsct.lapi.nam 2.4.2.0 COMMITTED RSCT LAPI NAM Files
rsct.lapi.rte 2.4.2.0 COMMITTED RSCT LAPI Runtime
rsct.vsd.cmds 4.1.0.0 COMMITTED VSD Commands
rsct.vsd.rvsd 4.1.0.0 COMMITTED Recoverable VSD
rsct.vsd.vsdd 4.1.0.0 COMMITTED VSD Device Driver
rsct.vsd.vsdrm 4.1.0.0 COMMITTED VSD Resource Manager
[tlg1a0][root][/]>lslpp -l bos.adt.lib
Fileset Level State Description
----------------------------------------------------------------------------
Path: /usr/lib/objrepos
bos.adt.lib 5.3.0.40 COMMITTED Base Application Development
Libraries
[tlg1a0][root][/]>lslpp -l bos.adt.libm
Fileset Level State Description
----------------------------------------------------------------------------
Path: /usr/lib/objrepos
bos.adt.libm 5.3.0.40 COMMITTED Base Application Development
Math Library
[tlg1a0][root][/]>lslpp -l bos.adt.syscalls
Fileset Level State Description
----------------------------------------------------------------------------
Path: /usr/lib/objrepos
bos.adt.syscalls 5.3.0.40 COMMITTED System Calls Application
Development Toolkit
[tlg1a0][root][/]>lslpp -l bos.net.tcp.client
Fileset Level State Description
----------------------------------------------------------------------------
Path: /usr/lib/objrepos
bos.net.tcp.client 5.3.0.41 COMMITTED TCP/IP Client Support
Path: /etc/objrepos
bos.net.tcp.client 5.3.0.10 COMMITTED TCP/IP Client Support
[tlg1a0][root][/]>lslpp -l bos.net.tcp.server
Fileset Level State Description
----------------------------------------------------------------------------
Path: /usr/lib/objrepos
bos.net.tcp.server 5.3.0.40 COMMITTED TCP/IP Server
Path: /etc/objrepos
bos.net.tcp.server 5.3.0.10 COMMITTED TCP/IP Server
[tlg1a0][root][/]>lslpp -l bos.rte.SRC
Fileset Level State Description
----------------------------------------------------------------------------
Path: /usr/lib/objrepos
bos.rte.SRC 5.3.0.40 COMMITTED System Resource Controller
Path: /etc/objrepos
bos.rte.SRC 5.3.0.10 COMMITTED System Resource Controller
[tlg1a0][root][/]>lslpp -l bos.rte.libc
Fileset Level State Description
----------------------------------------------------------------------------
Path: /usr/lib/objrepos
bos.rte.libc 5.3.0.41 COMMITTED libc Library
[tlg1a0][root][/]>lslpp -l bos.rte.libcfg
Fileset Level State Description
----------------------------------------------------------------------------
Path: /usr/lib/objrepos
bos.rte.libcfg 5.3.0.40 COMMITTED libcfg Library
[tlg1a0][root][/]>lslpp -l bos.rte.libcur
Fileset Level State Description
----------------------------------------------------------------------------
Path: /usr/lib/objrepos
bos.rte.libcur 5.3.0.40 COMMITTED libcurses Library
[tlg1a0][root][/]>lslpp -l bos.rte.libpthreads
Fileset Level State Description
----------------------------------------------------------------------------
Path: /usr/lib/objrepos
bos.rte.libpthreads 5.3.0.40 COMMITTED libpthreads Library
[tlg1a0][root][/]>lslpp -l bos.rte.odm
Fileset Level State Description
----------------------------------------------------------------------------
Path: /usr/lib/objrepos
bos.rte.odm 5.3.0.40 COMMITTED Object Data Manager
6) 如果您要安装并行的资源组(如oracle RAC),还要安装下面的包:
lslpp –l bos.rte.lvm.rte (5.2.0.25 or higher)
lslpp -l bos.rte.lvm.rte
lslpp –l bos.clvm.enh
2 开始安装 (注意,每机器均要完成)
1) 安装HACMP5.2
如果是光盘,请插入光盘 ,输入smitty install_latest
Install Software
Type or select values in entry fields.
Press Enter AFTER making all desired changes.
[Entry Fields]
* INPUT device / directory for software /dev/cd0
* SOFTWARE to install [_all_latest]
PREVIEW only? (install operation will NOT occur) no
COMMIT software updates? yes
SAVE replaced files? no
AUTOMATICALLY install requisite software? yes
EXTEND file systems if space needed? yes
OVERWRITE same or newer versions? no
VERIFY install and check file sizes? no
Include corresponding LANGUAGE filesets? yes
DETAILED output? no
Process multiple volumes? yes
ACCEPT new license agreements? yes
Preview new LICENSE agreements? no
如果是安装盘拷贝,请进入cd installp/ppc目录,smitty install_latest
Install Software
Type or select values in entry fields.
Press Enter AFTER making all desired changes.
[Entry Fields]
* INPUT device / directory for software .
* SOFTWARE to install [_all_latest]
…..
ACCEPT new license agreements? yes
Preview new LICENSE agreements? no
安装结束后,会报failed,请检查
clluster.hativoli
clusterhaview
netwiew
包以外,所有的hacmp的包都要安装
2) 打补丁至5205
注意,请不要忽略给hacmp打补丁这一步骤。其实对hacmp来说,补丁是十分重要的。很多发现的缺陷都已经在补丁中被解决了。当严格的按照正确步骤安装和配置完hacmp的软件后,发现takeover 有问题,ip接管有问题,机器自动宕机等等千奇百怪的问题,其实都与补丁有关。所以一定要注意打补丁这个环节。此次hacmp为5205或补丁IY73937。
smitty install_latest,全部安装
IY53044 - Latest HACMP for AIX R510 Fixes as of January 2004
[tlg1a0][root][/soft_ins/ha52/patch]>ls
.toc cluster.es.cspoc.dsh.5.2.0.21.bff
cluster.adt.es.client.include.5.2.0.3.bff cluster.es.cspoc.rte.5.2.0.3.bff
cluster.adt.es.client.include.5.2.0.31.bff cluster.es.cspoc.rte.5.2.0.31.bff
cluster.adt.es.client.samples.clinfo.5.2.0.3.bff cluster.es.ercmf.cmds.5.2.0.3.bff
cluster.adt.es.client.samples.clinfo.5.2.0.31.bff cluster.es.ercmf.cmds.5.2.0.31.bff
cluster.adt.es.client.samples.clstat.5.2.0.3.bff cluster.es.ercmf.rte.5.2.0.1.bff
cluster.adt.es.client.samples.clstat.5.2.0.31.bff cluster.es.ercmf.rte.5.2.0.11.bff
cluster.adt.es.client.samples.libcl.5.2.0.1.bff cluster.es.pprc.cmds.5.2.0.3.bff
cluster.adt.es.client.samples.libcl.5.2.0.11.bff cluster.es.pprc.cmds.5.2.0.31.bff
cluster.doc.en_US.es.html.5.2.0.1.bff cluster.es.pprc.rte.5.2.0.3.bff
cluster.doc.en_US.es.pdf.5.2.0.1.bff cluster.es.pprc.rte.5.2.0.31.bff
cluster.doc.en_US.es.pdf.5.2.0.11.bff cluster.es.server.cfgast.5.2.0.1.bff
cluster.doc.en_US.pprc.html.5.2.0.1.bff cluster.es.server.cfgast.5.2.0.11.bff
cluster.doc.en_US.pprc.html.5.2.0.11.bff cluster.es.server.diag.5.2.0.4.bff
cluster.doc.en_US.pprc.pdf.5.2.0.1.bff cluster.es.server.diag.5.2.0.41.bff
cluster.doc.en_US.pprc.pdf.5.2.0.11.bff cluster.es.server.events.5.2.0.3.bff
cluster.doc.en_US.websphere.html.5.2.0.1.bff cluster.es.server.events.5.2.0.31.bff
cluster.doc.en_US.websphere.html.5.2.0.11.bff cluster.es.server.rte.5.2.0.5.bff
cluster.doc.en_US.websphere.pdf.5.2.0.1.bff cluster.es.server.rte.5.2.0.51.bff
cluster.doc.en_US.websphere.pdf.5.2.0.11.bff cluster.es.server.testtool.5.2.0.1.bff
cluster.es.cfs.rte.5.2.0.3.bff cluster.es.server.utils.5.2.0.4.bff
cluster.es.cfs.rte.5.2.0.31.bff cluster.es.server.utils.5.2.0.41.bff
cluster.es.client.lib.5.2.0.4.bff cluster.es.svcpprc.cmds.5.2.0.1.bff
cluster.es.client.lib.5.2.0.41.bff cluster.es.svcpprc.cmds.5.2.0.11.bff
cluster.es.client.rte.5.2.0.4.bff cluster.es.svcpprc.rte.5.2.0.1.bff
cluster.es.client.rte.5.2.0.41.bff cluster.es.websphere.5.2.0.1.bff
cluster.es.client.utils.5.2.0.3.bff cluster.es.worksheets.5.2.0.3.bff
cluster.es.client.utils.5.2.0.31.bff cluster.man.en_US.es.data.5.2.0.1.bff
cluster.es.client.wsm.5.2.0.21.bff cluster.man.en_US.es.data.5.2.0.11.bff
cluster.es.cspoc.cmds.5.2.0.3.bff cluster.xd.glvm.5.2.0.1.bff
cluster.es.cspoc.cmds.5.2.0.31.bff cluster.xd.glvm.5.2.0.11.bff
安装结束后,仍会报failed,检查
cluster.doc.en_US.pprc
cluster.doc.en_US.websphere
cluster.es.ercmf
cluster.es.pprc
cluster.es.svcpprc
cluster.es.websphere
cluster.xd.glvm
没装上外,其他都已安装上。
当然大家可以从IBM网站上下载,或者打800-810-1818 热线电话索要。
3) 重启机器
记住一定要重起机器,否则安装将无法正常继续。
4) 安装确认
Ø 确认inittab:
egrep "HA|clcomd" /etc/inittab
harc:2:wait:/usr/es/sbin/cluster/etc/harc.net # HACMP for AIX network startup
clcomdES:2:once:startsrc -s clcomdES >/dev/console 2>&1
clinit:a:wait:/bin/touch /usr/es/sbin/cluster/.telinit # HACMP for AIX These must be the last entries of run level a in inittab!
pst_clinit:a:wait:/bin/echo Created /usr/es/sbin/cluster/.telinit > /dev/console # HACMP for AIX These must be the last entries of run level a in inittab!
在hacmp 5.1 中 为了安全起见,不再使用/.rhosts 文件来控制两台机器 之间的命令和数据交换,而是引进的一个新的进程clcomd 。 如果你编辑/etc/inittab文件 就会发现安装完hacmp后,添加了一行:clcomdES:2:once:startsrc -s clcomdES >/dev/console 2>&1 。
Ø 确认安装和补丁包:(关键为cluster.es.server.rte)
lslpp -l cluster.*
Fileset Level State Description
----------------------------------------------------------------------------
Path: /usr/lib/objrepos
cluster.adt.es.client.include
5.2.0.3 COMMITTED ES Client Include Files
cluster.adt.es.client.samples.clinfo
5.2.0.3 COMMITTED ES Client CLINFO Samples
cluster.adt.es.client.samples.clstat
5.2.0.3 COMMITTED ES Client Clstat Samples
cluster.adt.es.client.samples.libcl
5.2.0.1 COMMITTED ES Client LIBCL Samples
cluster.adt.es.java.demo.monitor
5.2.0.0 COMMITTED ES Web Based Monitor Demo
cluster.doc.en_US.es.html 5.2.0.1 COMMITTED HAES Web-based HTML
Documentation - U.S. English
cluster.doc.en_US.es.pdf 5.2.0.1 COMMITTED HAES PDF Documentation - U.S.
English
cluster.es.cfs.rte 5.2.0.3 COMMITTED ES Cluster File System Support
cluster.es.client.lib 5.2.0.4 COMMITTED ES Client Libraries
cluster.es.client.rte 5.2.0.4 COMMITTED ES Client Runtime
cluster.es.client.utils 5.2.0.3 COMMITTED ES Client Utilities
cluster.es.client.wsm 5.2.0.2 COMMITTED Web based Smit
cluster.es.clvm.rte 5.2.0.0 COMMITTED ES for AIX Concurrent Access
cluster.es.cspoc.cmds 5.2.0.3 COMMITTED ES CSPOC Commands
cluster.es.cspoc.dsh 5.2.0.2 COMMITTED ES CSPOC dsh
cluster.es.cspoc.rte 5.2.0.3 COMMITTED ES CSPOC Runtime Commands
cluster.es.plugins.dhcp 5.2.0.0 COMMITTED ES Plugins - dhcp
cluster.es.plugins.dns 5.2.0.0 COMMITTED ES Plugins - Name Server
cluster.es.plugins.printserver
5.2.0.0 COMMITTED ES Plugins - Print Server
cluster.es.server.cfgast 5.2.0.1 COMMITTED ES Two-Node Configuration
Assistant
cluster.es.server.diag 5.2.0.4 COMMITTED ES Server Diags
cluster.es.server.events 5.2.0.3 COMMITTED ES Server Events
cluster.es.server.rte 5.2.0.5 COMMITTED ES Base Server Runtime
cluster.es.server.testtool
5.2.0.1 COMMITTED ES Cluster Test Tool
cluster.es.server.utils 5.2.0.4 COMMITTED ES Server Utilities
cluster.es.worksheets 5.2.0.3 COMMITTED Online Planning Worksheets
cluster.license 5.2.0.0 COMMITTED HACMP Electronic License
cluster.msg.en_US.cspoc 5.2.0.0 COMMITTED HACMP CSPOC Messages - U.S.
English
cluster.msg.en_US.es.client
5.2.0.0 COMMITTED ES Client Messages - U.S.
English
cluster.msg.en_US.es.server
5.2.0.0 COMMITTED ES Recovery Driver Messages -
U.S. English
Path: /etc/objrepos
cluster.es.client.rte 5.2.0.0 COMMITTED ES Client Runtime
cluster.es.clvm.rte 5.2.0.0 COMMITTED ES for AIX Concurrent Access
cluster.es.cspoc.rte 5.2.0.0 COMMITTED ES CSPOC Runtime Commands
cluster.es.server.diag 5.2.0.0 COMMITTED ES Server Diags
cluster.es.server.events 5.2.0.0 COMMITTED ES Server Events
cluster.es.server.rte 5.2.0.5 COMMITTED ES Base Server Runtime
cluster.es.server.utils 5.2.0.0 COMMITTED ES Server Utilities
Path: /usr/share/lib/objrepos
cluster.man.en_US.es.data 5.2.0.1 COMMITTED ES Man Pages - U.S. English
Ø 确认进程
ps -ef |grep clcomd
root 115694 65676 0 Jun 07 - 2:09 /usr/es/sbin/cluster/clcomd -d
第三章 准备篇
1. 修改.rhosts
vi /.rhosts
tlg1a0 root
tlg1a0_stb
tlg1a0_isvc
tlg1a0_boot
tlg1a0_svc
tlg1a0_stb
tlg1d0_isvc
tlg1d0_boot
tlg1d0_svc
tlg1d0_stb
tlg1a0_boot1
tlg1a0_l1_svc
tlg1a0_l1_stb
tlg1d0_boot1
tlg1d0_l1_svc
tlg1d0_l1_stb
tlg1a0_boot2
tlg1a0_l2_svc
tlg1a0_l2_stb
tlg1d0_boot2
tlg1d0_l2_svc
tlg1d0_l2_stb
tlg1a0_1580_svc
tlg1a0_1580_stb
tlg1d0_1580_svc
tlg1d0_1580_stb
在hacmp 5.2中 为了安全起见,不再使用/.rhosts 文件来控制两台机器之间的命令和数据交换,使用 /usr/es/sbin/cluster/etc/rhosts 文件来代替 /.rhosts 文件的功能。
注意:如果两个节点间的通讯发生了什么问题,可以检查rhots 文件,或者编辑rhosts文件
加入两个节点的网络信息。为方便查问题,配置期间让/.rhosts和HA的rhosts一致。
cp /.rhosts /usr/es/sbin/cluster/etc/rhosts
rcp /.rhosts tlg1d0:/usr/es/sbin/cluster/etc/rhosts
rcp /.rhosts tlg1d0:/
2. 添加共享vg , jfs2:
[tlg1d0][root][/]>lspv
hdisk0 00c1fe1f0215b425 rootvg active
hdisk1 00c1fe1f8d700839 rootvg active
hdisk3 none none
hdisk4 none none
hdisk6 00c1eedf177bd321 g1d0msd2vg active
hdisk2 00c1fe1f966aff5a g1d0msd1vg active
hdisk7 00c1eedf177ba8e3 g1d0msd2vg active
hdisk8 00c1eedf220b9697 None
hdisk9 00c1eedf220bd5f6 None
hdisk10 00c1fe1f220e7a99 g1d0msd3vg active
hdisk11 00c1fe1f220e7d28 g1d0msd3vg active
[tlg1a0][root][/]>lspv
hdisk3 00c1eedffa577b0e g1a0msa2vg active
hdisk0 00c1eedf721dbee3 rootvg active
hdisk1 00c1eedf01de40b3 rootvg active
hdisk2 00c1eedf6ddb9f5e g1a0msa1vg active
hdisk4 00c1eedffa577dca g1a0msa2vg active
hdisk6 none none
hdisk7 none none
hdisk8 00c1eedf220bd5f6 g1a0msa3vg active
hdisk9 00c1eedffaa954cc g1a0msa3vg active
hdisk10 none None
hdisk11 none None
3. 修改loglv
tlg1a0 (tlg1d0vg也要修改)
² 察看
[tlg1a0][root][/]>varyonvg g1a0msa2vg
[tlg1a0][root][/]>lsvg -l g1a0msa2vg
g1a0msa2vg:
LV NAME TYPE LPs PPs PVs LV STATE MOUNT POINT
ora10mscltlv jfs2 100 100 2 open/syncd /ora10msclt
tux81msrunlv jfs2 6 6 2 open/syncd /tux81msrun
meslv jfs2 108 108 2 open/syncd /BSMesWare/BSX1
xcomlv jfs2 40 40 2 open/syncd /xcom
loglv06 jfs2log 1 1 1 open/syncd N/A
umount vg上所有fs
umount /ora10msclt
umount / tux81msrun
umount / BSMesWare/BSX1
umount / xcom
² 修改loglv名称
[tlg1a0][root][/]> chlv -n g1a0msa2_loglv loglv06
0516-712 chlv: The chlv succeeded, however chfs must now be
run on every filesystem which references the old log name loglv06.
[tlg1a0][root][/]>lsvg -l g1a0msa2vg
g1a0msa2vg:
LV NAME TYPE LPs PPs PVs LV STATE MOUNT POINT
ora10mscltlv jfs2 100 100 2 closed/syncd /ora10msclt
tux81msrunlv jfs2 6 6 2 closed/syncd /tux81msrun
meslv jfs2 108 108 2 closed/syncd /BSMesWare/BSX1
xcomlv jfs2 40 40 2 closed/syncd /xcom
g1a0msa2_loglv jfs2log 1 1 1 closed/syncd N/A
² 修改fs对应关系
vi /etcfilesystems:
/dev/loglv06 –〉/dev/g1a0msa2_loglv
:g/loglv06/s//g1a0msa2_loglv/g
:wq
² 确认
mount /ora10msclt
mount / tux81msrun
mount / BSMesWare/BSX1
mount / xcom
4. 整理vg
在每台机器上先varyoffvg,运行以下脚本(实际可以copy以下脚本到文本编辑器替换成你实际的vg)
varyoffvg g1a0msa2vg
varyoffvg g1d0msd2vg
exportvg g1d0msd2vg
exportvg g1a0msa2vg
chdev -l hdisk6 -a pv=yes
chdev -l hdisk7 -a pv=yes
importvg -V 201 -n -y g1d0msd2vg hdisk6
varyonvg g1d0msd2vg
chvg -an g1d0msd2vg
chdev -l hdisk3 -a pv=yes
chdev -l hdisk4 -a pv=yes
importvg -V 101 -n -y g1a0msa2vg hdisk3
varyonvg g1a0msa2vg
chvg -an g1a0msa2vg
varyoffvg g1a0msa2vg
varyoffvg g1d0msd2vg
确认:
[tlg1d0][root][/home/scripts/takeover]>lspv
hdisk3 00c1eedffa577b0e g1a0msa2vg
hdisk0 00c1fe1f0215b425 rootvg active
hdisk1 00c1fe1f8d700839 rootvg active
hdisk4 00c1eedffa577dca g1a0msa2vg
hdisk6 00c1eedf177bd321 g1d0msd2vg
hdisk2 00c1fe1f966aff5a g1d0msd1vg active
hdisk7 00c1eedf177ba8e3 g1d0msd2vg
hdisk5 00c1eedfbf806475 ha_heart
hdisk8 00c1eedf220b9697 None
hdisk9 00c1eedf220bd5f6 None
hdisk10 00c1fe1f220e7a99 g1d0msd3vg active
hdisk11 00c1fe1f220e7d28 g1d0msd3vg active
[tlg1a0][root][/home/scripts/takeover]>lspv
hdisk3 00c1eedffa577b0e g1a0msa2vg
hdisk0 00c1eedf721dbee3 rootvg active
hdisk1 00c1eedf01de40b3 rootvg active
hdisk2 00c1eedf6ddb9f5e g1a0msa1vg active
hdisk4 00c1eedffa577dca g1a0msa2vg
hdisk6 00c1eedf177bd321 g1d0msd2vg
hdisk7 00c1eedf177ba8e3 g1d0msd2vg
hdisk8 00c1eedf220bd5f6 g1a0msa3vg active
hdisk9 00c1eedffaa954cc g1a0msa3vg active
hdisk5 00c1eedfbf806475 ha_heart
hdisk10 none None
hdisk11 none None
5. 修改ip地址
按照规划,修改ip地址 ,smitty tcpip,最终为
[tlg1a0][root][/]>netstat -in
Name Mtu Network Address Ipkts Ierrs Opkts Oerrs Coll
en0 1500 link#2 0.9.6b.dd.e1.aa 2177624 0 1368126 0 0
en0 1500 192.1.1 192.1.1.1 2177624 0 1368126 0 0
en1 1500 link#3 0.9.6b.dd.e1.ab 1928315 0 1222071 0 0
en1 1500 182.1.1 182.1.1.1 1928315 0 1222071 0 0
en2 1500 link#4 0.11.25.8.cf.23 1532962 0 868518 0 0
en2 1500 192.168.1 192.168.1.1 1532962 0 868518 0 0
en3 1500 link#5 0.11.25.8.cd.fa 1633025 0 969811 0 0
en3 1500 192.168.2 192.168.2.1 1633025 0 969811 0 0
lo0 16896 link#1 1290353 0 1291202 0 0
lo0 16896 127 127.0.0.1 1290353 0 1291202 0 0
lo0 16896 ::1 1290353 0 1291202 0 [tlg1d0][root][/]>netstat -in
Name Mtu Network Address Ipkts Ierrs Opkts Oerrs Coll
en0 1500 link#2 0.9.6b.dd.e5.64 1169895 0 1119848 0 0
en0 1500 192.1.1 192.1.1.2 1169895 0 1119848 0 0
en1 1500 link#3 0.9.6b.dd.e5.65 2070363 0 1607863 0 0
en1 1500 182.1.1 182.1.1.2 2070363 0 1607863 0 0
en2 1500 link#4 0.11.25.8.db.20 1647564 0 1178253 0 0
en2 1500 192.168.1 192.168.1.2 1647564 0 1178253 0 0
en3 1500 link#5 0.11.25.8.da.80 1631680 0 998077 0 0
en3 1500 192.168.2 192.168.2.2 1631680 0 998077 0 0
lo0 16896 link#1 1029297 0 1305172 0 0
lo0 16896 127 127.0.0.1 1029297 0 1305172 0 0
lo0 16896 ::1 1029297 0 1305172 0 0
6. 修改/etc/hosts
127.0.0.1 loopback localhost # loopback (lo0) name/address
182.1.1.21 hmc01
182.1.1.22 hmc02
##tlg1a0
182.1.1.1 tlg1a0_isvc tlg1a0_stb
168.120.1.201 tlg1a0_l1_svc
168.121.1.201 tlg1a0_l2_svc
140.100.100.101 tlg1a0_1580_svc
192.1.1.1 tlg1a0_boot
192.168.1.1 tlg1a0_boot1 tlg1a0
192.168.2.1 tlg1a0_boot2
190.2.206.11 tlg1a0_svc
##tlg1d0
182.1.1.2 tlg1d0_isvc tlg1d0_stb
192.1.1.2 tlg1d0_boot t
192.168.1.2 tlg1d0_boot1 lg1d0
192.168.2.2 tlg1d0_boot2
190.2.206.12 tlg1d0_svc
168.120.1.202 tlg1d0_l1_svc
168.121.1.202 tlg1d0_l2_svc
140.100.100.102 tlg1d0_1580_svc
7. 编写启停脚本
mkdir /usr/sbin/cluster/app
[tlg1a0][root][/usr/sbin/cluster/app]>ls
start_g1a0 start_g1d0 stop_g1a0 stop_g1d0
#start_g1a0
banner start tlg1a0
route delete 0
route add 0 190.2.206.240
route add -net 168.100.100 -netmask 255.255.255.0 168.121.1.240
su - bsx1 -c /home/scripts/tlg1a0/start_bsx1.sh
banner end tlg1a0
# stop_g1a0
banner stop tlg1a0
su - bsx1 -c /home/scripts/tlg1a0/stop_bsx1.sh &
sleep 15
su - bsx1 -c /home/scripts/tlg1a0/tux_shut_force.sh
banner end tlg1a0
# start_g1d0
banner start tlg1d0!
route delete 0
route add 0 190.2.206.240
route add -net 168.100.100.0 168.121.1.240
su - orarun -c /home/scripts/tlg1d0/startdb.sh
banner end start tlg1d0
#stop_g1d0
banner stop tlg1d0
su - orarun -c /home/scripts/tlg1d0/stopdb.sh
banner end tlg1d0
编写完成后cp到另一节点:
[tlg1a0][root][/usr/sbin/cluster/app]>rcp * tlg1d0:/usr/sbin/cluster/app
注意:在两个节点要保证hosts 和 启动/停止脚本要一样存在。
同时网关在start就脚本里要增加。
8. 配置 tty 心跳 网络/磁盘心跳
Ø 串口线心跳(两边都要增加)
. smitty tty->Change / add a TTY->rs232->sa->port number : 0
如果是有串口卡,则注意不要用port 0,tty0。
确认
a0: cat /etc/hosts>/dev/tty1
d0:cat
可看到tlg1a0上/etc/hosts的内容。
Ø 磁盘心跳
1. 建立1个共享盘 1G足够
2. 两边在此盘上都建立enhance current vg—〉ha_heart(取名),不用varyon
9. syncd daemon的数据刷新频率
该值表示刷新内存数据到硬盘的频率,缺省为60,HACMP安装后一般可改为10,
smitty hacmp -> HACMP Extended Configuration > Advanced Performance Tuning
Parameters > Change/Show syncd frequency ->10
or /usr/es/sbin/cluster/utilities/clchsyncd 10
10. 互拷用户的.profile等环境文件
[tlg1d0][root][/home/orarun]>rcp * tlg1a0:/home/orarun
[tlg1d0][root][/home/orarun]>rcp .profile tlg1a0:/home/orarun
[tlg1a0][root][/home/tuxrun]> rcp * tlg1a0:/home/tuxrun
[tlg1a0][root][/home/tuxrun]> rcp .profile tlg1a0:/home/tuxrun
11. 修改文件系统的目录权限,保证两边一致
[tlg1d0][root][/]>chown bsx1:bea /BSMesWare/BSX1
第四章 配置篇
[tlg1a0][root][/]>smitty hacmp
1.添加cluster 和node
Initialization and Standard Configuration –〉
Add Nodes to an HACMP Cluster ->
* Cluster Name [lg_cluster]
New Nodes (via selected communication paths)[m [tlg1a0 tlg1d0]
Currently Configured Node(s)
这一步很重要,一般我们都是把每个节点的boot 作为communication path . New node
可以一起加,也可以一个一个的加。当回车以后,系统会自己discover hacmp 的资源
显示如下:
。。。。。。。。。。。。。。。。。。。。
Communication path tlg1a0 discovered a new node. Hostname is tlg1a0. Adding it to the configuration with Nodename tlg1a0.
Communication path tlg1d0 discovered a new node. Hostname is tlg1d0. Adding it to the configuration with Nodename tlg1d0.
Discovering IP Network Connectivity
Retrieving data from available cluster nodes. This could take a few minutes....
Discovered [9] interfaces
IP Network Discovery completed normally
Current cluster configuration:
Cluster Name: lg_cluster
Cluster Connection Authentication Mode: Standard
Cluster Message Authentication Mode: None
Cluster Message Encryption: None
Use Persistent Labels for Communication: No
There are 2 node(s) and 3 network(s) defined
NODE tlg1a0:
Network net_ether_01
tlg1a0_boot 192.1.1.1
Network net_ether_02
tlg1a0_isvc 182.1.1.1
Network net_ether_03
tlg1a0_boot2 192.168.2.1
tlg1a0_boot1 192.168.1.1
NODE tlg1d0:
Network net_ether_01
tlg1d0_boot 192.1.1.2
Network net_ether_02
tlg1d0_isvc 182.1.1.2
Network net_ether_03
tlg1d0_boot1 192.168.1.2
……………………..
2.添加network,修改网卡归属
1) 修改网卡归属
通过上面步骤地自动discovery,实际一般network和devices均已添加,我们只需略做修正就可以了。
Extended Configuration
Extended Topology Configuration
Configure HACMP Communication Interfaces/Devices
Change/Show Communication Interfaces/Devices
选 en0 tlg1a0_boot
Change/Show a Communication Interface
Type or select values in entry fields.
Press Enter AFTER making all desired changes.
[Entry Fields]
Node Name [tlg1a0]
Network Interface en0
IP Label/Address tlg1a0_boot
Network Type ether
* Network Name [net_ether_01] –〉net_ether_02
同样修改tlg1d0_boot
Change/Show a Communication Interface
[Entry Fields]
Node Name [tlg1d0]
Network Interface en0
IP Label/Address tlg1d0_boot
Network Type ether
* Network Name [net_ether_02] –〉net_ether_02
这样可以看到net_ether_01没有了。
2) 心跳网络必须自己添加:
Extended Configuration
Extended Topology Configuration
Configure HACMP Communication Interfaces/Devices
Add Communication Interfaces/Devices
Add Discovered Communication Interface and Devices
Communication Devices
Ø 串口心跳
# Node Device Device Path
tlg1a0 tty1 /dev/tty1
tlg1d0 tty1 /dev/tty1
Ø 磁盘心跳
# Node Device Device Path Pvid
tlg1a0 hdisk5 /dev/hdisk5 00c1eedfbf8 x
tlg1d0 hdisk5 /dev/hdisk5 00c1eedfbf8 x
按F7逐一选中回车,将增加心跳网络,至此toplog已配好
3. 察看确认拓扑(toplog)结构
Extended Configuration
Extended Topology Configuration
Show HACMP Topology
Show Cluster Topology
Cluster Name: lg_cluster
Cluster Connection Authentication Mode: Standard
Cluster Message Authentication Mode: None
Cluster Message Encryption: None
Use Persistent Labels for Communication: No
NODE tlg1a0:
Network net_diskhb_01
tlg1a0_hdisk5_01 /dev/hdisk5
Network net_ether_02
tlg1a0_boot 192.1.1.1
tlg1a0_isvc 182.1.1.1
Network net_ether_03
tlg1a0_boot1 192.168.1.1
tlg1a0_boot2 192.168.2.1
NODE tlg1d0:
Network net_diskhb_01
tlg1d0_hdisk5_01 /dev/hdisk5
Network net_ether_02
tlg1d0_isvc 182.1.1.2
tlg1d0_boot 192.1.1.2
Network net_ether_03
tlg1d0_boot2 192.168.2.2
tlg1d0_boot1 192.168.1.2
可以看到已符合规划要求,可继续了
4. 添加高可用资源
(service ip , application server , vg and jfs )
1) 添加app server
Extended Configuration
Extended Resource Configuration
HACMP Extended Resources Configuration
Configure HACMP Applications
Configure HACMP Application Servers
Add an Application Server
* Server Name [g1a0_app]
*Start Script [/usr/sbin/cluster/app/start_g1a0]
* Stop Script [/usr/sbin/cluster/app/stop_g1a0]
Application Monitor Name(s)
同样增加 g1d0_app
* Server Name [g1d0_app]
*Start Script [/usr/sbin/cluster/app/start_g1d0]
* Stop Script [/usr/sbin/cluster/app/stop_g1d0]
2) 添加service ip
Extended Configuration
Extended Resource Configuration
HACMP Extended Resources Configuration
Configure HACMP Service IP Labels/Addresses
Add a Service IP Label/Address
Configurable on Multiple Nodes
net_ether_02 (192.1.1.0/24 182.1.1.0/24)
* IP Label/Address tlg1d0_svc
* Network Name net_ether_02
Alternate HW Address to accompany IP Label/Address []
同样增加tlg1a0_svc,
net_ether_03 (192.168.1.0/24 192.168.2.0/24)
同样增加tlg1a0_1580_svc, tlg1d0_1580_svc, tlg1a0_l1_svc, tlg1d0_l1_svc, tlg1a0_l2_svc, tlg1d0_l2_svc
3) 创建资源组
Extended Configuration
Extended Resource Configuration
HACMP Extended Resource Group Configuration
Add a Resource Group
Add a Resource Group (extended)
Type or select values in entry fields.
Press Enter AFTER making all desired changes. [Entry Fields]
* Resource Group Name [tlg1a0RG]
* Participating Nodes (Default Node Priority) [tlg1a0 tlg1d0]
Startup Policy Online On Home Node Only
Fallover Policy Fallover To Next Priority Node In The List
Fallback Policy Fallback To Higher Priority Node In The List
同样建立tlg1d0_RG
….
Resource Group Name [tlg1d0_RG]
* Participating Nodes (Default Node Priority) [tlg1d0 tlg1a0]
…
2).配置资源组
Extended Configuration
Extended Resource Configuration
HACMP Extended Resource Group Configuration
Change/Show Resources and Attributes for a Resource Group
tlg1a0_RG
Change/Show All Resources and Attributes for a Resource Group
Type or select values in entry fields.
Press Enter AFTER making all desired changes.
[Entry Fields]
Resource Group Name tlg1a0_RG
Participating Nodes (Default Node Priority) tlg1a0 tlg1d0
Startup Policy Online On Home Node Only
Fallover Policy Fallover To Next Priority Node In The List
Fallback Policy Fallback To Higher Priority Node In The List
Fallback Timer Policy (empty is immediate) []
Service IP Labels/Addresses [tlg1a0_1580_svc tlg1a0_l1_svc tlg1a0_l2_svc tlg1a0_svc]
Application Servers [g1a0_app]
Volume Groups [g1a0msa2vg]
Use forced varyon of volume groups, if necessary false
同样的方法配置tlg1d0_RG
5. 检查和同步hacmp配置
(注意:1-4的配置均在tlg1a0上完成,现先强制同步到tlg1d0)
Extended Configuration
Extended Verification and Synchronization
HACMP Verification and Synchronization
Type or select values in entry fields.
Press Enter AFTER making all desired changes.
[Entry Fields]
* Verify, Synchronize or Both [Both]
* Automatically correct errors found during [Yes]
verification?
* Force synchronization if verification fails? [Yes]
* Verify changes only? [No]
* Logging [Standard]
Command: OK stdout: yes stderr: no
Before command completion, additional instructions may appear below.
[MORE...5]
Verification will automatically correct verification errors.
Retrieving data from available cluster nodes. This could take a few minutes......
Verifying Cluster Topology...
Verifying Cluster Resources...
processing RG: tlg1a0_RG
processing RG: tlg1d0_RG
ERROR: The HACMP timestamp file for shared volume group: g1a0msa2vg is inconsistent
with the time stamp in the VGDA for the following nodes: tlg1a0 tlg1d0
ERROR: The HACMP timestamp file for shared volume group: g1d0msd2vg is inconsistent
with the time stamp in the VGDA for the following nodes: tlg1a0 tlg1d0
Starting Corrective Action: cl_resource_modify_vg_time_stamp.
Setting HACMP timestamp for volume group: g1a0msa2vg to 44a762222f495eff on node: tlg1a0: PASS
Setting HACMP timestamp for volume group: g1a0msa2vg to 44a762222f495eff on node: tlg1d0: PASS
Setting HACMP timestamp for volume group: g1d0msd2vg to 44a7654527e38a67 on node: tlg1a0: PASS
Setting HACMP timestamp for volume group: g1d0msd2vg to 44a7654527e38a67 on node: tlg1d0: PASS
-----------------------------------------------------------------
1 problem(s) were resolved.
Remember to redo automatic error notification if configuration has changed.
Updating ODM errnotify on node tlg1d0.
Adding any necessary HACMP entries to /etc/inittab and /etc/rc.net for IPAT on node tlg1a0.
Updating ODM errnotify on node tlg1a0.
Verification has completed normally.
记得修改/etc/hosts,改为svc的地址上。
##tlg1a0
182.1.1.1 tlg1a0_isvc tlg1a0_stb
168.120.1.201 tlg1a0_l1_svc
168.121.1.201 tlg1a0_l2_svc
140.100.100.101 tlg1a0_1580_svc
192.1.1.1 tlg1a0_boot
192.168.1.1 tlg1a0_boot1
192.168.2.1 tlg1a0_boot2
190.2.206.11 tlg1a0_svc tlg1a0
##tlg1d0
182.1.1.2 tlg1d0_isvc tlg1d0_stb
192.1.1.2 tlg1d0_boot t
192.168.1.2 tlg1d0_boot1
192.168.2.2 tlg1d0_boot2
190.2.206.12 tlg1d0_svc lg1d0
168.120.1.202 tlg1d0_l1_svc
168.121.1.202 tlg1d0_l2_svc
140.100.100.102 tlg1d0_1580_svc
到此为止我们的hacmp已经基本配置完成了。
6. 确认hacmp配置
1) snmp的调整(AIX5.3不需要)
在aix5.2 下要对snmp 做一些调整才可以看到真正的hacmp的状态。
具体来说, aix 5.2 的 snmp 默认是version 3 :
scmb02:/usr/sbin>ls -l |grep snmp
lrwxrwxrwx 1 root system 8 Apr 08 17:55 clsnmp -> clsnmpne
-rwxr-x--- 1 root system 83150 Mar 12 2003 clsnmpne
-rwxr-x--- 1 root system 55110 Mar 12 2003 pppsnmpd
lrwxrwxrwx 1 root system 9 Apr 08 17:55 snmpd -> snmpdv3ne
而hacmp 只支持snmp version 1 . 所以我们要做一下调整:
stopsrc -s snmpd
/usr/sbin/snmpv3_ssw -1startsrc -s snmpd
scmb02:/usr/sbin>ls -l |grep snmp
lrwxrwxrwx 1 root system 18 Apr 21 13:40 clsnmp -> /usr/sbin/clsnmpne
-rwxr-x--- 1 root system 83150 Mar 12 2003 clsnmpne
-rwxr-x--- 1 root system 55110 Mar 12 2003 pppsnmpd
lrwxrwxrwx 1 root system 17 Apr 21 13:40 snmpd -> /usr/sbin/snmpdv1
2) 启动hacmp时选择:
* Start now, on system restart or both[m now
Start Cluster Services on these nodes [scmb02]
BROADCAST message at startup? true
Startup Cluster Lock Services? false
Startup Cluster Information Daemon? true
Reacquire resources after forced down ? false
3) 启动clstat
cd /usr/sbin/cluster
clstat
clstat - HACMP Cluster Status Monitor
-------------------------------------
Cluster: lg_cluster (1145945461)
Sun Jul 2 16:54:16 BEIST 2006
State: UP Nodes: 2
SubState: STABLE
Node: tlg1a0 State: UP
Interface: tlg1a0_isvc (1) Address: 182.1.1.1
State: UP
Interface: tlg1a0_boot (1) Address: 192.1.1.1
State: UP
Interface: tlg1a0_boot1 (2) Address: 192.168.1.1
State: UP
Interface: tlg1a0_boot2 (2) Address: 192.168.2.1
State: UP
Interface: tlg1a0_heart (0) Address: 0.0.0.0
State: UP
Interface: tlg1a0_1580_svc (2) Address: 140.100.100.101
State: UP
Interface: tlg1a0_l1_svc (2) Address: 168.120.1.201
State: UP
Interface: tlg1a0_l2_svc (2) Address: 168.121.1.201
State: UP
Interface: tlg1a0_svc (1) Address: 190.2.206.11
State: UP
Resource Group: tlg1a0_RG State: On line
Node: tlg1d0 State: UP
Interface: tlg1d0_isvc (1) Address: 182.1.1.2
State: UP
Interface: tlg1d0_boot (1) Address: 192.1.1.2
State: UP
Interface: tlg1d0_boot1 (2) Address: 192.168.1.2
State: UP
Interface: tlg1d0_boot2 (2) Address: 192.168.2.2
State: UP
Interface: tlg1d0_heart (0) Address: 0.0.0.0
State: UP
Interface: tlg1d0_1580_svc (2) Address: 140.100.100.102
State: UP
第五章 测试篇
太累了,见另文《网段合并实现及交换机对HA影响测试报告.doc》
已经公开 Saturday, August 05, 2006 4:49 PM 作者:
mxin