今天有个备份策略出问题,报错内容是这样的:
04/24/2008 00:03:00 - requesting resource bfepdb-hcart2
Z,W([a&LYryW004/24/2008 00:03:00 - requesting resource bfbkup.NBU_CLIENT.MAXJOBS.bfepdbITPUB个人空间0HAx ^)t5Ne]'n
04/24/2008 00:03:00 - requesting resource bfbkup.NBU_POLICY.MAXJOBS.bfep_db
c:Ui+o_ d0t004/24/2008 00:03:00 - awaiting resource bfepdb-hcart2. Waiting for resources.ITPUB个人空间Nn[4mu6k+N y
Reason: Tape media server is not active, Media server: erpdb,ITPUB个人空间8RH\x8dLt
Robot Type(Number): TLD(0), Media ID: N/A, Drive Name: N/A,ITPUB个人空间.U9_Uu&v&z WZ
Volume Pool: DB_ep_full, Storage Unit: bfepdb-hcart2, Drive Scan Host: N/AITPUB个人空间;s/L`K4LV%la
client backup was not attempted because backup window closed (196)
在master server打开控制台,deivce--host,发现的status是“active for disk”。由于这个media server上面连接的是磁带库。正常应该是“active for type and disk”于是在这台机器上执行tpconfig –d,一直没有反馈信息。
1.通过bpps查看进程:
NB Processes
(hN6n#J Klo!Mw0------------
\,?&LE"I c:m0 root 11379 1 0 15:24:23 ? 0:00 /usr/openv/netbackup/bin/bpcompatd
mdEqI-` b,o&Y0 root 11384 1 0 15:24:28 ? 0:00 /usr/openv/netbackup/bin/nbslITPUB个人空间#b9u$q!l@{;O
root 11356 1 0 15:24:17 ? 0:00 /usr/openv/netbackup/bin/nbnosITPUB个人空间-^;K!Al3R*t-_%[p
root 11391 1 0 15:24:29 ? 0:00 /usr/openv/netbackup/bin/nbsvcmon
MM ProcessesITPUB个人空间#El.y O Uv9A B
------------ITPUB个人空间d@-f.m^[| {Z
root 11494 11364 0 15:30:37 ? 0:00 tldd
6Hgy"Ko1];H(T^qg0 root 11364 1 0 15:24:20 ? 0:00 /usr/openv/volmgr/bin/ltid
&[IfpA%I+I0 root 11497 11364 0 15:30:39 ? 0:00 avrd
TD"Me\)ho;qi:w5~0 root 11480 1 0 15:30:34 ? 0:00 vmd
没有发现异常情况。
2.在master server上执行:
vmdareq -a
发现没有bfepdb这个media server的信息,决定重启nbu进程。netbackup stop后bpps -a
NB Processes
pA0\;}G^(QV@{0------------
MM ProcessesITPUB个人空间x(F/^rN,s4w
------------
3.ioscan -fnC tape
Class I H/W Path Driver S/W State H/W Type Description
D#s1UEr:Jtk7\UZ0=========================================================================ITPUB个人空间.sD Dg:z;w/HMpQ
tape 0 0/0/1/0.4.0 stape CLAIMED DEVICE HP C5683AITPUB个人空间9z dFdS_fX3\5c8s
/dev/rmt/0m /dev/rmt/0mnb /dev/rmt/c0t4d0BESTn /dev/rmt/c0t4d0DDSb ITPUB个人空间z? p'TW
/dev/rmt/0mb /dev/rmt/c0t4d0BEST /dev/rmt/c0t4d0BESTnb /dev/rmt/c0t4d0DDSn
w'e DS8N Y%@z5rS_0 /dev/rmt/0mn /dev/rmt/c0t4d0BESTb /dev/rmt/c0t4d0DDS /dev/rmt/c0t4d0DDSnbITPUB个人空间]}1v%S P,_9tr(l
tape 7 0/10/0/0.97.26.255.1.3.0 stape CLAIMED DEVICE HP Ultrium 2-SCSI
^G {%e [6h`0 /dev/rmt/7m /dev/rmt/7mn /dev/rmt/c16t3d0BEST /dev/rmt/c16t3d0BESTn
;d Df:_ }$?.Yt T0 /dev/rmt/7mb /dev/rmt/7mnb /dev/rmt/c16t3d0BESTb /dev/rmt/c16t3d0BESTnbITPUB个人空间 AyOb-X w^4e E
tape 8 0/10/0/0.97.26.255.1.3.1 stape CLAIMED DEVICE HP Ultrium 2-SCSI
7l#Mx4fD[Q-j!Y] Q0 /dev/rmt/8m /dev/rmt/8mn /dev/rmt/c16t3d1BEST /dev/rmt/c16t3d1BESTn
gap#bYcof0 /dev/rmt/8mb /dev/rmt/8mnb /dev/rmt/c16t3d1BESTb /dev/rmt/c16t3d1BESTnb
9XY(y/G-y0tape 3 0/12/0/0.97.25.255.1.3.1 stape CLAIMED DEVICE HP Ultrium 2-SCSIITPUB个人空间&zo5F N u
/dev/rmt/3m /dev/rmt/3mn /dev/rmt/c14t3d1BEST /dev/rmt/c14t3d1BESTn
*d;vR_:u\0Md0 /dev/rmt/3mb /dev/rmt/3mnb /dev/rmt/c14t3d1BESTb /dev/rmt/c14t3d1BESTnb
+_0M+E?:` q%U)mh0tape 4 0/12/0/0.97.25.255.1.3.2 stape CLAIMED DEVICE HP Ultrium 2-SCSI
xJ5Y'l'uD1C0 /dev/rmt/4m /dev/rmt/4mn /dev/rmt/c14t3d2BEST /dev/rmt/c14t3d2BESTnITPUB个人空间&] m t$Jo(FC2La
/dev/rmt/4mb /dev/rmt/4mnb /dev/rmt/c14t3d2BESTb /dev/rmt/c14t3d2BESTnb
设备也没有什么异常情况。
4.接着netbackup start,再手工启动策略问题依然存在。
5.再次netbackup stop后执行bp.kill_all,彻底杀掉nbu进程,再netbackup start启动nbu,vmdareq -a一切正常。
# netbackup start
NetBackup Database Server started.
-C#V g/UPU0NetBackup Notification Service started.ITPUB个人空间XV^G6TJ Uu;t'DN
NetBackup Enterprise Media Manager started.ITPUB个人空间hYR5EzUL8ck
NetBackup Resource Broker started.
"X*I.[z+D ]0Media Manager daemons started.ITPUB个人空间dN E%JeE)O
NetBackup request daemon started.ITPUB个人空间D.OXa5l4f
NetBackup compatibility daemon started.ITPUB个人空间 D2Ss0B:u%r*fN|
NetBackup Job Manager started.ITPUB个人空间c*F?^ O4n![
NetBackup Policy Execution Manager started.
E {lp5U0C @*zR+Y0NetBackup Service Layer started.ITPUB个人空间!Gwil|@
NetBackup is not configured for clustering.
s |P B;w*GmmVp0NetBackup Service Monitor started.
# vmdareq -a
Drive2 - AVAILABLEITPUB个人空间;at+p.dCtW$m\ `
bfbkup UPITPUB个人空间Fv~] e"soi
erpdb UPITPUB个人空间z]"]uonP/?3_+r
Drive3 - AVAILABLE
&z;{%oct;Wg5V i0 bfbkup UP
C_#];W.Pz0 erpdb UP
],w-B E#Wb;y Z o.i0HPUltrium2-SCSI0 - AVAILABLEITPUB个人空间 ?C-b3vLZDURv
bfbkup UPITPUB个人空间7Ub GX7?&X:Y:A
erpdb UP
9_9V"i:G+Q Q0HPUltrium2-SCSI1 - AVAILABLEITPUB个人空间!rq,d {/D"r}&y
bfbkup UPITPUB个人空间Dd^;? V0Q
erpdb UP
6.结论
出现这种问题,可能是由nbu进程的异常造成的。但是正常的重启可能仍然不能解决问题,这时候需要执行bp.kill_all脚本来停止nbu的后台驻留程序。