一、故障定位
1.1.故障信息
Log摘要 #errpt -d H IDENTIFIER TIMESTAMP T C RESOURCE_NAME DESCRIPTION 51E537B5 0211064413 P H sysplanar0 platform_dump saved to file 291D64C3 0211064413 I H sysplanar0 Platform dump data 51E537B5 0211064313 P H sysplanar0 platform_dump saved to file 291D64C3 0211064313 I H sysplanar0 Platform dump data BFE4C025 0211063913 P H sysplanar0 UNDETERMINED ERROR 51E537B5 0207210013 P H sysplanar0 platform_dump saved to file 291D64C3 0207210013 I H sysplanar0 Platform dump data 51E537B5 0207205913 P H sysplanar0 platform_dump saved to file 291D64C3 0207205913 I H sysplanar0 Platform dump data BFE4C025 0207205513 P H sysplanar0 UNDETERMINED ERROR 51E537B5 0203023513 P H sysplanar0 platform_dump saved to file 291D64C3 0203023513 I H sysplanar0 Platform dump data 51E537B5 0203023413 P H sysplanar0 platform_dump saved to file 291D64C3 0203023413 I H sysplanar0 Platform dump data BFE4C025 0203023113 P H sysplanar0 UNDETERMINED ERROR #errpt –a j BFE4C025 LABEL: SCAN_ERROR_CHRP IDENTIFIER: BFE4C025 Date/Time: Sequence Number: 364 Machine Id: 00C1EEE44C00 Node Id: wap-partner1 Class: H Type: PERM Resource Name: sysplanar0 Resource Class: planar Resource Type: sysplanar_rspc Location: Description UNDETERMINED ERROR Failure Causes UNDETERMINED Recommended Actions RUN SYSTEM DIAGNOSTICS. Detail Data Diagnostic Analysis Diagnostic Log sequence number: 248 Resource tested: sysplanar0 Resource Description: System Planar Location: SRC: B151E40F Description: CEC hardware Unrecovered Error, general. Refer to the system service documentation for more information. Additional Words: 2-030000F0 3-53B43310 4-C13920FF 5-400000FF 6-00000000 7-00000000 8-00000000 9-00000000 Possible FRUs: Priority: H FRU: 10N6604 S/N: YL1028352020 CCIN: 53B4 Location: U788C.001.AAC4488-P1 Error/Event Logs Platform Event Log - 501A910A Created at :02/10/2013 22:42:16 Subsystem :I/O Bridge Event Severity :Informational Event Event Type : Miscellaneous, Informational Only Action Flags :Report to Operating System Action Status :Reported to Opr Sys Primary System Reference Code Reference Code :B7006992 Hex Words 2 - 5 :00000062 00010002 28510000 00000000 Hex Words 6 - 9 :000000A1 00011000 00000000 00000000 Log Hex Dump
|
1.2.故障定位
通过HMC收集ASM的故障现象,有错误指向主板微码版本。
通过OS中的errpt显示,有以下几次报错,错误位置指向主板(U788C.001.AAC4488-P1)。
BFE4C025 0211063913 P Hsysplanar0 UNDETERMINED ERROR
BFE4C025 0207205513 P Hsysplanar0 UNDETERMINED ERROR
BFE4C025 0203023113 P Hsysplanar0 UNDETERMINED ERROR
SRC: B151E40F
Description: CEC hardware Unrecovered Error, general. Refer to the
system service documentation for more information.
AdditionalWords: 2-030000F0 3-53B43310 4-D1411046 5-400000FF
6-00000000 7-00000000 8-00000000 9-00000000
Possible FRUs:
Priority: H FRU:10N6604 S/N: YL1028352020 CCIN: 53B4
Location:U788C.001.AAC4488-P1
二、故障处理
2.1.先决条件
注意 |
确保系统关机,电源断开 操作时,使用防静电护腕 添加或更换硬件组件之前请作好数据备份。如果部件未正确安装,则可能会导致数据丢失。 |
2.2.准备项
准备确认项 |
||
类型 |
准备项 |
状态 |
硬件 |
笔记本一台 |
已准备就绪 |
串口线一根 |
已准备就绪 |
|
网线一根 |
已准备就绪 |
|
一字、十字螺丝刀各一把 |
已准备就绪 |
|
防静电护腕一个 |
已准备就绪 |
|
新裸机器一台 |
已准备就绪 |
|
软件 |
||
其它 |
||
2.3.操作项
1、旧机器进入系统,通过bootlist –m normal –o 查看硬盘引导顺序,然后通过lscfg –vpl hdisk0记录下硬盘插槽的顺序。
2、备份好旧机器的必要数据后,Shutdown旧机器,待前面板绿灯慢速闪烁,拔下电源线。
3、拔出旧机器硬盘,按原有顺序,将硬盘插入新机器硬盘笼子。
4、旧机器下架,新机器上架,固定好以后接好输入输出设备,然后上电。
5、待新机器前面板绿灯慢速闪烁以后,按白色按钮开机。
6、开机后到选择界面按1进入SMS菜单,按5选择进入引导列表,选择从刚才查看到的第一块硬盘引导进入系统。
7、进入系统后查看errpt有无本次开机后的硬件报错,再查看文件系统是否正常,然后可以正常开启应用。
本次更换只需要将硬盘迁移到新机器上即可,如果新机器由于硬件问题不能正常引导进入系统,则可将硬盘插回原机器恢复。
涉及到的变更信息:
1.主机序列号,如果客户有固定资产记录,则需要进行修改。
2.网卡的MAC地址变更,如果客户进行了MAC地址绑定,则需要重新进行绑定。