1.故障定位
1.1.故障信息
Log摘要 通过串口线连接底层,搜集当前环境状态 sc>showenvironment =============== Environmental Status =============== -------------------------------------------------------------------------------- System Temperatures (Temperatures in Celsius): -------------------------------------------------------------------------------- Sensor Status Temp LowHard LowSoft LowWarn HighWarn HighSoft HighHard -------------------------------------------------------------------------------- MB.P0.T_CORE OK 60 -15 -10 0 100 105 110 MB.P1.T_CORE OK 62 -15 -10 0 100 105 110 MB.T_REMOTE OK 27 -- -- -- -- -- -- MB.T_1064 OK 53 -15 -10 0 105 110 115 MB.T_FIRE OK 35 -15 -10 0 95 105 108 MB.T_AMB OK 30 -15 -10 0 65 75 85 FIOB.T_AMB OK 15 -15 -10 0 45 47 50 PDB.T_DISK OK 23 -15 -10 0 55 65 70 PDB.T_PS0 OK 20 -15 -10 0 48 50 53 PDB.T_PS1 OK 20 -15 -10 0 48 50 53 -------------------------------------- Keyswitch: -------------------------------------- Keyswitch position: NORMAL -------------------------------------------------------- System Indicator Status: -------------------------------------------------------- SYS.LOCATE SYS.SERVICE SYS.ACT -------------------------------------------------------- OFF ON ON -------------------------------------------------------- SYS.PSFAIL SYS.OVERTEMP SYS.FANFAIL -------------------------------------------------------- OFF ON OFF -------------------------------------------- System Disks: -------------------------------------------- Disk Status Service OK2RM -------------------------------------------- HDD0 OK OFF OFF HDD1 OK OFF OFF HDD2 NOT PRESENT OFF OFF HDD3 NOT PRESENT OFF OFF ---------------------------------------------------------- Fans (Speeds Revolution Per Minute): ---------------------------------------------------------- Sensor Status Speed Warn Low ---------------------------------------------------------- PDB.HDDFB.FT6.F0 OK 10305 -- 8000 PDB.HDDFB.FT6.F1 OK 10465 -- 8000 FT0.F0 OK 5037 -- 2022 FT1.F0 OK 5037 -- 2022 FT2.F0 OK 5037 -- 2022 FT3.F0 OK 5273 -- 2022 FT4.F0 OK 5113 -- 2022 FT5.F0 OK 5192 -- 2022 -------------------------------------------------------------------------------- Voltage sensors (in Volts): -------------------------------------------------------------------------------- Sensor Status Voltage LowSoft LowWarn HighWarn HighSoft -------------------------------------------------------------------------------- MB.P0.V_CORE OK 1.45 1.21 1.23 1.57 1.60 MB.P1.V_CORE OK 1.47 1.21 1.23 1.57 1.60 MB.V_+3V3 OK 3.31 2.48 2.48 3.49 3.59 MB.V_+12V OK 12.10 9.04 9.04 12.96 13.56 MB.BAT.V_BAT OK 3.13 -- 2.26 -- -- -------------------------------------------- Power Supply Indicators: -------------------------------------------- Supply DC-OK AC-OK Service -------------------------------------------- PS0 ON ON OFF PS1 ON ON OFF ------------------------------------------------------------------------------ Power Supplies: ------------------------------------------------------------------------------ Supply Status Underspeed Overtemp Overvolt Undervolt Overcurrent ------------------------------------------------------------------------------ PS0 OK OFF OFF OFF OFF OFF PS1 OK OFF OFF OFF OFF OFF 进入系统收集当前环境状态 root@I2000 # 系统配置:Sun Microsystems sun4u Sun Fire V245 系统时钟频率:188 MHz 内存大小:4GB ==================================== CPUs ==================================== E$ CPU CPU CPU Freq Size Implementation Mask Status Location --- -------- ---------- --------------------- ----- ------ -------- 0 1504 MHz 1MB SUNW,UltraSPARC-IIIi 3.4 on-line MB/P0 1 1504 MHz 1MB SUNW,UltraSPARC-IIIi 3.4 on-line MB/P1 ================================== IO 设备 ================================== Bus Freq Slot + Name + Type MHz Status Path Model ------ ---- ---------- ---------------------------- -------------------- pci 188 MB pci10b9,5229 (ide) okay /pci@1e,600000/pci@0/pci@1/pci@0/ide pci 188 MB pci14e4,1668 (network) okay /pci@1e,600000/pci@0/pci@9/pci@0/network@4 pci 188 MB pci14e4,1668 (network) okay /pci@1e,600000/pci@0/pci@9/pci@0/network@4,1 pci 188 MB pci14e4,1668 (network) okay /pci@1e,600000/pci@0/pci@a/pci@0/network pci 188 MB pci14e4,1668 (network) okay /pci@1e,600000/pci@0/pci@a/pci@0/network pci 188 MB scsi-pci1000,50 (scsi-2) LSI,1064 okay /pci@1e,600000/pci@0/pci@a/pci@0/pci@8/scsi@1 ================================== 内存配置 ================================== 区段表: ----------------------------------------------------------------------- 基本地址大小交插系数包含 ----------------------------------------------------------------------- 0x0 2GB 4 BankIDs 0,1,2,3 0x1000000000 2GB 4 BankIDs 16,17,18,19 记忆库表: ----------------------------------------------------------- 物理位置ID ControllerID GroupID 大小交插方式 ----------------------------------------------------------- 0 0 0 512MB 0,1,2,3 1 0 1 512MB 2 0 1 512MB 3 0 0 512MB 16 1 0 512MB 0,1,2,3 17 1 1 512MB 18 1 1 512MB 19 1 0 512MB 内存模块群组: -------------------------------------------------- ControllerID GroupID Labels Status -------------------------------------------------- 0 0 MB/P0/B0/D0 okay 0 0 MB/P0/B0/D1 okay 0 1 MB/P0/B1/D0 okay 0 1 MB/P0/B1/D1 okay 1 0 MB/P1/B0/D0 okay 1 0 MB/P1/B0/D1 okay 1 1 MB/P1/B1/D0 okay 1 1 MB/P1/B1/D1 okay =============================== usb 设备 =============================== Name Port# ------------ ----- hub 1 ================================== 环境状态 ================================== 风扇状态: ------------------------------------------- Location Sensor Status ------------------------------------------- PDB/HDDFB/FT6/F0 F0 okay PDB/HDDFB/FT6/F1 F1 okay MB/FIOB/FCB0/FT0/F0 F0 okay MB/FIOB/FCB0/FT1/F0 F0 okay MB/FIOB/FCB0/FT2/F0 F0 okay MB/FIOB/FCB1/FT3/F0 F0 okay MB/FIOB/FCB1/FT4/F0 F0 okay MB/FIOB/FCB1/FT5/F0 F0 okay PS0 FF_FAN okay PS1 FF_FAN okay 温度传感器: ----------------------------------------- Location Sensor Status ----------------------------------------- MB/P0 T_CORE okay MB/P1 T_CORE okay MB T_REMOTE okay MB T_1064 okay MB T_FIRE okay MB T_AMB okay MB/FIOB T_AMB okay PDB T_DISK okay PDB T_PS0 okay PDB T_PS1 okay PS0 FF_OT okay PS1 FF_OT okay ------------------------------------ 当前的传感器: ---------------------------------------- Location Sensor Status ---------------------------------------- PS0 FF_OC okay PS1 FF_OC okay ------------------------------------ 电压传感器: ----------------------------------- Location Sensor Status ----------------------------------- MB/P0 V_CORE okay MB/P1 V_CORE okay MB V_+3V3 okay MB V_+12V okay MB/BATTERY V_BAT okay PS0 P_PWR okay PS0 FF_POK okay PS0 FF_UV okay PS0 FF_OV okay PS1 P_PWR okay PS1 FF_POK okay PS1 FF_UV okay PS1 FF_OV okay ----------------------------------------- 键开关: ----------------------------------------- 位置钥控开关状态 ----------------------------------------- MB SYSCTRL NORMAL -------------------------------------------------- Led 状态: -------------------------------------------------------------- Location Led State Color -------------------------------------------------------------- MB ACT on green MB LOCATE off white MB SERVICE on amber MB PSFAIL off amber MB OVERTEMP on amber MB FANFAIL off amber PS0 SERVICE off amber PS0 DC_OK on green PS0 AC_OK on green PS1 SERVICE off amber PS1 DC_OK on green PS1 AC_OK on green MB/HDDBP/HDD0 SERVICE off amber MB/HDDBP/HDD0 OK2RM off blue MB/HDDBP/HDD1 SERVICE off amber MB/HDDBP/HDD1 OK2RM off blue MB/HDDBP/HDD2 SERVICE off amber MB/HDDBP/HDD2 OK2RM off blue MB/HDDBP/HDD3 SERVICE off amber MB/HDDBP/HDD3 OK2RM off blue =========================== 字段取代单元的操作状态 =========================== --------------------------------- 字段取代单元 (FRU) 的操作状态: --------------------------------- Location Status --------------------------------- MB/SC okay MB/HDDBP/HDD0 present MB/HDDBP/HDD1 present PS0 okay PS1 okay ================================== HW 修订 ================================== ASIC Revisions: ------------------------------------------------------------------- Path Device Status Revision ------------------------------------------------------------------- /pci@1e,600000 pciex108e,80f0 okay 4 /pci@1f,700000 pciex108e,80f0 okay 4 系统 PROM 修订: ---------------------- OBP 4.30.4 2009/08/19 07:18 Sun Fire V215/V245 POST 4.30.4 2009/08/19 07:35 Chassis Serial Number: ---------------------- root@I2000 # 系统配置:Sun Microsystems sun4u Sun Fire V245 系统时钟频率:188 MHz 内存大小:4GB ==================================== CPUs ==================================== E$ CPU CPU CPU Freq Size Implementation Mask Status Location --- -------- ---------- --------------------- ----- ------ -------- 0 1504 MHz 1MB SUNW,UltraSPARC-IIIi 3.4 on-line MB/P0 1 1504 MHz 1MB SUNW,UltraSPARC-IIIi 3.4 on-line MB/P1 ================================== IO 设备 ================================== Bus Freq Slot + Name + Type MHz Status Path Model ------ ---- ---------- ---------------------------- -------------------- pci 188 MB pci10b9,5229 (ide) okay /pci@1e,600000/pci@0/pci@1/pci@0/ide pci 188 MB pci14e4,1668 (network) okay /pci@1e,600000/pci@0/pci@9/pci@0/network@4 pci 188 MB pci14e4,1668 (network) okay /pci@1e,600000/pci@0/pci@9/pci@0/network@4,1 pci 188 MB pci14e4,1668 (network) okay /pci@1e,600000/pci@0/pci@a/pci@0/network pci 188 MB pci14e4,1668 (network) okay /pci@1e,600000/pci@0/pci@a/pci@0/network pci 188 MB scsi-pci1000,50 (scsi-2) LSI,1064 okay /pci@1e,600000/pci@0/pci@a/pci@0/pci@8/scsi@1 ================================== 内存配置 ================================== 区段表: ----------------------------------------------------------------------- 基本地址大小交插系数包含 ----------------------------------------------------------------------- 0x0 2GB 4 BankIDs 0,1,2,3 0x1000000000 2GB 4 BankIDs 16,17,18,19 记忆库表: ----------------------------------------------------------- 物理位置ID ControllerID GroupID 大小交插方式 ----------------------------------------------------------- 0 0 0 512MB 0,1,2,3 1 0 1 512MB 2 0 1 512MB 3 0 0 512MB 16 1 0 512MB 0,1,2,3 17 1 1 512MB 18 1 1 512MB 19 1 0 512MB 内存模块群组: -------------------------------------------------- ControllerID GroupID Labels Status -------------------------------------------------- 0 0 MB/P0/B0/D0 okay 0 0 MB/P0/B0/D1 okay 0 1 MB/P0/B1/D0 okay 0 1 MB/P0/B1/D1 okay 1 0 MB/P1/B0/D0 okay 1 0 MB/P1/B0/D1 okay 1 1 MB/P1/B1/D0 okay 1 1 MB/P1/B1/D1 okay |
1.2.故障定位
通过底层的环境状态显示,并无硬件告警。
通过系统的环境状态显示,并无硬件告警。
由于故障显示未知,因此我们判断:
机器固件版本较低的话可能会出现一些莫名的故障,如误告警,或有告警却可能无法体现故障信息,建议升级微码到最新版本。
升级后,进一步分析故障。
2.故障处理
2.1.先决条件
注意 |
升级微码之前请作好数据备份。如果微码未正确升级完成,则可能会导致数据丢失。 |
2.2.准备项
准备确认项 |
||
类型 |
准备项 |
状态 |
硬件 |
笔记本一台 |
已准备就绪 |
串口线一根 |
已准备就绪 |
|
网线一根 |
已准备就绪 |
|
软件 |
微码包 |
已准备就绪 |
其它 |
||
2.3.操作项
序号 |
操作项 |
1、 |
查看老版本的Firmware 版本。 |
2、 |
从sunsolve 里下载现在最新的版本微码包. |
3、 |
把微码包用bin格式上传到/tmp目录 |
4、 |
将FIRMWARE 信息DOWNLOAD 到SC 闪存里 |
5、 |
关闭操作系统,关闭电源。 |
6、 |
查看keyswitch是否在NORMAL状态.如果处在LOCK状态,把他改到正常状态. |
7、 |
flashupdate -s 127.0.0.1 升级固件 |
8、 |
重起机器,SC使更新的固件生效 |
9、 |
微码升级完成。 |