用户打来电话说下面各个学校都不能上网了!这个事情有点大,远程登录用户的安全网关,发现安全网关可以登录,运行正常。但是安全网关与核心7609相连的接口是无连接状态,看来问题应该是出在7609上,后来用户反映下面各学校的交换机上连接口指示类都不亮,进一步确定了问题的位置——CISCO7609

匆忙赶到用户现场,发现7609引擎板上的灯全部都亮起了红色,这可不是一个好现象。据用户反映该机房在近两天停电了两次,故障是发生在第二次停电后重起。

通过console口接入发现7609进入到了如下模式:
rommon 1 >

虽然进入到了rommon模式,但最起码能够进来,还是不错的!

通过dir命令查看,发现系统文件仍在,如下是dir查看的结果

rommon 5 > dir bootdisk:
Initializing ATA monitor library...
Directory of bootdisk:
2         160648132 -rw-     c7600rsp72043-ipservices-mz.151-3.S4.bin
19613     33554432  -rw-     sea_log.dat
23709     461976    -rw-     crashinfo_20141013-160619-BJ


发现系统文件后,我按照常规作法,对7609进行了手动引导。

rommon 6 > boot bootdisk:c7600rsp72043-ipservices-mz.151-3.S4.bin

但是引导的结果并不尽如人意!!!,以下是手动引导过程:
Initializing ATA monitor library...
Self extracting the p_w_picpath... [OK]
Self decompressing the p_w_picpath : ##################################################################################################################################################################################################################################################################################################### [OK]
*** No sreloc section
              Restricted Rights Legend

Use, duplication, or disclosure by the Government is
subject to restrictions as set forth in subparagraph
(c) of the Commercial Computer Software - Restricted
Rights clause at FAR sec. 52.227-19 and subparagraph
(c) (1) (ii) of the Rights in Technical Data and Computer
Software clause at DFARS sec. 252.227-7013.

           cisco Systems, Inc.
           170 West Tasman Drive
           San Jose, California 95134-1706

Cisco IOS Software, c7600rsp72043_sp Software (c7600rsp72043_sp-IPSERVICES-M), Version 15.1(3)S4, RELEASE SOFTWARE (fc2)
Technical Support: http://www.cisco.com/techsupport
Copyright (c) 1986-2012 by Cisco Systems, Inc.
Compiled Wed 01-Aug-12 15:21 by prod_rel_team

*Aug  4 08:43:10.427: %SYS-SP-3-LOGGER_FLUSHING: System pausing to ensure console debugging output.
Firmware compiled 20-Jan-11 16:56 by integ Build [100]
*Aug  4 08:43:03.275: %SCHED-7-WATCH: Attempt to set uninitialized watched boolean (address 0). -Process= "", ipl= 3, pid= 3
-Traceback= 81BC060z 837FA8Cz 8844E10z 8844FA4z 8C7C344z 90CA7F0z 91B6D80z 9049E24z 8C7BE4Cz 8397F8Cz 9012F4Cz 9012F4Cz 8398048z 90127B4z 80AA630z 8D6802Cz
*Aug  4 08:43:06.531: %PFREDUN-6-ACTIVE: Initializing as ACTIVE processor
*Aug  4 08:43:10.427: %OIR-SP-6-CONSOLE: Changing console ownership to route processor

System Bootstrap, Version 12.2(33r)SRE, RELEASE SOFTWARE (fc1)
Technical Support: http://www.cisco.com/techsupport
Copyright (c) 2011 by cisco Systems, Inc.
C7600-RSP720-10GE/RP platform with 1048576 Kbytes of main memory
rommon 1 >

经过一翻引导,7609再次进入了rommon模式。此时我的心情不太爽!!!有问题了!

走到这里可能有人跟我一样被上面的显示的内容整蒙了!手动引导竟然不行!!!我在网上查了一下,的确有哥们碰到了类似的情况,重灌了系统也不行,但是论坛上并没有给出具体的解决办法,有人说7609的子卡坏了,有人建议把只保留引擎重新启动。我也曾尝试只保留引擎重启可是依然不行。

难道试着灌一下IOS?还有其它的方法么?经过一翻思量和商讨,决定把灌IOS放在最后,在这之前要进行一个重要的操作,更改寄存器值!不能确保一定好用,但是也不能说一定没用!

rommon 1 > confreg 0x2102
You must reset or power cycle for new config to take effect

修改寄存器值后需要重启才能生效(其实在改2102之前我还做过别的修改)

对系统进行Reset后,发现7609在一步步进行正常启动

Resetting .......
System Bootstrap, Version 12.2(33r)SRE, RELEASE SOFTWARE (fc1)
Technical Support: http://www.cisco.com/techsupport
Copyright (c) 2011 by cisco Systems, Inc.
C7600-RSP720-10GE/RP platform with 1048576 Kbytes of main memory

Download Start
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!
Download Completed! Booting the p_w_picpath.
Self decompressing the p_w_picpath : ############################################################################################################################################################################################################################################################################################################################################################################################################################################################################################################################################################################## [OK]
*** No sreloc section
              Restricted Rights Legend

Use, duplication, or disclosure by the Government is
subject to restrictions as set forth in subparagraph
(c) of the Commercial Computer Software - Restricted
Rights clause at FAR sec. 52.227-19 and subparagraph
(c) (1) (ii) of the Rights in Technical Data and Computer
Software clause at DFARS sec. 252.227-7013.

           cisco Systems, Inc.
           170 West Tasman Drive
           San Jose, California 95134-1706



Cisco IOS Software, c7600rsp72043_rp Software (c7600rsp72043_rp-IPSERVICES-M), Version 15.1(3)S4, RELEASE SOFTWARE (fc2)
Technical Support: http://www.cisco.com/techsupport
Copyright (c) 1986-2012 by Cisco Systems, Inc.
Compiled Wed 01-Aug-12 15:14 by prod_rel_team

Cisco CISCO7609-S (M8500) processor (revision 1.0) with 917504K/65536K bytes of memory.
Processor board ID FXS1638Q19L
 BASEBOARD: RSP720-10GE
 CPU: MPC8548_E, Version: 2.1, (0x80390021)
 CORE: E500, Version: 2.2, (0x80210022)
 CPU:1200MHz, CCB:400MHz, DDR:200MHz,
 L1:    D-cache 32 kB enabled
        I-cache 32 kB enabled

Last reset from s/w reset
1 Virtual Ethernet interface
3 Gigabit Ethernet interfaces
2 Ten Gigabit Ethernet interfaces
3964K bytes of non-volatile configuration memory.

500472K bytes of Internal ATA PCMCIA card (Sector size 512 bytes).
interface GigabitEthernet1/1


后面检测板卡的过程我就不贴出来了。

到最后系统终于启动正常,为了进一步测试我把电源断开,再重新加电,系统依然正常启动!至此,故障解决了!


胡思乱想:我在想是不是由于用户那里的两次断电,加电过程,由于电压、电流高低不同,导致瞬间系统的寄存器值发生改变,以导致了后来的故障!