[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
RE: [cobalt-users] RAQ550 keeps rebooting
- Subject: RE: [cobalt-users] RAQ550 keeps rebooting
- From: "Bob Noordam" <mac@xxxxxxxx>
- Date: Wed May 5 11:04:01 2004
- List-id: Mailing list for users to share thoughts on Sun Cobalt products. <cobalt-users.list.cobalt.com>
>
> One of our raq550 failed yesterday morning. It keeps
> rebooting over and over
> again once it has started up.
> So I went to pick up at our datacenter and tried to see
> what's wrong at the
> office. Just before the reboot I can see the hardware problem led (the
> wrench) flashing. According to the manual : (amber System
> Fault LED) lights
> up to indicate a system fault such as improper power supply voltages,
> enclosure or power
> supply fan problems, high temperature, hard disk drive failure, memory
> failure or low CMOS lithium-battery voltage.
> Last weekend I received one mail from the system that one or
> more components
> were getting hot.
> I've checked the fans and they work pretty well IMO. Tried
> other memory but
> the problem remains.
> Is there anything else I should have a look at ?
>
> Thanks
> John
>
I observed the same behaviour when one disk went down, corrupting some data
on the second at the same time. The 'good' disk didn't mount /var anymore.
The key to what was going on, was monitoring the boot process with a serial
null modem cable (115200,n,8,1) and see the system report a lot of 'file not
found's' just before rebooting. The solution in this case was mounting the
'good' disk in a SuSE 8.2 system (it has xfs support in the kernel by
default) and run xfs_repair -L /dev/hdc2
Bob.