[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [cobalt-users] Qube2 Crashes without obvious cause



Hi Mike,

on 8/12/00 2:18 PM, Mike Vanecek at nospam99@xxxxxxxxxxxx wrote:

> :>Any suggestions as to how to track down the real problem?
> 
> Power failures?  

Client says no, besides the Qube should restart when power is back ... some
of the failures are around 0100 - 0200 and the Qube is still lifeless when
the office opens at 0930.  Others have been during business hours (e.g. this
afternoon) ... staff would have noticed the disruption.

It would be nice if the box had a hardware watchdog to do a hardware reset
if/when it crashed.

> Cleaning crew pulling the plug out?

Nope.

> Anything in the
> /var/cobalt/ monitor logs (since they get updated every 15 minutes - unless
> you have changed crontab like I have to do a monitor every 6 hours)? You can
> check memory and disk usage there.

Hadn't looked there it might be interesting to correlate the "stop" time in
/var/log/messages with these logs.  It might also be interesting to disable
monitoring altogether to see if its a "monitor" which is causing the
problem, but wouldn't that cause it to crash every 15 minutes?

> What changes have been made to the system
> or the environment that might suggest a cause?

Up until now none ... its as delivered.  The user had been complaining about
"File System Full" email so messages so I moved /usr/doc to /home/doc to
free up a little space on /.

[root /root]# df -m
Filesystem         MB-blocks    Used Available Capacity Mounted on
/dev/hda1                290     211       79     73%   /
/dev/hda3                193      25      168     13%   /var
/dev/hda4              18172     577    17595      3%   /home
[root /root]#

It was running at 78%.

> Ain't computers fun?

Yep.  Life would be boring otherwise.  ;-)

Hardware replacement looks like the only option, but to get a replacement
you must first "prove" that is a hardware problem.

Cheers,  Malcolm

-- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- --

                       Information Alchemy Pty Ltd
                             ACN 089 239 305
                           Canberra, Australia

Malcolm McLeary                                Mobile:     0412 636 086
Managing Director                              Email:  mmcleary@xxxxxxx

     This message was sent using Outlook Express 5.0 for Macintosh.