
Re: [cobalt-users] RaQ3 Automatic Reboot?



On Mon, 23 Apr 2001, Wayne Sagar wrote:

> >man watchdog
> 
> Well according to the tech's at the NOC... (it took several hours to track
> it down) they did do a manual reset for me....

Gotta love good tech support ;)

I actually don't know offhand whether the RaQ3 has the watchdog enabled or
not...

Here's the top of the man page for it...

       Watchdog  is  a daemon that checks if your system is still
       working. If programs in user space are no longer  executed
       it will hard reset the system.

       The kernel provides /dev/watchdog, which when open must be
       written to within a minute or  the  machine  will  reboot.
       Each  write delays the reboot time another minute. After a
       minute the watchdog hardware will cause the reset. In  the
       case  of  the software watchdog the ability to reboot will
       depend on the state of the machines and interrupts.
...
       Watchdog itself does several additional tests to check the
       system status:

       Check whether the process table is full.

       Check whether there is enough free memory available.

       Check whether some given files are accessible.

       Check whether some given files change in a given interval.

       Check  whether the average work load exceeds a predefined
       maximal value.

       Check whether a file table overflow occurred.

       Check  whether a given process (specified by a pid file)
       is still running.

       Check whether some given IP addresses answer to a ping.

       Check whether some given network interfaces received  some
       traffic.

       Check the temperature (if available).
...
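
To make the /dev/watchdog part concrete, here's a rough sketch in Python of the "write within a minute or the box reboots" idea. This is not how the watchdog daemon itself is written. The function name pet_watchdog and its parameters are made up for illustration, and the 'V' write on close is the "magic close" convention from later Linux watchdog drivers, which I'm assuming here; whether a given kernel or driver honors it (or is built so the timer can't be stopped at all) is something to check first.

```python
import os
import time


def pet_watchdog(device_path, writes, interval_s, disarm=True):
    """Write a keepalive byte to device_path `writes` times, interval_s apart.

    Hypothetical helper for illustration. Returns the number of keepalive
    writes performed.
    """
    # Opening the real device is what arms the timer on Linux.
    fd = os.open(device_path, os.O_WRONLY)
    done = 0
    try:
        for i in range(writes):
            # Any write counts as a keepalive and pushes the reboot back.
            os.write(fd, b"\0")
            done += 1
            if i < writes - 1:
                time.sleep(interval_s)
        if disarm:
            # Assumption: the driver supports "magic close", where 'V' as
            # the last byte before close() asks it to disarm the timer.
            os.write(fd, b"V")
    finally:
        os.close(fd)
    return done
```

Try it against a scratch file first, e.g. pet_watchdog("/tmp/fake-watchdog", 3, 1) after creating that file. Only point it at /dev/watchdog, with an interval well under the timeout, if you really mean to arm the thing.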

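A couple of the extra tests in that list are easy to picture in code too. Here's a minimal sketch of a load check and a pid-file check, assuming a Unix-like system. The names load_too_high and pid_alive are hypothetical, and the real daemon takes its thresholds and file names from its own config rather than from function arguments like these.

```python
import os


def load_too_high(max_load_1):
    """True if the 1-minute load average exceeds max_load_1."""
    return os.getloadavg()[0] > max_load_1


def pid_alive(pidfile):
    """True if the pid stored in pidfile belongs to a running process."""
    try:
        with open(pidfile) as f:
            pid = int(f.read().strip())
    except (OSError, ValueError):
        # Missing, unreadable, or garbage pid file: treat as not running.
        return False
    if pid <= 0:
        return False
    try:
        # Signal 0 delivers nothing; it only checks that the pid exists.
        os.kill(pid, 0)
    except ProcessLookupError:
        return False
    except PermissionError:
        # The process exists but is owned by someone else.
        return True
    return True
```

Something like pid_alive("/var/run/sshd.pid") is the sort of call I have in mind (that path is just an example). The real daemon would reboot the box when a check like this fails; the sketch only reports.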

Very kewl proggy :)
-
PS: don't even get me started on NSI... they overrode a password-protected
admin contact and moved one of my customer's domains without the
customer's permission, grrrr..... why the (*&(* do they have passwords?

gsh