[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[cobalt-users] RaQ4r In Constant Reboot! - URGENT!



Hello List!

Please help me stop my RaQ4r from acting like a Windows box...

I have a slight, *ha*, problem...  Our one and only RaQ4r is stuck in a
constant reboot loop..  It seems that it'll boot fine, and everything will
appear ok.  But upon closer inspection, "cat /proc/mdstat" we see that the
second drive is being rebuilt by the first.  Ok, fine..  But the time just
keeps going up and up and up and up..  Hmmm, same thing happening in the web
admin panel..  Not good...  Do some more research, "ps aux" & "top", and it
seems that ChiliSoft is eating lots more ram and CPU resources than anything
else..  Ok, so let's do this, "cd /home/chiliasp/asp-apache-3000" then
"./stopcaspd", GREAT, now when I "cat /proc/mdstat" and check the web admin
panel, the time is actually going down and the percentage complete is now
going up!  Wonderful!

But that didn't solve the problem completely, apparently, there is ALSO
something else compounding the situation.  Because within 15 - 20 minutes of
the RAID finishing it's resync, it magically reboots again..  Sometimes it
doesn't even reboot..  Sometimes it just locks and ALL lights on the front
and back go dead!  Only the LCD is left lit up, BUT the buttons, "S" and "E"
and the arrows don't work at all..  I tried holding in all of them (one by
one), nothing..  Flip switch in back, try again..  Hmmm, starting to sound
like a Window's box..

Now, we did notice that when the indicator lights on the front blink, so
does the LCD screen..  You know, like a power drain is happening...  We did
get it back and stable by stopping Chilisoft from starting and unplugging
the 2nd hard drive..  All seems to well again, and the box has so much free
resources, I just can't believe it...  And when the indicator lights blink,
the LCD doesn't blink anymore, at least it's not noticeable..

Now, go with me here..  This is my idea as to what is happening, and maybe
you guys and gals could help me out...  I think that the CPU is not getting
enough power (dimming of LCD during activity), which is one reason why it's
stable again after removing 2nd drive?  Also, and ChiliSoft, which eats
system resources, could be putting unnecessary strain on the CPU causing it
to consume more and more power until finally it BUSTS and reboots?

Now this is just a thought, based on the dimming of the LCD and the fact
that the drives wouldn't rebuild unless ChiliSoft is shut off..  And the
drives are showing up as ok..  They show a normal response (cat
"/proc/mdstat") when RAID gets to finish it's rebuild..  But the system is
still unstable with both drives connected, but unplug one drive (power
cable) and BAM, stable Linux again..

I've been in the archives and every where I can go, please tell me someone
knows something..  Or even if I am on the right track...

Thanks List!

-Jamie-
http://w-c.net
WebConnection.Net, Inc.
In a mad world, only the mad are sane...