
[cobalt-users] RE: RaQ3 Mysterious dying continued



Hi Yah,

Grant Stern Replied...

"when i upgraded, the machine booted normally, then, started losing services mysteriously. then all processes started to stick, is that happening? no processes stop, then cpu overloads, then raq go boom? of course this all happened on a friday nite after hours."

This sounds just about right, though would this affect the server copying my backup files via FTP to the backup server?

Still waiting to hear if it's behind the Cisco 1900 series switch...

The colo got back to me this morning and told me it's down to the server being overloaded, and that I should consider sharing the load between multiple servers?

I honestly don't think that this is the case, though I don't know enough about Cobalts etc. to give an educated opinion. 90% of the server's sites are static/brochureware sites, nothing fancy, with maybe a form-to-mail script. Two sites are using mysql and they aren't used very often, and one site uses phpnuke, but with the amount of traffic/visitors it gets I'd say the only user is the site owner. In total this box's average traffic is 13Gb - I've got a customer on another box who uses twice that on a monthly basis.

I have raqbackup running at 10:30pm every night, and that's done its thing long before the 4am cron.daily jobs, so it's not as though the two are choking the machine. I've been sitting watching top most of this morning while the box has been updating webalizer and doing the log rotations, and at most I'm seeing...

 9:26am  up 1 day,  1:44,  1 user,  load average: 1.06, 0.98, 0.54
75 processes: 73 sleeping, 2 running, 0 zombie, 0 stopped
CPU states:  3.4% user,  4.2% system,  0.0% nice, 92.3% idle
Mem:  517112K av, 434588K used,  82524K free, 178716K shrd, 197804K buff
Swap: 131536K av,   1740K used, 129796K free                152108K cached
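
As a rough sanity check on the "overloaded" theory, the memory lines above can be broken down a little further. In this snapshot used + free equals the total, so "used" includes the buffers and cache the kernel can hand back on demand. The sketch below is only an illustration: the function names are made up for this example, and it assumes the top header layout shown above.

```python
import re

# The two memory lines copied from the top snapshot above.
TOP_HEADER = """\
Mem:  517112K av, 434588K used,  82524K free, 178716K shrd, 197804K buff
Swap: 131536K av,   1740K used, 129796K free                152108K cached
"""


def parse_top_memory(text):
    """Return {"Mem": {...}, "Swap": {...}} with values in kilobytes.

    Hypothetical helper: it only understands '<number>K <label>' pairs
    on lines starting with 'Mem:' or 'Swap:'.
    """
    stats = {}
    for line in text.splitlines():
        name, sep, rest = line.partition(":")
        if not sep or name not in ("Mem", "Swap"):
            continue
        pairs = re.findall(r"(\d+)K\s+(\w+)", rest)
        stats[name] = {label: int(kb) for kb, label in pairs}
    return stats


def app_used_kb(stats):
    """Memory used minus buffers and page cache (cache sits on the Swap line)."""
    return stats["Mem"]["used"] - stats["Mem"]["buff"] - stats["Swap"]["cached"]


if __name__ == "__main__":
    stats = parse_top_memory(TOP_HEADER)
    print("used excluding buffers/cache:", app_used_kb(stats), "K")
    print("swap used:", stats["Swap"]["used"], "K of", stats["Swap"]["av"], "K")
```

On these figures that leaves 84676K of the 517112K (about 16%) in use once buffers and cache are excluded, with 1740K of swap in use.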

What I have noticed is that after the machine's booted up I have: Swap: 131536K av, 0K used. After raqbackup is run the K's used creeps up slightly; I ran it again and it crept up again, but the K's used never seems to creep back down?
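
To put numbers on the swap creep rather than eyeballing top, something like the sketch below could log swap usage at intervals around a raqbackup run. It is a minimal sketch under assumptions: it assumes /proc/meminfo has `SwapTotal:` and `SwapFree:` lines in kB, and that a Python 3 interpreter is available (a stock RaQ3 would not have one). The function names and the default interval are invented for illustration.

```python
import time


def swap_used_kb(meminfo_text):
    """Return SwapTotal - SwapFree (kB) from the text of /proc/meminfo.

    Assumes 'SwapTotal:' and 'SwapFree:' lines are present; any line
    that is not '<key>: <number> ...' is ignored.
    """
    fields = {}
    for line in meminfo_text.splitlines():
        key, sep, rest = line.partition(":")
        if not sep:
            continue
        parts = rest.split()
        if parts and parts[0].isdigit():
            fields[key.strip()] = int(parts[0])
    return fields["SwapTotal"] - fields["SwapFree"]


def log_swap(path="/proc/meminfo", interval_s=60, samples=5):
    """Print a timestamped swap-used reading every interval_s seconds."""
    for _ in range(samples):
        with open(path) as f:
            used = swap_used_kb(f.read())
        print(time.strftime("%Y-%m-%d %H:%M:%S"), "swap used: %dK" % used)
        time.sleep(interval_s)


if __name__ == "__main__":
    log_swap()
```

Swap that stays in use is not by itself a sign of trouble: pages that have been swapped out generally stay there until something touches them again or the owning process exits, so a small figure that never drops is expected behaviour.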

Dave Thurman Replied...

"is it possible you have a bad cat cable, maybe just the slightest movement, or vibration makes and breaks contact?? Just a thought. You have to physically move/get in the box."

Could be, Dave. I'll get the colo to check it out.

Keep you updated...

Regards

Chae