[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

RE: [cobalt-users] Everything EXCEPT SSH is Dead - RaQ 4



I came across something interesting.  I did a hostname, and it came back
with the TLD.  I did a hostname xxx.xxx.xxx.xxx with my IP to match what
it tells me when logging into the machine via SSH - and magically
everything starts working again once services are restarted.  The only
service that hangs still is Sendmail.  After a reboot however, the whole
process must be done again, as the server reverts to the hostname as the
TLD, which means none of the services work again. 

Maybe this sparks someone's interests and ideas? :-)

As for the top, I did this and saw nothing eating up processor time.
Just lots of hung sendmail queues waiting for responses, which disappear
after about 2 minutes.  I can see lotsa mail trying to hit the server!


-----Original Message-----
From: cobalt-users-admin@xxxxxxxxxxxxxxx
[mailto:cobalt-users-admin@xxxxxxxxxxxxxxx] On Behalf Of Peter Masloch
Sent: Saturday, June 08, 2002 11:19 AM
To: cobalt-users@xxxxxxxxxxxxxxx
Subject: RE: [cobalt-users] Everything EXCEPT SSH is Dead - RaQ 4


did you make a "top" to see what is running? What are the CPU states?
Some of the log messages indicating that DNS is not working but you said
already that it was not running. Like i said, make a "top" and see how
many processes are running, look at %CPU and %MEM.


>>When you "connect" from a remote machine does SMTP / FTP send the
>sign-on msg
>
>Nothing at all.  The machine accepts the connection, but goes
>nowhere from there.  No message, just a minute or so of 
>nothing and then disconnect.
>
>>What happens if you restart the services? /etc/rc.d/init.d/initd
>restart
>>/etc/rc.d/init.d/httpd restart /etc/rc.d/init.d/admserv restart 
>>/etc/rc.d/init.d/sendmail restart
>
>Nothing.  The services take a LONG time to restart, but don't
>send any errors back.
>
>Here's a copy of what I get:
>
>/etc/rc.d/init.d/initd restart - this doesn't work.  I checked
>on my other RaQ which is working, and this doesn't work there 
>either.  I did a whereis initd, but couldn't find it anywhere.
>
>[root log]# /etc/rc.d/init.d/httpd restart
>Shutting down Web Service:
>Setting up Web Service: chiliasp: module started, version
>3.6.2.34 (about 2 minute delay) /usr/sbin/httpd [root log]#
>
>[root log]# /etc/rc.d/init.d/admserv restart
>Stopping admin web server:
>Starting admin web server: /usr/sbin/httpd.admsrv
>(about 2 minute delay before the /usr/sbin/httpd.admsrv shows
>up though) [root log]#
>
>[root log]# /etc/rc.d/init.d/sendmail restart
>Shutting down Mail Service: sendmail ok
>Setting up Mail Service:
>[root log]#
>
>>What kind of errors do you get when you restart the services? What do
>the log files say?
>
>Here's what I see in the logs:
>
>/var/log/httpd/error
>[Fri Jun  7 11:03:43 2002] [notice] caught SIGTERM, shutting
>down [Fri Jun  7 11:11:21 2002] [alert] mod_unique_id: unable to
>gethostbyname("www.ilumen.ca")
>[Fri Jun  7 11:46:47 2002] [alert] mod_unique_id: unable to
>gethostbyname("www.ilumen.ca")
>[Fri Jun  7 12:23:14 2002] [alert] mod_unique_id: unable to
>gethostbyname("www.ilumen.ca")
>..the alerts go on every half hour from there....
>
>/var/log/messages
>Jun  8 04:08:40 www syslogd 1.3-3: restart.
>Jun  8 04:13:39 www proftpd[11504]: www.ilumen.ca - Fatal:
>unable to determine IP address of `www.ilumen.ca'. Jun  8 
>04:19:01 www proftpd[12069]: www.ilumen.ca - Fatal: unable to 
>determine IP address of `www.ilumen.ca'. Jun  8 04:24:26 www 
>proftpd[12308]: www.ilumen.ca - Fatal: unable to determine IP 
>address of `www.ilumen.ca'. ...same message keeps appearing again....
>
>/var/log/maillog
>Jun  8 10:48:47 www sendmail[31001]: g58Gk7M31001:
>from=<ggofnews-admin@xxxxxxxxxxxxxx>, size=594574, class=0, 
>nrcpts=0, proto=ESMTP, daemon=M$ Jun  8 10:49:23 www 
>sendmail[29772]: NOQUEUE: [205.200.10.68] did not issue 
>MAIL/EXPN/VRFY/ETRN during connection to MTA Jun  8 10:49:37 
>www in.qpopper[31039]: EOF from  at 24.65.45.11
>(24.65.45.11): [0] 29 (Illegal seek); 0 (Success)
>Jun  8 10:49:37 www in.qpopper[31039]: (null) at 24.65.45.11
>(24.65.45.11): -ERR POP EOF or I/O Error: 29 (Illegal seek); 0 
>(Success) ..this is just a sample, there are literally 
>hundreds with similar information.
>
>/var/log/secure
>Jun  8 10:53:19 www in.proftpd[31301]: connect from 127.0.0.1
>Jun  8 10:53:38 www in.qpopper[31321]: connect from 
>24.65.45.11 Jun  8 10:55:18 www in.qpopper[31381]: connect 
>from 24.65.45.11 Jun  8 10:56:56 www in.qpopper[31439]: 
>connect from 24.65.42.63 Jun  8 10:56:58 www 
>in.qpopper[31440]: connect from 24.65.45.11 Jun  8 10:58:38 
>www in.qpopper[31520]: connect from 24.65.45.11 Jun  8 
>10:58:41 www in.proftpd[31521]: connect from 127.0.0.1 ..again 
>just a sample.
>
>...is there anywhere else I should be looking?
>
>_______________________________________________
>cobalt-users mailing list
>cobalt-users@xxxxxxxxxxxxxxx
>To Subscribe or Unsubscribe, please go to:
>http://list.cobalt.com/mailman/listinfo/cobalt->users
>
>


_______________________________________________
cobalt-users mailing list
cobalt-users@xxxxxxxxxxxxxxx
To Subscribe or Unsubscribe, please go to:
http://list.cobalt.com/mailman/listinfo/cobalt-users