Thread: Nagios and SMS Paging ...
I'm getting SMS messages from Nagios periodically telling me that there are problems with the servers that I'm logged into working on ... basically, I'm getting paged for someone else's network problems *sigh* Josh suggested (since I didn't think about it) asking here if someone has set this up ... Thanks ... ---- Marc G. Fournier Hub.Org Networking Services (http://www.hub.org) Email: scrappy@hub.org Yahoo!: yscrappy ICQ: 7615664
Marc G. Fournier wrote: > > I'm getting SMS messages from Nagios periodically telling me that > there are problems with the servers that I'm logged into working on > ... basically, I'm getting paged for someone else's network problems > *sigh* > > Josh suggested (since I didn't think about it) asking here if someone > has set this up ... We did :) however if the network problem is on our end then we will be paged as well. It sounds like you are getting paged for network problems in between, or possibly between us and one of the legs into your network. When was your last page? I want to verify against our reports. Sincerely, Joshua D. Drake > > Thanks ... > > ---- > Marc G. Fournier Hub.Org Networking Services > (http://www.hub.org) > Email: scrappy@hub.org Yahoo!: yscrappy ICQ: > 7615664 > > ---------------------------(end of broadcast)--------------------------- > TIP 3: if posting/reading through Usenet, please send an appropriate > subscribe-nomail command to majordomo@postgresql.org so that your > message can get through to the mailing list cleanly -- Command Prompt, Inc., home of Mammoth PostgreSQL - S/ODBC and S/JDBC Postgresql support, programming shared hosting and dedicated hosting. +1-503-667-4564 - jd@commandprompt.com - http://www.commandprompt.com PostgreSQL Replicator -- production quality replication for PostgreSQL
Attachment
On Tue, 11 Jan 2005, Joshua D. Drake wrote: > Marc G. Fournier wrote: > >> >> I'm getting SMS messages from Nagios periodically telling me that there are >> problems with the servers that I'm logged into working on ... basically, >> I'm getting paged for someone else's network problems *sigh* >> >> Josh suggested (since I didn't think about it) asking here if someone has >> set this up ... > > We did :) however if the network problem is on our end then we will be paged > as well. > It sounds like you are getting paged for network problems in between, or > possibly > between us and one of the legs into your network. > > When was your last page? I want to verify against our reports. Just before I email'd this out to the list ... the first 'legit' one I've had so far was this afternoon, when there was a short outage ... before that, I got paged at 5am telling me the network was down, was up, was down, was up, until about 6:30am ... and all the while, I'd come to check the servers and they were fine ... I have enough problems getting a good nights sleep as it is, let alone getting false alarms :( Please remove me from the page list ... personal pages are appreciated, since it means that a problem has been verified to be at our end first ... but auto-pages are definitely not appreciated, considering the # of 'failure points' that can exist between remote servers ... ---- Marc G. Fournier Hub.Org Networking Services (http://www.hub.org) Email: scrappy@hub.org Yahoo!: yscrappy ICQ: 7615664
> -----Original Message----- > From: pgsql-www-owner@postgresql.org > [mailto:pgsql-www-owner@postgresql.org] On Behalf Of Marc G. Fournier > Sent: 12 January 2005 05:07 > To: Joshua D. Drake > Cc: pgsql-www@postgresql.org > Subject: Re: [pgsql-www] Nagios and SMS Paging ... > > Please remove me from the page list ... personal pages are > appreciated, > since it means that a problem has been verified to be at our > end first ... > but auto-pages are definitely not appreciated, considering the # of > 'failure points' that can exist between remote servers ... Perhaps all auto-detected failures should be posted to the sysadmins list when it's ready? Which reminds me, did you fix the slaves list yet Marc? /D
On Wed, 12 Jan 2005, Dave Page wrote: > > >> -----Original Message----- >> From: pgsql-www-owner@postgresql.org >> [mailto:pgsql-www-owner@postgresql.org] On Behalf Of Marc G. Fournier >> Sent: 12 January 2005 05:07 >> To: Joshua D. Drake >> Cc: pgsql-www@postgresql.org >> Subject: Re: [pgsql-www] Nagios and SMS Paging ... >> >> Please remove me from the page list ... personal pages are >> appreciated, >> since it means that a problem has been verified to be at our >> end first ... >> but auto-pages are definitely not appreciated, considering the # of >> 'failure points' that can exist between remote servers ... > > Perhaps all auto-detected failures should be posted to the sysadmins > list when it's ready? > > Which reminds me, did you fix the slaves list yet Marc? slaves list? don't have anything in my box about problems with it ... ---- Marc G. Fournier Hub.Org Networking Services (http://www.hub.org) Email: scrappy@hub.org Yahoo!: yscrappy ICQ: 7615664
> -----Original Message----- > From: Marc G. Fournier [mailto:scrappy@postgresql.org] > Sent: 12 January 2005 15:31 > To: Dave Page > Cc: Marc G. Fournier; Joshua D. Drake; pgsql-www@postgresql.org > Subject: RE: [pgsql-www] Nagios and SMS Paging ... > > > slaves list? don't have anything in my box about problems with it ... Sounds like you're over-filtering again :-) http://archives.postgresql.org/pgsql-www/2005-01/msg00247.php I assume you missed this as well: http://archives.postgresql.org/pgsql-www/2005-01/msg00235.php And this one: http://archives.postgresql.org/pgsql-www/2005-01/msg00098.php Receipt requested! :-) /D
I removed the sms but have left your email address if that is o.k. Dave Page wrote: > > > >>-----Original Message----- >>From: pgsql-www-owner@postgresql.org >>[mailto:pgsql-www-owner@postgresql.org] On Behalf Of Marc G. Fournier >>Sent: 12 January 2005 05:07 >>To: Joshua D. Drake >>Cc: pgsql-www@postgresql.org >>Subject: Re: [pgsql-www] Nagios and SMS Paging ... >> >>Please remove me from the page list ... personal pages are >>appreciated, >>since it means that a problem has been verified to be at our >>end first ... >>but auto-pages are definitely not appreciated, considering the # of >>'failure points' that can exist between remote servers ... > > > Perhaps all auto-detected failures should be posted to the sysadmins > list when it's ready? > > Which reminds me, did you fix the slaves list yet Marc? > > /D -- Command Prompt, Inc., your source for PostgreSQL replication, professional support, programming, managed services, shared and dedicated hosting. Home of the Open Source Projects plPHP, plPerlNG, pgManage, and pgPHPtoolkit. Contact us now at: +1-503-667-4564 - http://www.commandprompt.com
Attachment
On Wed, 12 Jan 2005, Joshua D. Drake wrote: > > I removed the sms but have left your email address if that is o.k. That's perfect ... in fact, if you can put in my scrappy@ns.sympatico.ca one as a secondary, then if the network is totally down, I'll still get the notices ... Again, please note that I have no problems with getting verified SMS pages, else I would never have given out the address ... its just getting pages at 5am to find out that there is nothing wrong with the servers tends to hurt just a weeeeee bit :) Specially when the wife smacks me cause it woke her up :) Thanks ... > > Dave Page wrote: >> >> >>> -----Original Message----- >>> From: pgsql-www-owner@postgresql.org >>> [mailto:pgsql-www-owner@postgresql.org] On Behalf Of Marc G. Fournier >>> Sent: 12 January 2005 05:07 >>> To: Joshua D. Drake >>> Cc: pgsql-www@postgresql.org >>> Subject: Re: [pgsql-www] Nagios and SMS Paging ... >>> >>> Please remove me from the page list ... personal pages are appreciated, >>> since it means that a problem has been verified to be at our end first ... >>> but auto-pages are definitely not appreciated, considering the # of >>> 'failure points' that can exist between remote servers ... >> >> >> Perhaps all auto-detected failures should be posted to the sysadmins >> list when it's ready? >> >> Which reminds me, did you fix the slaves list yet Marc? >> >> /D > > > -- > Command Prompt, Inc., your source for PostgreSQL replication, > professional support, programming, managed services, shared > and dedicated hosting. Home of the Open Source Projects plPHP, > plPerlNG, pgManage, and pgPHPtoolkit. > Contact us now at: +1-503-667-4564 - http://www.commandprompt.com > > ---- Marc G. Fournier Hub.Org Networking Services (http://www.hub.org) Email: scrappy@hub.org Yahoo!: yscrappy ICQ: 7615664
On Wed, 12 Jan 2005, Dave Page wrote: > > >> -----Original Message----- >> From: Marc G. Fournier [mailto:scrappy@postgresql.org] >> Sent: 12 January 2005 15:31 >> To: Dave Page >> Cc: Marc G. Fournier; Joshua D. Drake; pgsql-www@postgresql.org >> Subject: RE: [pgsql-www] Nagios and SMS Paging ... >> >> >> slaves list? don't have anything in my box about problems with it ... > > Sounds like you're over-filtering again :-) > > http://archives.postgresql.org/pgsql-www/2005-01/msg00247.php Fixed ... > I assume you missed this as well: > > http://archives.postgresql.org/pgsql-www/2005-01/msg00235.php transfer of 'pgadmin.org/IN' from 194.217.48.34#53: failed while receiving responses: REFUSED > And this one: > > http://archives.postgresql.org/pgsql-www/2005-01/msg00098.php looking into this one ... ---- Marc G. Fournier Hub.Org Networking Services (http://www.hub.org) Email: scrappy@hub.org Yahoo!: yscrappy ICQ: 7615664
> -----Original Message----- > From: Marc G. Fournier [mailto:scrappy@postgresql.org] > Sent: 16 January 2005 03:11 > To: Dave Page > Cc: Joshua D. Drake; pgsql-www@postgresql.org > Subject: RE: [pgsql-www] Nagios and SMS Paging ... > > On Wed, 12 Jan 2005, Dave Page wrote: > > > > > > >> -----Original Message----- > >> From: Marc G. Fournier [mailto:scrappy@postgresql.org] > >> Sent: 12 January 2005 15:31 > >> To: Dave Page > >> Cc: Marc G. Fournier; Joshua D. Drake; pgsql-www@postgresql.org > >> Subject: RE: [pgsql-www] Nagios and SMS Paging ... > >> > >> > >> slaves list? don't have anything in my box about problems > with it ... > > > > Sounds like you're over-filtering again :-) > > > > http://archives.postgresql.org/pgsql-www/2005-01/msg00247.php > > Fixed ... Ta. > > I assume you missed this as well: > > > > http://archives.postgresql.org/pgsql-www/2005-01/msg00235.php > > transfer of 'pgadmin.org/IN' from 194.217.48.34#53: failed > while receiving responses: REFUSED Hmm: zone "pgadmin.org" IN { type master; file "master/pgadmin.org"; allow-update { none; }; allow-transfer { 200.46.204.2; }; notify yes; }; What needs adding/changing? Also, the original problem was the lookup of wwwmaster.postgresql.org - is that OK now? Regards, Dave.