Re: Infrastructure monitoring - Mailing list pgsql-www

From Guido Barosio
Subject Re: Infrastructure monitoring
Date
Msg-id f7f6b4c70601131851r3af24b00y8e92376f19f6f92e@mail.gmail.com
In response to Re: Infrastructure monitoring  ("Marc G. Fournier" <scrappy@postgresql.org>)
List pgsql-www
Actually, it seems to be as easy as issuing a GET request to search.postgresql.org.
If a script can handle the HTTP status codes, then alerts could be triggered on failures.
 
The search failure was due to a 503 error being dispatched from the server.
 
 
Am I wrong?
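
To make the idea concrete, here is a minimal sketch of such a check in Python, using only the standard library. The function names (fetch_status, needs_alert, check) and the rule "anything that is not a 2xx/3xx response is an alert" are assumptions made for illustration, not something already running on the infrastructure.

```python
import urllib.error
import urllib.request


def fetch_status(url, timeout=10.0):
    """Issue a GET to url and return the HTTP status code, or None if unreachable."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return resp.status
    except urllib.error.HTTPError as err:
        # 4xx/5xx responses (such as a 503) raise HTTPError; the code is on the exception.
        return err.code
    except (urllib.error.URLError, OSError):
        # DNS failure, refused connection, timeout and similar network errors.
        return None


def needs_alert(status):
    """Assumed policy: alert when there is no response or the status is 400 or above."""
    return status is None or status >= 400


def check(url, fetch=fetch_status):
    """Return a one-line report for url; 'fetch' is injectable so it can be stubbed."""
    status = fetch(url)
    if needs_alert(status):
        return f"ALERT: {url} returned {status}"
    return f"OK: {url} returned {status}"
```

Run from cron as something like print(check("http://search.postgresql.org/")), a 503 from the search server would produce an ALERT line that cron could mail out.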
 
Though, thinking about content, there is an open source tool that does a job similar to siteconf (http://www.siteconfidence.com), but I can't remember the name at the moment.
 
But I understand that the search problem was not a *content* problem itself.
 
G.-

 
On 1/14/06, Marc G. Fournier <scrappy@postgresql.org> wrote:
On Fri, 13 Jan 2006, Josh Berkus wrote:

> Jim,
>
>> Search has been down for at least 2 days now, and this certainly isn't
>> the first time it's happened. There's also been cases of archives
>> getting stuck, and probably other outages besides those that went on
>> until someone emailed about it.
>>
>> Would it be difficult to set up something to monitor these various
>> services? I know there's at least one OSS tool to do it, though I have
>> no idea how hard it would be to tie that into the current
>> infrastructure.
>
> We have an open offer of Hyperic licenses, and they support FreeBSD now.

Not to discount the offer ... but, what exactly would that provide us?  We
already monitor the *servers*, it's what is inside of the servers that
needs better monitoring ... knowing nothing about Hyperic, does that
provide something for that?

In the case of the archives, for instance, the problem was a perl process
that for some unknown reason got stuck randomly ... removed that in favor
of an awk script, and it hasn't done it since ... I also redirected cron's
email to scrappy@postgresql.org, so that any errors show up in my mailbox
instead of root's, so I get an hourly reminder that things are running well
...

In the case of search ... John would be better at answering that, but when
he and I talked this past week, he mentioned that he was moving it all
over to two new servers, which I changed the DNS for on Wednesday ...

As I've said above ... physical servers are being monitored, so if anyone
has some ideas on how we can improve "content monitoring", for lack of a
better word, I know I'm all ears ...

Again, if Hyperic can offer something for this, let me know ...

----
Marc G. Fournier           Hub.Org Networking Services (http://www.hub.org)
Email: scrappy@hub.org           Yahoo!: yscrappy              ICQ: 7615664

---------------------------(end of broadcast)---------------------------
TIP 2: Don't 'kill -9' the postmaster



--
/"\   ASCII Ribbon Campaign  .
\ / - NO HTML/RTF in e-mail  .
X  - NO Word docs in e-mail .
/ \ -----------------------------------------------------------------
