Re: production server down - Mailing list pgsql-hackers

From	Tom Lane
Subject	Re: production server down
Date	December 15, 2004 09:10:59
Msg-id	27366.1103091021@sss.pgh.pa.us
In response to	Re: production server down (Joe Conway <mail@joeconway.com>)
Responses	Re: production server down
List	pgsql-hackers

Joe Conway <mail@joeconway.com> writes:
> Any theories on how we screwed up?

I hesitate to suggest this, but maybe a cron job blindly copying data
from point A to point B?

I'm not sure that that could entirely explain the facts.  My
recollection of the xlog.c logic is that the pg_control file is read
into shared memory during postmaster boot, and after that it's
write-only: at checkpoint times we update the file image in shared
memory and then write it out to pg_control.
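
As a rough illustration of why a stray copy alone may not explain the damage, here is a toy Python model of the "read once at boot, write-only afterwards" pattern described above. It is not the xlog.c code; the class name `ControlFileCache`, the `demo` helper, and the byte strings are all made up for the example. The only thing taken from the message is the behavior: one read at startup, then the in-memory image is updated and written out at each checkpoint.

```python
import os
import tempfile


class ControlFileCache:
    """Toy model (hypothetical): read the control file once, then only write it."""

    def __init__(self, path):
        self.path = path
        # Read once at startup; nothing below ever reads the file again.
        with open(path, "rb") as f:
            self.image = bytearray(f.read())

    def checkpoint(self, new_bytes):
        # Update the in-memory image first, then overwrite the on-disk file.
        self.image = bytearray(new_bytes)
        with open(self.path, "wb") as f:
            f.write(self.image)
            f.flush()
            os.fsync(f.fileno())


def demo():
    with tempfile.TemporaryDirectory() as d:
        path = os.path.join(d, "pg_control")
        with open(path, "wb") as f:
            f.write(b"checkpoint=1")
        cache = ControlFileCache(path)
        # An outside process (say, a blind cron copy) overwrites the file.
        with open(path, "wb") as f:
            f.write(b"checkpoint=0 (stale copy)")
        # The running process never rereads, so its image is unaffected.
        in_memory = bytes(cache.image)
        # The next checkpoint replaces the outside write on disk.
        cache.checkpoint(b"checkpoint=2")
        with open(path, "rb") as f:
            on_disk = f.read()
        return in_memory, on_disk
```

In this model a foreign overwrite of the file is invisible to the running process and survives only until the next checkpoint, which is consistent with the doubt expressed above that a copy job by itself accounts for everything.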

Offhand my bets would revolve around (a) multiple postmasters trying to
run the same PGDATA directory (we have interlocks to protect against
this, but I have no faith that they work against an NFS-mounted data
directory), or (b) you somehow wiped a PGDATA directory and restored it
from backup tapes underneath a running postmaster.
        regards, tom lane
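
For readers unfamiliar with the kind of interlock mentioned in (a), the following is a minimal sketch of an exclusive-create lock file, assuming a local filesystem. It is not PostgreSQL's implementation: the helpers `acquire_lock` and `demo_lock` are hypothetical, and only the lock file name `postmaster.pid` is borrowed from PostgreSQL. A scheme like this depends on the filesystem honoring exclusive creation, which is the sort of guarantee the message is wary of relying on over NFS.

```python
import os
import tempfile


def acquire_lock(data_dir):
    """Hypothetical helper: try to claim data_dir by creating a lock file."""
    lock_path = os.path.join(data_dir, "postmaster.pid")
    try:
        # O_EXCL makes creation fail if the file already exists.
        fd = os.open(lock_path, os.O_CREAT | os.O_EXCL | os.O_WRONLY, 0o600)
    except FileExistsError:
        return False
    with os.fdopen(fd, "w") as f:
        # Record the owner's PID so a later starter can inspect it.
        f.write(f"{os.getpid()}\n")
    return True


def demo_lock():
    with tempfile.TemporaryDirectory() as d:
        first = acquire_lock(d)   # first starter wins the directory
        second = acquire_lock(d)  # second starter is refused
        return first, second
```

A PID recorded by one host says little to a second host mounting the same directory, so any liveness check built on it would only be meaningful when both starters run on the same machine.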
