Thread: Database replication... - Mission Critical DBMS's -- Taking postgreSQL to the Enterprise Computing level...

Database replication... - Mission Critical DBMS's -- Taking postgreSQL to the Enterprise Computing level...

From

"Greg Patnude"

Date:

30 October 2002, 08:11:40

I have a client that I am developing a web-based business application for
using Perl / PHP, DHTML, and postgreSQL on a FreeBSD server with Apache...

The client realized (or hopes) that this application may become mission
critical (to his clients / end-users) and was asking about the possibility
of having a replicated database using postgreSQL in case the database server
takes a dive (paranoia). He is somewhat familiar with MS Access (I gathered
he was thinking of the MS Access replication / front-end / back-end /
synchronized database concept).

I have read through all 7 chapters of the PG documentation and didn't see
anything about replicating a postgreSQL database on a secondary database
server and having the primary server push all data manipulations on through
to the secondary server. The secondary server would be available (through
the connect script) if the primary server died.

Does anyone know of a reasonable means of synchronizing two (or more)
postgreSQL database servers in order to maintain two COMPLETE concurrent
copies of the same database ?

Since I can already see REFINT problems in maintaining the integrity of
serial values, trigger problems, and stored procedure problems (UDF's),
perhaps I should just suggest a robust database backup and restore scheme
with a standby postgreSQL server ready to accept a pgdumpall.

Although the documentation claims that CREATE DATABASE is not intended as a
"copy database" utility, maybe a CRON event (say every 4 hours) on the
secondary server to:

DROP database 'backupdatabase';
CREATE DATABASE 'backupdatabase' WITH TEMPLATE = 'livedatabase';

would do the trick....

I am completely open to suggestions here...

TIA.

GP

Re: [GENERAL] Database replication... - Mission Critical DBMS's --

From

Robert Treat

Date:

30 October 2002, 10:18:47

You might want to check out
http://gborg.postgresql.org/genpage?replication_research which has
information and links to several replication solutions for postgresql.

the techdocs.postgresql.org website also has a good number of papers
regarding replication, as well as a "high availability how-to" that
would probably be worth reading.

Robert Treat

On Wed, 2002-10-30 at 09:24, Bill Gribble wrote:
> On Tue, 2002-10-29 at 17:56, Greg Patnude wrote:
> > Does anyone know of a reasonable means of synchronizing two (or more)
> > postgreSQL database servers in order to maintain two COMPLETE concurrent
> > copies of the same database ?
>
> I'm not sure how complete it is (just starting to look at this myself)
> but 'dbbalancer' at least makes some claims to this.  It acts as
> (basically) a postgresql proxy server, and can send queries to multiple
> servers.  My presumption would be that if you initialize 2 databases to
> a known identical start, have all the same triggers and rules on both,
> then send all queries to both databases, you will have 2 identical
> databases at the end.
>
> Don't know how well that will work in practice tho.  I should know more
> in the next couple of weeks.
>
> b.g.
>
>
>
>
> ---------------------------(end of broadcast)---------------------------
> TIP 1: subscribe and unsubscribe commands go to majordomo@postgresql.org

Re: Database replication... - Mission Critical DBMS's -- Taking postgreSQL to the Enterprise Computing level...

From

Andrew Sullivan

Date:

30 October 2002, 10:30:50

On Tue, Oct 29, 2002 at 03:56:46PM -0800, Greg Patnude wrote:

> I have read through all 7 chapters of the PG documentation and didn't see
> anything about replicating a postgreSQL database on a secondary database
> server and having the primary server push all data manipulations on through
> to the secondary server. The secondary server would be available (through
> the connect script) if the primary server died.
>
> Does anyone know of a reasonable means of synchronizing two (or more)
> postgreSQL database servers in order to maintain two COMPLETE concurrent
> copies of the same database ?

I think what you want is that any change, including schema changes,
&c., on the master database are echoed to the slave.  I know rserv
can't do that, and I don't _think_ dbmirror can, either.  But both of
those are possibilities.  AFAIK, the contrib/rserv code is completely
broken in the 7.3 series, so dbmirror might be the free answer to
pick; rserv has a commercial cousin which we use.

The problem with these is that they send _data_ to the slave, and use
standard SQL statements.  So sequences &c. are not carried over.  You
need a script to do that.  It's not an instantaneous failover.

On the other hand, if your client is comparing with Access, the
reliability will be so much better that perhaps some extra work in
the rare case of failure will be acceptable.  It is for us, and we
have extremely stringent SLAs which constrain how long we could be
down if (heaven forfend!) our master database ever died.

A

--
----
Andrew Sullivan                         204-4141 Yonge Street
Liberty RMS                           Toronto, Ontario Canada
<andrew@libertyrms.info>                              M2P 2A8
                                         +1 416 646 3304 x110

Re: [GENERAL] Database replication... - Mission Critical DBMS's --

From

Andrew Sullivan

Date:

30 October 2002, 10:33:09

On Wed, Oct 30, 2002 at 10:18:47AM -0500, Robert Treat wrote:
> regarding replication, as well as a "high availability how-to" that
> would probably be worth reading.

The high availability howto suggests using rsync to synchronise the
data areas of two data servers.  That is an _extremely bad_ idea.
I've suggested before that the link be removed, because it recommends
something almost guaranteed to introduce massive database corruption
at some point.  If there's no load and you have a fast network, you
might get lucky.  But it is an extremely dangerous plan.

A

--
----
Andrew Sullivan                         204-4141 Yonge Street
Liberty RMS                           Toronto, Ontario Canada
<andrew@libertyrms.info>                              M2P 2A8
                                         +1 416 646 3304 x110

Re: [GENERAL] Database replication... - Mission Critical

From

Justin Clift

Date:

01 November 2002, 20:42:48

Andrew Sullivan wrote:
>
> On Wed, Oct 30, 2002 at 10:18:47AM -0500, Robert Treat wrote:
> > regarding replication, as well as a "high availability how-to" that
> > would probably be worth reading.
>
> The high availability howto suggests using rsync to synchronise the
> data areas of two data servers.  That is an _extremely bad_ idea.
> I've suggested before that the link be removed, because it recommends
> something almost guaranteed to introduce massive database corruption
> at some point.  If there's no load and you have a fast network, you
> might get lucky.  But it is an extremely dangerous plan.

Ok, have just removed the link.  Sorry for not getting around to it
before Andrew.

(Bruce pointed out your email, otherwise I would have missed it again
too).

:-/

Regards and best wishes,

Justin Clift


> A
>
> --
> ----
> Andrew Sullivan                         204-4141 Yonge Street
> Liberty RMS                           Toronto, Ontario Canada
> <andrew@libertyrms.info>                              M2P 2A8
>                                          +1 416 646 3304 x110
>
> ---------------------------(end of broadcast)---------------------------
> TIP 2: you can get off all lists at once with the unregister command
>     (send "unregister YourEmailAddressHere" to majordomo@postgresql.org)

--
"My grandfather once told me that there are two kinds of people: those
who work and those who take the credit. He told me to try to be in the
first group; there was less competition there."
   - Indira Gandhi

Re: [GENERAL] Database replication... - Mission Critical DBMS's --

From

Bill Gribble

Date:

04 November 2002, 15:12:48

On Tue, 2002-10-29 at 17:56, Greg Patnude wrote:
> Does anyone know of a reasonable means of synchronizing two (or more)
> postgreSQL database servers in order to maintain two COMPLETE concurrent
> copies of the same database ?

I'm not sure how complete it is (just starting to look at this myself)
but 'dbbalancer' at least makes some claims to this.  It acts as
(basically) a postgresql proxy server, and can send queries to multiple
servers.  My presumption would be that if you initialize 2 databases to
a known identical start, have all the same triggers and rules on both,
then send all queries to both databases, you will have 2 identical
databases at the end.

Don't know how well that will work in practice tho.  I should know more
in the next couple of weeks.

b.g.