Streaming replication on 9.1-beta2 after pg_restore is very slow - Mailing list pgsql-testers

From David Hartveld
Subject Streaming replication on 9.1-beta2 after pg_restore is very slow
Date
Msg-id 0317654684C3CF48B06D8FF5AE5D2EE0CBBE@Win-Exchange-02.MENDIXDOMAIN.local
Whole thread Raw
Responses Re: Streaming replication on 9.1-beta2 after pg_restore is very slow  (Josh Berkus)
List pgsql-testers

Hi all,

 

I am experimenting with (synchronous) streaming replication on postgresql 9.1 beta 2 and am having performance problems. I have initially set up an (asynchronous) streaming replication master cluster with postgresql 9.0, which streamed to a single slave cluster. This seemed to work quite well. Then I’ve mostly copied the configuration to a 9.1 beta 2 cluster (master and slave) to see how synchronous replication would behave.

 

The master cluster, when empty after an initdb (pg_createcluster on debian) seems to properly stream changes to one or more slave clusters when correctly set up. I watch the master and slave with pg_current_xlog_location() on master and pg_last_xlog_receive_location() and pg_last_xlog_replay_location(). It seems that slaves pick up changes, such as a simple database creation, or updating a role password, or adding a role. But when I then do a restoredb on the master,  the slaves quickly fall behind and only very slowly catch up (after maybe an hour or something...)

 

The log on the slave is filled with statements similar to the following:

LOG: streaming replication successfully connected to primary

LOG:  record with zero length at 0/9B7A010

FATAL:  terminating walreceiver process due to administrator command

The log file also contains the following line a number of times (with the numbers of course a bit different every time):

               LOG: invalid magic number 0000 in log file 0, segment 9, offset 10878976

 

The log on the master contains several lines with:

               LOG:  could not send data to client: Connection reset by peer

 

Is there possibly a known issue with the beta, or do I have to configure my cluster differently for 9.1? I’m a bit at a loss, and would appreciate any comments.

 

Thanks,

David Hartveld

 

pgsql-testers by date:

Previous
From: Dick Wieland
Date:
Subject: please ignore last post: v9.1beta2 pg_dumpall error
Next
From: Josh Berkus
Date:
Subject: Re: Streaming replication on 9.1-beta2 after pg_restore is very slow