Re: Problems with pg_dump - Mailing list pgsql-admin

From Tom Lane
Subject Re: Problems with pg_dump
Date
Msg-id 7946.1075133986@sss.pgh.pa.us
Whole thread Raw
In response to Re: Problems with pg_dump  (Stefan Holzheu <stefan.holzheu@bitoek.uni-bayreuth.de>)
Responses Re: Problems with pg_dump  (Stefan Holzheu <stefan.holzheu@bitoek.uni-bayreuth.de>)
List pgsql-admin
Stefan Holzheu <stefan.holzheu@bitoek.uni-bayreuth.de> writes:
> pg_dump: lost synchronization with server: got message type "5", length
> 842281016

> The error does not occur always and not always with the same table.

Oh, that's even more interesting.  Is the failure message itself
consistent --- that is, is it always complaining about "message type 5"
and the same bizarre length value?  The "length" looks like it's really
ASCII text ("2408" to be specific), so somehow libpq is misinterpreting
a piece of the COPY datastream as the start of a new message.

> However, the error occurs only on that kind of aggregation tables. There
> is a cron-job keeping the tables up to date, starting all 10 minutes.
> The job does delete and inserts on the table. Could this somehow block
> the dump process? Normally it should not?

It's hard to see how another backend would have anything to do with
this, unless perhaps the error is dependent on a particular data value
that is sometimes present in the table and sometimes not.  It looks to
me like either libpq or the backend is miscounting the number of data
bytes in the COPY datastream.  Would it be possible for you to use a
packet sniffer to capture the communication between pg_dump and the
backend?  If we could look at exactly what's going over the wire, it
would help to pin down the blame.

            regards, tom lane

pgsql-admin by date:

Previous
From: "Anson Liu"
Date:
Subject: A question?
Next
From: Raquel Vieira
Date:
Subject: Re: A question?