Home > mailing lists

Re: Corrupt indexes on slave when using pg_bulkload on master - Mailing list pgsql-general

From	James Cowell
Subject	Re: Corrupt indexes on slave when using pg_bulkload on master
Date	December 17, 2012 23:42:27
Msg-id	1355776940.54156.YahooMailNeo@web186001.mail.ir2.yahoo.com Whole thread Raw
In response to	Re: Corrupt indexes on slave when using pg_bulkload on master (James Cowell <jcowell@btinternet.com>)
Responses	Re: Corrupt indexes on slave when using pg_bulkload on master
List	pgsql-general

Tree view

I saw that 9.1.7 was out and it had another bugfix in for WAL playback.

I installed it and ran my reproduction case and it no longer corrupts the index.

I reindexed the database I load into and did a full data load and the indexes still corrupt on the slave.

It does not appear to be related to constraint violation as one table is new rows only.

I will try and put together a new reproduction case this week.

Cheers,

James

From: James Cowell <jcowell@btinternet.com>
To: Jeff Janes <jeff.janes@gmail.com> 
Cc: "pgsql-general@postgresql.org" <pgsql-general@postgresql.org> 
Sent: Thursday, 13 December 2012, 12:26
Subject: Re: [GENERAL] Corrupt indexes on slave when using pg_bulkload on master

Hi Jeff,

Thanks again for your reply.

>If there are no constraint violations, do you still see the problem?

Yes, I've stripped it down to an empty table with a 10 row load and the pk index on the secondary node still corrupts.

> Were there any older version on which it worked? 

I'm afraid I started on 9.1.5, I upgraded to 9.1.6 when I had the initial problem due to the bugfix in the changelog to do with corrupt indexes on the secondary node but it hasn't resolved the issue.

> Can you post a minimal schema and control file to reproduce the problem?

I've attached a text file with details for table, load config file etc, is that everything you would need?

Cheers,

James

From: Jeff Janes <jeff.janes@gmail.com>
To: James Cowell <jcowell@btinternet.com> 
Cc: "pgsql-general@postgresql.org" <pgsql-general@postgresql.org> 
Sent: Monday, 10 December 2012, 16:53
Subject: Re: [GENERAL] Corrupt indexes on slave when using pg_bulkload on master

On Wed, Dec 5, 2012 at 5:17 AM, James Cowell <jcowell@btinternet.com> wrote:
> I'm using pg_bulkload to load large amounts of CSV data into a postgres
> database hourly.
>
> This database is replicated to a second node.
>
> Whenever a bulk load happens the indexes on the updated tables on the
> secondary node corrupt and are unusable until a reindex is run on the
> primary node.  I get the error below on node 2:
>
> ERROR: index "tablename" contains unexpected zero page at block 0
> SQL state: XX002
> Hint: Please REINDEX it.
>
> I'm assuming that this is because of the way pg_bulkload builds the index on
> the primary, and possibly has something to do with the way pg_bulkload
> overwrites rows in the event of a constraint violation,

If there are no constraint violations, do you still see the problem?

> but at the same time
> if something works on the primary shouldn't the replicated node be able to
> process the WAL log?
>
> I've tried this on 9.1.6 and 9.1.5 on RHEL 6.3 with pg_bulkload build
> 3.1.1-1.pg91.rhel6 and it happens every time.

Were there any older version on which it worked?  Can you post a
minimal schema and control file to reproduce the problem?

Cheers,

Jeff

pgsql-general by date:

From: Tom Lane
Date: 17 December 2012, 22:30:47
Subject: Re: PG 8.4 to 9.2 upgrade issues with ownership of large objects

From: Glen Eustace
Date: 18 December 2012, 00:08:34
Subject: mod_auth_pgsql and Apache 2.4

Re: Corrupt indexes on slave when using pg_bulkload on master - Mailing list pgsql-general

Previous

Next