
Re: VLDB Features - Mailing list pgsql-hackers

From	Neil Conway
Subject	Re: VLDB Features
Date	December 14, 2007 20:18:24
Msg-id	1197677930.1536.18.camel@dell.linuxdev.us.dell.com
In response to	Re: VLDB Features (Tom Lane <tgl@sss.pgh.pa.us>)
Responses	Re: VLDB Features Re: VLDB Features
List	pgsql-hackers


On Fri, 2007-12-14 at 18:22 -0500, Tom Lane wrote:
> If we could somehow only do a subtransaction per failure, things would
> be much better, but I don't see how.

One approach would be to essentially implement the pg_bulkloader
approach inside the backend. That is, begin by doing a subtransaction
for every k rows (with k = 1000, say). If you get any errors, then
either repeat the process with k/2 until you locate the individual
row(s) causing the trouble, or perhaps just immediately switch to k = 1.
Fairly ugly though, and would be quite slow for data sets with a high
proportion of erroneous data.
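
A minimal sketch of the first approach, written as a standalone Python simulation rather than backend code. It takes the "repeat with k/2" variant: a failing batch is split in half and retried until the bad rows are isolated. The names `RowError`, `load_batch`, `bulk_load`, and `insert_row` are hypothetical, and the "subtransaction" is only modelled as an all-or-nothing staging list.

```python
class RowError(Exception):
    """Hypothetical error raised when a single row cannot be loaded."""


def load_batch(chunk, insert_row, committed):
    """Model one subtransaction: either every row in chunk is kept, or none."""
    staged = []
    for row in chunk:
        staged.append(insert_row(row))  # may raise RowError
    committed.extend(staged)  # "commit" only if no row failed


def bulk_load(rows, insert_row, k=1000):
    """Load rows in batches of k, halving any batch that fails.

    Returns (committed, rejected, subxacts), where subxacts counts how
    many simulated subtransactions were started.
    """
    committed, rejected = [], []
    subxacts = 0

    def load(chunk):
        nonlocal subxacts
        subxacts += 1
        try:
            load_batch(chunk, insert_row, committed)
        except RowError:
            if len(chunk) == 1:
                rejected.append(chunk[0])  # located the offending row
                return
            mid = len(chunk) // 2
            load(chunk[:mid])  # retry each half in its own subtransaction
            load(chunk[mid:])

    for start in range(0, len(rows), k):
        load(rows[start:start + k])
    return committed, rejected, subxacts
```

With clean data this starts about one subtransaction per k rows. Each bad row adds roughly 2*log2(k) extra ones, which is where the cost for dirty data sets comes from.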

Another approach would be to distinguish between errors that require a
subtransaction to recover to a consistent state, and less serious errors
that don't have this requirement (e.g. invalid input to a data type
input function). If all the errors that we want to tolerate during a
bulk load fall into the latter category, we can do without
subtransactions.
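
The distinction could be sketched roughly as follows, again as a Python simulation and not as a description of how the backend behaves. `SoftError` stands in for a failure that happens before any state is touched (such as bad input to a type's input function), and `HardError` for one that would need a rollback. Both classes, `parse_int`, and `bulk_load_soft` are hypothetical names used only for illustration.

```python
class SoftError(Exception):
    """Hypothetical: input could not be converted; no state was modified."""


class HardError(Exception):
    """Hypothetical: failure that would need a subtransaction to recover."""


def parse_int(text):
    """Stand-in for a data type input function."""
    try:
        return int(text)
    except ValueError as exc:
        raise SoftError(f"invalid integer: {text!r}") from exc


def bulk_load_soft(lines, table):
    """Append parsed values to table, skipping rows that fail softly.

    No subtransaction is modelled: a SoftError is raised before the table
    is touched, so the row can simply be skipped. A HardError (here, a
    duplicate value) is not tolerated and propagates to the caller.
    """
    rejected = []
    for line in lines:
        try:
            value = parse_int(line)
        except SoftError:
            rejected.append(line)  # record the bad row and carry on
            continue
        if value in table:
            raise HardError(f"duplicate value: {value}")
        table.append(value)
    return rejected
```

The sketch only works because the tolerated check runs before any modification. Whether real error sites can be separated that cleanly is exactly the open question.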

-Neil

