Re: How to implement the skip errors for copy from ? - Mailing list pgsql-hackers

From xbzhang
Subject Re: How to implement the skip errors for copy from ?
Date
Msg-id 2014061811212012467393@kingbase.com.cn
In response to How to implement the skip errors for copy from ?  (xbzhang <xbzhang@kingbase.com.cn>)
List pgsql-hackers
Using a subtransaction per tuple is very inefficient. A project called pg_bulkload supports skipping errors without using subtransactions, and its performance is very good. So I want to imitate pg_bulkload to implement error skipping for COPY.
Suppose I do the following for COPY:
1. Disable all triggers on the table.
2. Skip only the following errors:
     * tuple format errors;
     * check constraint violations;
     * unique or primary key constraint violations.
    Any error other than these three aborts the current transaction.
3.  On a unique key violation, the bad tuple is deleted and the per-tuple
    resource owner is released: the tuple's xmax is set to the current
    transaction ID so the tuple is no longer seen as good, and all of its
    index entries are physically removed at the next VACUUM.
Is this a correct way to skip errors for COPY FROM?
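
For step 2, the error filtering could look roughly like the sketch below. It uses the real backend error APIs (PG_TRY/PG_CATCH, CopyErrorData, FlushErrorState, ReThrowError); the helper name skip_or_rethrow is hypothetical, and the exact set of SQLSTATEs is my guess at the three classes above (note that data-type parse errors during COPY may also arrive as ERRCODE_INVALID_TEXT_REPRESENTATION rather than ERRCODE_BAD_COPY_FILE_FORMAT):

#include "postgres.h"

/*
 * Hypothetical helper: decide whether the error that was just caught is
 * one of the three classes we are willing to skip.  Anything else is
 * re-thrown and aborts the transaction as usual.
 *
 * Must be called from inside PG_CATCH(), after switching back out of
 * ErrorContext into the caller's memory context.
 */
static bool
skip_or_rethrow(void)
{
    ErrorData  *edata;

    /* Capture the pending error, then clear the error state. */
    edata = CopyErrorData();
    FlushErrorState();

    switch (edata->sqlerrcode)
    {
        case ERRCODE_BAD_COPY_FILE_FORMAT:  /* malformed input line */
        case ERRCODE_CHECK_VIOLATION:       /* check constraint */
        case ERRCODE_UNIQUE_VIOLATION:      /* unique / primary key */
            FreeErrorData(edata);
            return true;        /* caller skips this tuple */
        default:
            ReThrowError(edata);    /* does not return */
    }
    return false;               /* not reached */
}

Passing this filter is the easy part, though; as pointed out below, the half-inserted tuple and any index entries already made for it still have to be cleaned up somehow.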


From: Tom Lane
Date: 2014-06-17 21:40
To: xbzhang
Subject: Re: [HACKERS] How to implement the skip errors for copy from ?
xbzhang <xbzhang@kingbase.com.cn> writes:
> LWLocks can be recorded in the per-tuple resource owner, so they can be released the right way, but the memory allocated in a memory context is another problem. Are there any other problems?
 
See AbortSubTransaction(), CleanupSubTransaction(), and the rather large
number of subroutines they call.  Almost everything that code does is
connected to cleaning up something that might have been left unfinished
after an elog(ERROR) took control away in the middle of some code
sequence.
 
In addition, you can't just wave your hands and presto the bad tuple is
not there anymore.  For example, the failure might have been a unique key
violation in some index or other.  Not only is the bad tuple already on
disk, but possibly so are index entries for it in other indexes.  In
general the only way to get rid of those index entries is a VACUUM.
So you really have to have a subtransaction whose XID is what you mark
the new tuple with, and then rolling back the subtransaction is what
causes the new tuple to not be seen as good.  (Actually getting rid of
it will be left for the next VACUUM.)
 
regards, tom lane
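
The per-tuple subtransaction approach described above is essentially the pattern PL/pgSQL uses for its exception blocks. A minimal sketch, assuming the real backend APIs BeginInternalSubTransaction, ReleaseCurrentSubTransaction, and RollbackAndReleaseCurrentSubTransaction; the insert_one_tuple callback is a hypothetical stand-in for COPY's actual insertion code:

#include "postgres.h"
#include "access/xact.h"
#include "utils/resowner.h"

/*
 * Insert one tuple inside its own subtransaction.  On error the
 * subtransaction is rolled back: its XID marks the new tuple dead,
 * and AbortSubTransaction()/CleanupSubTransaction() take care of the
 * locks, memory, and other resources the elog(ERROR) left dangling.
 * Returns true if the tuple went in, false if it was skipped.
 */
static bool
insert_tuple_safely(void (*insert_one_tuple) (void *), void *arg)
{
    MemoryContext oldcontext = CurrentMemoryContext;
    ResourceOwner oldowner = CurrentResourceOwner;
    bool        ok = true;

    BeginInternalSubTransaction(NULL);
    /* BeginInternalSubTransaction leaves us in a transaction context */
    MemoryContextSwitchTo(oldcontext);

    PG_TRY();
    {
        insert_one_tuple(arg);

        /* Success: commit the subtransaction and restore state. */
        ReleaseCurrentSubTransaction();
        MemoryContextSwitchTo(oldcontext);
        CurrentResourceOwner = oldowner;
    }
    PG_CATCH();
    {
        MemoryContextSwitchTo(oldcontext);
        FlushErrorState();

        /*
         * Roll back: the subxact's aborted XID leaves the tuple not
         * visible, and its dead index entries are reclaimed by a later
         * VACUUM, exactly as described above.
         */
        RollbackAndReleaseCurrentSubTransaction();
        MemoryContextSwitchTo(oldcontext);
        CurrentResourceOwner = oldowner;
        ok = false;
    }
    PG_END_TRY();

    return ok;
}

In practice the PG_CATCH block would first run something like skip_or_rethrow() from the earlier sketch, so that only the three skippable error classes are swallowed. The cost is that every row consumes a subtransaction (and potentially an XID), which is exactly the overhead pg_bulkload tries to avoid.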
 
 
 
 
 
