Re: COPY vs INSERT - Mailing list pgsql-performance

From Mischa Sandberg
Subject Re: COPY vs INSERT
Date
Msg-id 1115248292.427956a468a55@webmail.telus.net
Whole thread Raw
In response to Re: COPY vs INSERT  ("David Roussel" <pgsql-performance@diroussel.xsmail.com>)
Responses Re: COPY vs INSERT  (Christopher Petrilli <petrilli@gmail.com>)
Re: COPY vs INSERT  (Kris Jurka <books@ejurka.com>)
List pgsql-performance
Quoting David Roussel <pgsql-performance@diroussel.xsmail.com>:

> > COPY invokes all the same logic as INSERT on the server side
> > (rowexclusive locking, transaction log, updating indexes, rules).
> > The difference is that all the rows are inserted as a single
> > transaction. This reduces the number of fsync's on the xlog,
> > which may be a limiting factor for you. You'll want to crank
> > WAL_BUFFERS and CHECKPOINT_SEGMENTS to match, though.
> > One of my streams has 6K records; I run with WB=1000, CS=128.
>
> So what's the difference between a COPY and a batch of INSERT
> statements.  Also, surely, fsyncs only occur at the end of a
> transaction, no need to fsync before a commit has been issued,
> right?

Sorry, I was comparing granularities the other way araound. As far as
xlog is concerned, a COPY is ALWAYS one big txn, no matter how many
putline commands you use to feed the copy. With inserts, you can choose
whether to commit every row, every nth row, etc.

Copy makes better use of the TCP connection for transmission. COPY uses
the TCP connection like a one-way pipe. INSERT is like an RPC: the
sender has to wait until the insert's return status roundtrips.
--
Engineers think equations approximate reality.
Physicists think reality approximates the equations.
Mathematicians never make the connection.


pgsql-performance by date:

Previous
From: "David Roussel"
Date:
Subject: Re: COPY vs INSERT
Next
From: John A Meinel
Date:
Subject: Re: COPY vs INSERT