
Re: Fast insertion indexes: why no developments - Mailing list pgsql-hackers

From	Jeff Janes
Subject	Re: Fast insertion indexes: why no developments
Date	November 5, 2013 16:57:13
Msg-id	CAMkU=1zUdJUtdxUNXR3DDLJ2szoppLWkTVTxy-QwoKkxw+yGhQ@mail.gmail.com
In response to	Re: Fast insertion indexes: why no developments (Leonardo Francalanci <m_lists@yahoo.it>)
Responses	Re: Fast insertion indexes: why no developments
List	pgsql-hackers


On Tue, Nov 5, 2013 at 12:25 AM, Leonardo Francalanci <m_lists@yahoo.it> wrote:

Andres Freund wrote
> On 2013-11-04 11:27:33 -0500, Robert Haas wrote:
>> On Mon, Nov 4, 2013 at 11:24 AM, Claudio Freire <klaussfreire@> wrote:
>> > Such a thing would help COPY, so maybe it's worth a look
>>
>> I have little doubt that a deferred insertion buffer of some kind
>> could help performance on some workloads, though I suspect the buffer
>> would have to be pretty big to make it worthwhile on a big COPY that
>> generates mostly-random insertions.
>
> Even for random data presorting the to-be-inserted data appropriately
> could result in much better io patterns.

Mmh, I'm afraid that the buffer should be huge to get some real advantage.
You have to buffer enough values to avoid "touching" entire pages, which is
not that easy.

Some experiments I did a few years ago showed that applying sorts to the data to be inserted could be helpful even when the sort batch size was as small as one tuple per 5 pages of existing index. Maybe even less.
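
To make the point about I/O patterns concrete, here is a rough toy model, not a reconstruction of the experiments mentioned above. It assumes uniformly random keys, a fixed set of leaf pages with no splits, a key that maps directly to its leaf page number, and "seek distance" (the sum of jumps between consecutively touched pages) as a crude stand-in for I/O cost. All names and parameters are hypothetical. The default batch size of 2,000 against 10,000 pages corresponds to one tuple per 5 pages of existing index.

```python
import random


def total_seek_distance(pages):
    """Sum of absolute jumps between consecutively touched leaf pages."""
    return sum(abs(b - a) for a, b in zip(pages, pages[1:]))


def insertion_order(keys, batch_size):
    """Sort keys within each consecutive batch; batch_size=1 keeps arrival order."""
    ordered = []
    for start in range(0, len(keys), batch_size):
        ordered.extend(sorted(keys[start:start + batch_size]))
    return ordered


def simulate(num_pages=10_000, num_inserts=20_000, batch_size=2_000, seed=0):
    """Return (seek distance in arrival order, seek distance with sorted batches)."""
    rng = random.Random(seed)
    # Toy assumption: key k lands on leaf page k; no page splits, no caching.
    keys = [rng.randrange(num_pages) for _ in range(num_inserts)]
    unsorted = total_seek_distance(keys)
    batched = total_seek_distance(insertion_order(keys, batch_size))
    return unsorted, batched


if __name__ == "__main__":
    unsorted, batched = simulate()
    print(f"arrival order:  {unsorted} pages of seek distance")
    print(f"sorted batches: {batched} pages of seek distance")
```

In this model each sorted batch touches about as many distinct pages as the same inserts would in arrival order, but it visits them in a single ascending sweep, so the benefit comes from the access pattern rather than from avoiding page touches.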

Cheers,

Jeff
