Home > mailing lists

Re: Building multiple indexes on one table. - Mailing list pgsql-performance

From	Claudio Freire
Subject	Re: Building multiple indexes on one table.
Date	July 23, 2014 19:50:03
Msg-id	CAGTBQpZeWUZfuq6tZnKefsQdMum9JxXNCEgFdwTtRO=81m6avg@mail.gmail.com Whole thread
In response to	Re: Building multiple indexes on one table. (Marc Mamin <M.Mamin@intershop.de>)
Responses	Re: Building multiple indexes on one table.
List	pgsql-performance

Tree view

On Wed, Jul 23, 2014 at 4:40 PM, Marc Mamin <M.Mamin@intershop.de> wrote:
>>On Thu, Jul 17, 2014 at 7:47 PM, Chris Ruprecht <chris@cdrbill.com> wrote:
>>> Is there any way that I can build multiple indexes on one table without having to scan the table multiple times?
Forsmall tables, that's probably not an issue, but if I have a 500 GB table that I need to create 6 indexes on, I don't
wantto read that table 6 times. 
>>> Nothing I could find in the manual other than reindex, but that's not helping, since it only rebuilds indexes that
arealready there and I don't know if that reads the table once or multiple times. If I could create indexes inactive
andthen run reindex, which then reads the table once, I would have a solution. But that doesn't seem to exist either. 
>>
>>Just build them with separate but concurrent connections, and the
>>scans will be synchronized so it will be only one.
>>
>>Btw, reindex rebuilds one index at a time, so what I do is issue
>>separate reindex for each index in parallel, to avoid the repeated
>>scans as well.
>>
>>Just make sure you've got the I/O and CPU capacity for it (you'll be
>>writing many indexes at once, so there is a lot of I/O).
>
> Index creation on large tables are mostly CPU bound as long as no swap occurs.
> I/O may be an issue when all your indexes are similar; e.g. all on single int4 columns.
> in other cases the writes will not all take place concurrently.
> To reduce I/O due to swap, you can consider increasing maintenance_work_mem on the connextions/sessionns
> that build the indexes.

Usually there will always be swap, unless you've got toy indexes.

But swap I/O is all sequential I/O, with a good readahead setting
there should be no problem.

It's the final writing step that can be a bottleneck if you have a
lame I/O system and try to push 5 or 6 indexes at once.

pgsql-performance by date:

From: Marc Mamin
Date: 23 July 2014, 19:40:19
Subject: Re: Building multiple indexes on one table.

From: Felipe Santos
Date: 23 July 2014, 20:21:09
Subject: Re: Building multiple indexes on one table.

Re: Building multiple indexes on one table. - Mailing list pgsql-performance

Previous

Next