Home > mailing lists

Re: Multicolumn index - WHERE ... ORDER BY - Mailing list pgsql-performance

From	Tom Lane
Subject	Re: Multicolumn index - WHERE ... ORDER BY
Date	December 24, 2009 11:56:57
Msg-id	530.1261670200@sss.pgh.pa.us Whole thread Raw
In response to	Multicolumn index - WHERE ... ORDER BY (Lucas Maystre <lum@open.ch>)
List	pgsql-performance

Tree view

Lucas Maystre <lum@open.ch> writes:
> Example of a query I might have:
> SELECT id FROM mail WHERE from_address LIKE 'bill%'
> ORDER BY time DESC LIMIT 50;

> The solution I had in mind was to create a multicolumn index over
> 'from_address' and 'time':
> CREATE INDEX idx_from_time ON mail (from_address, time DESC);
> so that it could directly use the 'time' ordering and lookup only the
> first 50 rows using the index.

> but... it doesn't work :-) i.e. my multicolumn index is never used. So:
> - do you guys have any ideas why it doesn't work?

The from_address condition isn't simple equality, so the output of a
scan wouldn't be sorted by time --- it would have subranges that are
sorted, but that's no help overall.  You still have to read the whole
scan output and re-sort.  So this index has no advantage over the
smaller index on just from_address.

> - do you see an alternative solution?

There might be some use in an index on (time, from_address).  That
gives the correct time ordering, and at least the prefix part of the
from_address condition can be checked in the index without visiting the
heap.

            regards, tom lane

pgsql-performance by date:

From: Greg Smith
Date: 24 December 2009, 11:52:02
Subject: Re: SATA drives performance

From: gael@pilotsystems.net (Gaël Le Mignot)
Date: 24 December 2009, 12:19:44
Subject: Re: SATA drives performance

Re: Multicolumn index - WHERE ... ORDER BY - Mailing list pgsql-performance

Previous

Next