Home > mailing lists

Re: Optimizing "top queries" ... - Mailing list pgsql-hackers

From	Martijn van Oosterhout
Subject	Re: Optimizing "top queries" ...
Date	December 6, 2006 14:06:56
Msg-id	20061206180620.GB11211@svana.org Whole thread Raw
In response to	Re: Optimizing "top queries" ... (Gregory Stark <stark@enterprisedb.com>)
List	pgsql-hackers

Tree view

On Wed, Dec 06, 2006 at 03:34:02PM +0000, Gregory Stark wrote:
> One thing to keep in mind though is that I also wanted to cover the case of
> Unique(Sort(...)) and Limit(Unique(Sort(...))) which can throw away duplicates
> earlier. Do we want three different plan nodes? Are there other cases like
> these that can benefit?

I would think that we really don't want more than one type of sort
node. What I think would be good is the setting of Unique and Limit as
options to the Sort node. Instead of peeking across nodes at execution
time, have the optimizer merge them.

For Unique, you arrange to have the sort algorithm, if two nodes are
equal, to throw one of the tuples.

For Limit, you can do it by having the tapes stop writing after the
given number of rows.

However, I think there's a difference between Unique across all
columns, or Unique as in DISTINCT ON(). The latter would be easier to
integrate.

Have a nice day,
--
Martijn van Oosterhout   <kleptog@svana.org>   http://svana.org/kleptog/
> From each according to his ability. To each according to his ability to litigate.

pgsql-hackers by date:

From: Neil Conway
Date: 06 December 2006, 13:47:04
Subject: Re: [PATCHES] GUC description cleanup

From: Teodor Sigaev
Date: 06 December 2006, 14:42:02
Subject: Re: [PATCHES] Bundle of patches

Re: Optimizing "top queries" ... - Mailing list pgsql-hackers

Previous

Next