Home > mailing lists

Re: Optimisation of INTERSECT expressions - Mailing list pgsql-performance

From	Stephan Szabo
Subject	Re: Optimisation of INTERSECT expressions
Date	March 23, 2004 13:50:54
Msg-id	20040323063650.X85869@megazone.bigpanda.com Whole thread Raw
In response to	Optimisation of INTERSECT expressions ("Phil Endecott" <spam_from_postgresql_lists@chezphil.org>)
Responses	Re: Optimisation of INTERSECT expressions
List	pgsql-performance

Tree view

On Tue, 23 Mar 2004, Phil Endecott wrote:

> Dear PostgresQL Experts,
>
> I am trying to get to the bottom of some efficiency problems and hope that
> you can help.  The difficulty seems to be with INTERSECT expressions.
>
> I have a query of the form
>      select A from T where C1 intersect select A from T where C2;
> It runs in about 100 ms.
>
> But it is equivalent to this query
>      select A from T where C1 and C2;
> which runs in less than 10 ms.
>
> Looking at the output of "explain analyse" on the first query, it seems
> that PostgresQL always computes the two sub-expressions and then computes
> an explicit intersection on the results.  I had hoped that it would notice
> that both subexpressions are scanning the same input table T and convert
> the expression to the second form.
>
> Is there a reason why it can't do this transformation?

Probably because noone's bothered to try to prove under what conditions
it's the same.

For example, given a non-unique A, the two queries can give different
answers (if say the same two A values match both C1 and C2 in different
rows how many output rows does each give? *), also given a non-stable A
(for example random) the two queries are not necessarily equivalent.

pgsql-performance by date:

From: "Phil Endecott"
Date: 23 March 2004, 13:12:04
Subject: Optimisation of INTERSECT expressions

From: Stephan Szabo
Date: 23 March 2004, 14:14:46
Subject: Re: Optimisation of INTERSECT expressions

Re: Optimisation of INTERSECT expressions - Mailing list pgsql-performance

Previous

Next