Home > mailing lists

Re: DIfferent plans for explicit versus implicit join using link table - Mailing list pgsql-general

From	John D. Burger
Subject	Re: DIfferent plans for explicit versus implicit join using link table
Date	April 26, 2007 23:25:44
Msg-id	CAE9E752-3F00-47A4-BC4A-AA1D6156011E@mitre.org Whole thread Raw
In response to	Re: DIfferent plans for explicit versus implicit join using link table (Tom Lane <tgl@sss.pgh.pa.us>)
Responses	Re: DIfferent plans for explicit versus implicit join using link table
List	pgsql-general

Tree view

Tom Lane replied:

>> I have two queries for looking up related words which I think should
>> be equivalent, but 7.4.8 comes up with very different plans.
>
> They're not at all equivalent:

> If there are duplicate word1id,word2id entries in allwordrelations,
> the
> first query will produce duplicate outputs; the second will not.

Ah, that should have been my second guess - whenever I fail to get
stuff like this, it's usually to do with either duplicates or NULLs.

> If there were a unique constraint on (word1id, word2id), in theory
> the planner could prove that the IN form could be simplified to a
> plain
> join, but there is no such logic in HEAD let alone 7.4, and in any
> case
> you've not got such a constraint.

But such would reflect the reality of my data, so it should be there.

> The plan that gets chosen is to forcibly unique-ify the (word1id,
> word2id) data (via a "sort | uniq"-like pipeline) and then do a normal
> join with that.  Which is expensive because allwordrelations is big.
> But the alternative is probably even worse: without that
> allwordrelations has to be joined to w1 and w2 simultaneously, meaning
> that the unconstrained cartesian product of w1 and w2 has to be formed
> first.

Hmm, but wouldn't it at least filter one side per my where clause:
w1.word = 'dogging'?  Anyway, thanks, the incremental enlightenment
continues.

- John Burger
   MITRE

pgsql-general by date:

From: "CAJ CAJ"
Date: 26 April 2007, 22:34:42
Subject: Re: PostgreSQL upgrade server A -> server B

From: Tom Lane
Date: 26 April 2007, 23:54:26
Subject: Re: DIfferent plans for explicit versus implicit join using link table

Re: DIfferent plans for explicit versus implicit join using link table - Mailing list pgsql-general

Previous

Next