Home > mailing lists

Re: eqjoinsel_semi still sucks ... - Mailing list pgsql-hackers

From	Andrey Lepikhov
Subject	Re: eqjoinsel_semi still sucks ...
Date	June 23, 2023 14:28:55
Msg-id	296c33dd-0523-b91a-36ce-6f5f470aae2c@postgrespro.ru Whole thread Raw
In response to	eqjoinsel_semi still sucks ... (Tom Lane <tgl@sss.pgh.pa.us>)
Responses	Re: eqjoinsel_semi still sucks ...
List	pgsql-hackers

Tree view

On 2/5/2012 20:34, Tom Lane wrote:
> On reflection I think that the idea of clamping ndistinct beforehand is
> just wrong, and what we ought to do instead is apply a multiplier to the
> selectivity estimate afterwards.  In the case of a base rel we could
> just multiply by the selectivity of its baserestrictinfo list.  For join
> rels it's a bit harder to guess how much a given input relation might
> have been decimated, but if the join's estimated size is smaller than
> the output size of the base rel the correlation var came from, we could
> multiply by that ratio (on top of whatever correction came from the base
> rel's restriction clauses).
I got stuck in some cases where (due to a tree of filters) the planner 
underestimates the JOIN just because the ndistinct conveys a huge number 
to the selectivity estimation formula. However, the estimation of both 
input relations is made correctly and is limited.
I've tried to understand the logic through commits 0d3b231eebf, 
97930cf578e and 7f3eba30c9d. But it is still not clear.
So, why the idea of clamping ndistinct is terrible in general? Could you 
explain your reasons a bit more?

-- 
regards,
Andrey Lepikhov
Postgres Professional

pgsql-hackers by date:

From: Andrew Dunstan
Date: 23 June 2023, 14:28:52
Subject: Re: Migration database from mysql to postgress

From: Andrew Dunstan
Date: 23 June 2023, 14:47:50
Subject: Re: Do we want a hashset type?

Re: eqjoinsel_semi still sucks ... - Mailing list pgsql-hackers

Previous

Next