Re: Queries joining views - Mailing list pgsql-general

From Tom Lane
Subject Re: Queries joining views
Date
Msg-id 2072.1156251397@sss.pgh.pa.us
Whole thread Raw
In response to Re: Queries joining views  (Alban Hertroys <alban@magproductions.nl>)
Responses Re: Queries joining views  (Alban Hertroys <alban@magproductions.nl>)
List pgsql-general
Alban Hertroys <alban@magproductions.nl> writes:
> zorgweb_solaris=> select * from pg_stats where attname = 'number' and
> tablename IN ('mm_insrel_table', 'mm_product_table', 'mm_object');

> tablename         | mm_product_table
> histogram_bounds  | {2930,3244,3558,3872,4186,4500,4814,5128,5442,5756,6070}

> tablename         | mm_insrel_table
> {615920,689286,750855,812003,872741,933041,1004672,1068250,1134894,1198559,1261685}

> tablename         | mm_object
> histogram_bounds  |
> {287,124412,256534,375896,505810,643940,770327,899229,1028933,1153260,1262338}

OK, so here's our problem: according to those stats, the ranges of
"number" in mm_product_table and mm_insrel_table don't overlap at all.
So the cost model for mergejoin predicts that a mergejoin on "number"
will have to read all of mm_product_table but only the first record from
mm_insrel_table, and given the difference in size of the two tables,
that looks like a pretty good deal.

Given that the plan is not actually very fast, I suppose that the
histogram is not telling the whole truth --- probably there are a few
outlying records in one table or the other causing there to be a more
significant overlap than the planner expects.  If so, you can probably
fix it by increasing the statistics target for that table.

            regards, tom lane

pgsql-general by date:

Previous
From: Karsten Hilbert
Date:
Subject: GNUmed release (uses PostgreSQL)
Next
From: Alban Hertroys
Date:
Subject: Re: Queries joining views