Home > mailing lists

Re: Planner regression in 9.1: min(x) cannot use partial index with NOT NULL - Mailing list pgsql-hackers

From	Tom Lane
Subject	Re: Planner regression in 9.1: min(x) cannot use partial index with NOT NULL
Date	March 21, 2011 23:02:11
Msg-id	10261.1300748520@sss.pgh.pa.us Whole thread Raw
In response to	Re: Planner regression in 9.1: min(x) cannot use partial index with NOT NULL (Tom Lane <tgl@sss.pgh.pa.us>)
Responses	Re: Planner regression in 9.1: min(x) cannot use partial index with NOT NULL Re: Planner regression in 9.1: min(x) cannot use partial index with NOT NULL
List	pgsql-hackers

Tree view

I wrote:
> Hmm.  We could possibly fix this by having planagg.c do a completely
> separate planner run for each aggregate, wherein it actually does build
> the "equivalent" query
>     SELECT col FROM tab WHERE existing-quals AND col IS NOT NULL
>     ORDER BY col ASC/DESC LIMIT 1
> and plan that.  That'd be less efficient than the current way,
> especially for cases where there are multiple aggregates, because there
> would be some duplication of processing between the per-aggregate
> planner runs and the main one.  But since we can only do this
> optimization for rather simple queries anyway, maybe it wouldn't matter
> much.

I studied the code some more, and I think this probably can be made to
work.  The basic idea is to have preprocess_minmax_aggregates build
simplified queries like the above (working by modifying the query tree
that exists at the point where it's called) and call query_planner on
them.  Save aside the resulting path data, then let the regular planning
process continue.  When optimize_minmax_aggregates is called, see
whether the regular plan is cheaper than the sum of the path costs.
If not, use the paths to construct a replacement plan, same as now.

The reason this should work is that query_planner() embodies pretty
much all the useful processing that happens between
preprocess_minmax_aggregates and optimize_minmax_aggregates --- the
other code in that stretch is mostly about grouping, which would disable
the minmax optimization anyway.  So no important steps will get left
out.  Of course, this introduces still more coupling between planagg.c
and planner.c, but I think that's probably tolerable.  The main
objection to this approach is having to do all the index analysis
N+1 times for an N-aggregate query.  I don't see any practical
alternative though if we want to make use of indexes that wouldn't be
used without the IS NOT NULL clause.
        regards, tom lane

pgsql-hackers by date:

From: Yeb Havinga
Date: 21 March 2011, 22:58:28
Subject: Re: Sync Rep and shutdown Re: Sync Rep v19

From: Tom Lane
Date: 21 March 2011, 23:06:36
Subject: Re: Chinese initdb on Windows

Re: Planner regression in 9.1: min(x) cannot use partial index with NOT NULL - Mailing list pgsql-hackers

Previous

Next