Home > mailing lists

Re: [HACKERS] Partition-wise aggregation/grouping - Mailing list pgsql-hackers

From	Jeevan Chalke
Subject	Re: [HACKERS] Partition-wise aggregation/grouping
Date	November 1, 2017 15:50:11
Msg-id	CAM2+6=X8VFpCvq=d2vYMxmTEmeC-Fb-_NwbZgvSALuBrDzU3hA@mail.gmail.com Whole thread
In response to	Re: [HACKERS] Partition-wise aggregation/grouping (Robert Haas <robertmhaas@gmail.com>)
Responses	Re: [HACKERS] Partition-wise aggregation/grouping
List	pgsql-hackers

Tree view

On Sat, Oct 28, 2017 at 3:07 PM, Robert Haas <robertmhaas@gmail.com> wrote:

On Fri, Oct 27, 2017 at 1:01 PM, Jeevan Chalke
<jeevan.chalke@enterprisedb.com> wrote:
> 1. Added separate patch for costing Append node as discussed up-front in the
> patch-set.
> 2. Since we now cost Append node, we don't need
> partition_wise_agg_cost_factor
> GUC. So removed that. The remaining patch hence merged into main
> implementation
> patch.
> 3. Updated rows in test-cases so that we will get partition-wise plans.

With 0006 applied, cost_merge_append() is now a little bit confused:

/*
* Also charge a small amount (arbitrarily set equal to operator cost) per
* extracted tuple. We don't charge cpu_tuple_cost because a MergeAppend
* node doesn't do qual-checking or projection, so it has less overhead
* than most plan nodes.
*/
run_cost += cpu_operator_cost * tuples;

/* Add MergeAppend node overhead like we do it for the Append node */
run_cost += cpu_tuple_cost * DEFAULT_APPEND_COST_FACTOR * tuples;

The first comment says that we don't add cpu_tuple_cost, and the
second one then adds half of it anyway.

Yep.
But as David reported earlier, if we remove the first part i.e. adding cpu_operator_cost per tuple, Merge Append will be preferred over an Append node unlike before. And thus, I thought of better having both, but no so sure. Should we remove that part altogether, or add both in a single statement with updated comments?

I think it's fine to have a #define for DEFAULT_APPEND_COST_FACTOR,
because as you say it's used twice, but I don't think that should be
exposed in cost.h; I'd make it private to costsize.c and rename it to
something like APPEND_CPU_COST_MULTIPLIER. The word DEFAULT, in
particular, seems useless to me, since there's no provision for it to
be overridden by a different value.

Agree. Will make that change.

What testing, if any, can we think about doing with this plan to make
sure it doesn't regress things? For example, if we do a TPC-H run
with partitioned tables and partition-wise join enabled, will any
plans change with this patch?

I have tried doing this on my local developer machine. For 1GB database size (tpc-h scale factor 1), I see no plan change with and without this patch.

I have tried with scale factor 10, but query is not executing well due to space and memory constraints. Can someone try out that?

Do they get faster or not? Anyone have
other ideas for what to test?

--
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company

Jeevan Chalke
Technical Architect, Product Development
EnterpriseDB Corporation
The Enterprise PostgreSQL Company

pgsql-hackers by date:

From: Chris Travers
Date: 01 November 2017, 15:43:35
Subject: Re: [HACKERS] Oracle to PostGre

From: Ildus Kurbangaliev
Date: 01 November 2017, 16:24:16
Subject: [HACKERS] proposal: extend shm_mq to support more use cases

Re: [HACKERS] Partition-wise aggregation/grouping - Mailing list pgsql-hackers

Previous

Next