Home > mailing lists

Re: [HACKERS] Print correct startup cost for the group aggregate. - Mailing list pgsql-hackers

From	Ashutosh Bapat
Subject	Re: [HACKERS] Print correct startup cost for the group aggregate.
Date	March 6, 2017 15:25:03
Msg-id	CAFjFpRfvJEsStm3trC7h_GGDPC-YMqaXMjGUHo5e4QMMtYRr5g@mail.gmail.com Whole thread Raw
In response to	Re: [HACKERS] Print correct startup cost for the group aggregate. (Rushabh Lathia <rushabh.lathia@gmail.com>)
List	pgsql-hackers

Tree view

>
>
> I understood you reasoning of why startup_cost = input_startup_cost and not
> input_total_cost for aggregation by sorting. But what I didn't understand is
> how come higher startup cost for aggregation by sorting would force hash
> aggregation to be chosen? I am not clear about this part.

See this comment in cost_agg()    * Note: in this cost model, AGG_SORTED and AGG_HASHED have exactly the    * same
totalCPU cost, but AGG_SORTED has lower startup cost.  If the    * input path is already sorted appropriately,
AGG_SORTEDshould be    * preferred (since it has no risk of memory overflow).
 

AFAIU, if the input is already sorted, aggregation by sorting and
aggregation by hashing will have almost same cost, the startup cost of
AGG_SORTED being lower than AGG_HASHED. Because of lower startup cost,
AGG_SORTED gets chosen by add_path() over AGG_HASHED path.

-- 
Best Wishes,
Ashutosh Bapat
EnterpriseDB Corporation
The Postgres Database Company

pgsql-hackers by date:

From: Kyotaro HORIGUCHI
Date: 06 March 2017, 15:20:06
Subject: Re: [HACKERS] Restricting maximum keep segments by repslots

From: Amit Langote
Date: 06 March 2017, 15:41:02
Subject: Re: [HACKERS] UPDATE of partition key

Re: [HACKERS] Print correct startup cost for the group aggregate. - Mailing list pgsql-hackers

Previous

Next