Home > mailing lists

Re: Spilling hashed SetOps and aggregates to disk - Mailing list pgsql-hackers

From	Jeff Davis
Subject	Re: Spilling hashed SetOps and aggregates to disk
Date	June 11, 2018 21:13:45
Msg-id	1528740825.8818.52.camel@j-davis.com Whole thread Raw
In response to	Re: Spilling hashed SetOps and aggregates to disk (Tomas Vondra <tomas.vondra@2ndquadrant.com>)
Responses	Re: Spilling hashed SetOps and aggregates to disk
List	pgsql-hackers

Tree view

On Mon, 2018-06-11 at 19:33 +0200, Tomas Vondra wrote:
> For example we hit the work_mem limit after processing 10% tuples, 
> switching to sort would mean spill+sort of 900GB of data. Or we
> might 
> say - hmm, we're 10% through, so we expect hitting the limit 10x, so 
> let's spill the hash table and then do sort on that, writing and
> sorting 
> only 10GB of data. (Or merging it in some hash-based way, per
> Robert's 
> earlier message.)

Your example depends on large groups and a high degree of group
clustering. That's fine, but it's a special case, and complexity does
have a cost, too.

Regards,
    Jeff Davis

pgsql-hackers by date:

From: Tom Lane
Date: 11 June 2018, 20:59:35
Subject: Re: Spilling hashed SetOps and aggregates to disk

From: Andrew Dunstan
Date: 11 June 2018, 21:31:11
Subject: Re: CF bug fix items

Re: Spilling hashed SetOps and aggregates to disk - Mailing list pgsql-hackers

Previous

Next