Re: [HACKERS] [POC] Faster processing at Gather node - Mailing list pgsql-hackers

From	Ants Aasma
Subject	Re: [HACKERS] [POC] Faster processing at Gather node
Date	November 16, 2017 18:23:12
Msg-id	CA+CSw_t-7H=DQXnkhqXPiW3TYi9iVyFERyOuq62+ScD0zosqFg@mail.gmail.com
In response to	Re: [HACKERS] [POC] Faster processing at Gather node (Robert Haas <robertmhaas@gmail.com>)
Responses	Re: [HACKERS] [POC] Faster processing at Gather node
List	pgsql-hackers

On Thu, Nov 16, 2017 at 6:42 AM, Robert Haas <robertmhaas@gmail.com> wrote:
> The problem here is that we have no idea how big the queue needs to
> be.  The workers will always be happy to generate tuples faster than
> the leader can read them, if that's possible, but it will only
> sometimes help performance to let them do so.   I think in most cases
> we'll end up allocating the local queue - because the workers can
> generate faster than the leader can read - but only occasionally will
> it make anything faster.

For the Gather Merge driven by Parallel Index Scan case, it seems to me
that the correct queue size is one that can store two index pages'
worth of tuples. Additional space will always help buffer any
performance variations, but there should be a step function somewhere
around 1+1/n_workers pages. I wonder if the queue could be dynamically
sized based on the driving scan, with some limits of course, as parent
nodes to the parallel index scan can increase the row count by
arbitrary amounts.
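
To make the sizing idea concrete, here is a rough sketch of what "dynamically sized based on the driving scan, with limits" might look like. It is only an illustration, not anything from the patch: the function names, the `tuples_per_index_page` and `queue_tuple_width` inputs, and the 64 kB floor and 1 MB cap are all assumptions made up for this example.

```python
import math

# Hypothetical limits for this sketch; neither value comes from the thread.
MIN_QUEUE_BYTES = 64 * 1024      # assumed floor: never go below a fixed default
MAX_QUEUE_BYTES = 1024 * 1024    # assumed cap: parent nodes may inflate row counts


def suggested_queue_bytes(tuples_per_index_page, queue_tuple_width,
                          pages=2.0,
                          min_bytes=MIN_QUEUE_BYTES,
                          max_bytes=MAX_QUEUE_BYTES):
    """Size a queue to hold `pages` index pages' worth of output tuples.

    tuples_per_index_page: estimated tuples produced per index page scanned
    queue_tuple_width:     estimated bytes per tuple as stored in the queue
    """
    wanted = math.ceil(pages * tuples_per_index_page * queue_tuple_width)
    # Clamp so a row-multiplying parent node cannot demand unbounded memory.
    return max(min_bytes, min(max_bytes, wanted))


def step_threshold_pages(n_workers):
    """Queue capacity, in index pages, around which the step is expected."""
    if n_workers < 1:
        raise ValueError("n_workers must be at least 1")
    return 1.0 + 1.0 / n_workers
```

With 400 tuples per page at 200 bytes each, the sketch asks for 160000 bytes; narrow tuples fall back to the floor and very wide ones are held at the cap.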

Regards,
Ants Aasma
--
Cybertec Schönig & Schönig GmbH
Gröhrmühlgasse 26, A-2700 Wiener Neustadt
Web: http://www.postgresql-support.de, http://www.cybertec.at
