parallelizing subplan execution (was: explain and PARAM_EXEC) - Mailing list pgsql-hackers

From Dimitri Fontaine
Subject parallelizing subplan execution (was: explain and PARAM_EXEC)
Date
Msg-id m2tytc3uga.fsf_-_@hi-media.com
Whole thread Raw
In response to Re: explain and PARAM_EXEC  (Robert Haas <robertmhaas@gmail.com>)
Responses Re: parallelizing subplan execution (was: explain and PARAM_EXEC)
List pgsql-hackers
Robert Haas <robertmhaas@gmail.com> writes:
> On Sat, Feb 20, 2010 at 6:57 AM, Dimitri Fontaine
> <dfontaine@hi-media.com> wrote:
>> How much does this stuff is dependent on the current state of the
>> backend?
>
> A whole lot.

Bad news.

>> Ok that's a far stretch from the question at hand, but would that be a
>> plausible approach to have parallel queries in PostgreSQL ?
>
> This is really a topic for another thread, but at 100,000 feet it
> seems to me that the hardest question is - how will you decide which
> operations to parallelize in the first place?  Actually making it
> happen is really hard, too, of course, but even to get that that point
> you have to have some model for what types of operations it makes
> sense to parallelize and how you're going to decide when it's a win.

My naive thoughts would be to add some cost parameters. The fact to
fork() another backend first, then model for each supported subplan (we
will want to add more, or maybe have a special rendez-vous-materialise
node) some idea of the data exchange cost.

Now the planner would as usual try to find the less costly plan, and
will be able to compare plans with and without distributing the work.

Overly naive ?

Regards,
--
dim


pgsql-hackers by date:

Previous
From: Robert Haas
Date:
Subject: Re: explain and PARAM_EXEC
Next
From: Tom Lane
Date:
Subject: Re: explain and PARAM_EXEC