Home > mailing lists

Re: [WIP] speeding up GIN build with parallel workers - Mailing list pgsql-hackers

From	Amit Kapila
Subject	Re: [WIP] speeding up GIN build with parallel workers
Date	March 16, 2016 12:38:44
Msg-id	CAA4eK1LVQj_AokerKcxsx9GrYaf-Ra_vFnmvLWb=aRpTpEz0bA@mail.gmail.com Whole thread
In response to	Re: [WIP] speeding up GIN build with parallel workers ("Constantin S. Pan" <kvapen@gmail.com>)
Responses	Re: [WIP] speeding up GIN build with parallel workers
List	pgsql-hackers

Tree view

On Wed, Mar 16, 2016 at 2:55 PM, Constantin S. Pan <kvapen@gmail.com> wrote:
>
> On Wed, 16 Mar 2016 12:14:51 +0530
> Amit Kapila <amit.kapila16@gmail.com> wrote:
>
> > On Wed, Mar 16, 2016 at 5:41 AM, Constantin S. Pan <kvapen@gmail.com>
> > wrote:
> > > 3. Tested on some real data (GIN index on email message body
> > > tsvectors). Here are the timings for different values of
> > > 'gin_shared_mem' and 'gin_parallel_workers' on a 4-CPU
> > > machine. Seems 'gin_shared_mem' has no visible effect.
> > >
> > > wnum mem(MB) time(s)
> > > 0 16 247
> > > 1 16 256
> > >
> >
> >
> > It seems from you data that with 1 worker, you are always seeing
> > slowdown, have you investigated the reason of same?
> >
>
> That slowdown is expected. It slows down because with 1 worker it
> has to transfer the results from the worker to the backend.
>
> The backend just waits for the results from the workers and merges them
> (in case wnum > 0).

Why backend just waits, why can't it does the same work as any worker does? In general, for other parallelism features the backend also behaves the same way as worker in producing the results if the results from workers is not available.

With Regards,
Amit Kapila.
EnterpriseDB: http://www.enterprisedb.com

pgsql-hackers by date:

From: Robert Haas
Date: 16 March 2016, 12:29:42
Subject: Re: Parallel Aggregate

From: otheus uibk
Date: 16 March 2016, 12:52:16
Subject: async replication code

Re: [WIP] speeding up GIN build with parallel workers - Mailing list pgsql-hackers

Previous

Next