increased max_parallel_workers_per_gather results in fewer workers? - Mailing list pgsql-performance

From Philip Semanchuk
Subject increased max_parallel_workers_per_gather results in fewer workers?
Date
Msg-id B2C0B20E-4562-4F0B-A876-922EA4AC22A8@americanefficient.com
Whole thread Raw
Responses Re: increased max_parallel_workers_per_gather results in fewerworkers?  (Justin Pryzby <pryzby@telsasoft.com>)
List pgsql-performance
Hi all,
I’ve been experimenting with some performance tuning on a particular query, and I observed a result that I don’t
understand. 

I’ve been setting max_parallel_workers_per_gather to values the range 1-6 and then running EXPLAIN ANALYZE to see how
much benefit we get from more parallelization. My data is organized by year, so the year is a parameter in the query’s
WHEREclause. 

For my 2018 data, Postgres launches as many workers as max_parallel_workers_per_gather permits, and the execution time
decreasesnicely, from 280 seconds with 1 worker all the way down to 141s with 6 workers. So far, so good. 

When I run the same query for our 2022 data, I get the same behavior (improvement) for max_parallel_workers_per_gather
valuesof 1-4. But with max_parallel_workers_per_gather set to 5 or 6, Postgres only uses 1 worker, and the execution
timeincreases dramatically, even worse than when I deliberately limit the number of workers to 1  — 

- max_parallel_workers_per_gather=1, runtime = 1061s
- max_parallel_workers_per_gather=2, runtime = 770s
- max_parallel_workers_per_gather=3, runtime = 637s
- max_parallel_workers_per_gather=4, runtime = 573s
- max_parallel_workers_per_gather=5, runtime = 1468s
- max_parallel_workers_per_gather=6, runtime = 1469s

Our 2022 data set is several times larger than our 2018 data, so I suspect some resource is getting exhausted, but I’m
notsure what. So far, this result has been 100% re-creatable. I’m on a dedicated test server with 16 virtual CPUs and
128GbRAM; no one else is competing with me for Postgres processes. max_worker_processes and max_parallel_workers are
bothset to 12. 

Can anyone help me understand why this happens, or where I might look for clues?

Thanks,
Philip


pgsql-performance by date:

Previous
From: Justin Pryzby
Date:
Subject: Re: When to use PARTITION BY HASH?
Next
From: Justin Pryzby
Date:
Subject: Re: increased max_parallel_workers_per_gather results in fewerworkers?