Re: Reducing memory consumed by RestrictInfo list translations in partitionwise join planning - Mailing list pgsql-hackers

From Amit Langote
Subject Re: Reducing memory consumed by RestrictInfo list translations in partitionwise join planning
Date
Msg-id CA+HiwqEuGduyQZd-ZiGn1rxjT6uPgdnMpDyYRni7RYMN-uR1eg@mail.gmail.com
Whole thread Raw
In response to Re: Reducing memory consumed by RestrictInfo list translations in partitionwise join planning  (Ashutosh Bapat <ashutosh.bapat.oss@gmail.com>)
Responses Re: Reducing memory consumed by RestrictInfo list translations in partitionwise join planning
List pgsql-hackers
On Mon, Mar 17, 2025 at 5:47 PM Ashutosh Bapat
<ashutosh.bapat.oss@gmail.com> wrote:
> On Fri, Mar 14, 2025 at 5:36 PM Amit Langote <amitlangote09@gmail.com> wrote:
> > Thanks for the patch and the extensive benchmarking.
> >
> > Would you please also share a simple, self-contained example that I
> > can use to reproduce and verify the performance improvements? It’s
> > helpful to first see the patch in action with a representative query,
> > and then refer to the more extensive benchmark results you've already
> > shared as needed. I'm also having a hard time telling from the scripts
> > which query was used to produce the numbers in the report.
> >
>
> Here are steps
> 1. Run setup.sql attached in [1]. It will add a few helper functions
> and create required tables (partitioned and non-partitioned ones).
> 2. The sheet named "rae data" attached to [1] has queries whose
> performance is measured. They use the tables created by setup.sql.
>
> You may further use the same scripts to run benchmarks.

Thanks for pointing me to that.

I ran a couple of benchmarks of my own as follows.

cat benchmark_amit.sh
for p in 0 16 32 64 128 256 512 1024; do
  echo -ne "$p\t";
  pgbench -i --partitions=$p > /dev/null 2>&1
  pgbench -n -T 30 -f /tmp/query.sql | grep latency | awk '{print $4}';
done

For a 3-way join:
cat /tmp/query.sql
select * from pgbench_accounts t1, pgbench_accounts t2,
pgbench_accounts t3 where t2.aid = t1.aid AND t3.aid = t2.aid;

nparts  master      patched     %change
0       35.508      36.066      1.571
16      66.79       67.704      1.368
32      67.774      68.179      0.598
64      51.023      50.471     -1.082
128     56.4        55.759     -1.137
256     70.134      68.401     -2.471
512     120.621     113.552    -5.861
1024    405.339     312.726    -22.848

For a 6-way jon
cat /tmp/query.sql
select * from pgbench_accounts t1, pgbench_accounts t2,
pgbench_accounts t3, pgbench_accounts t4, pgbench_accounts t5,
pgbench_accounts t6 where t2.aid = t1.aid AND t3.aid = t2.aid and
t4.aid = t3.aid and t5.aid = t4.aid and t6.aid = t5.aid;

nparts  master      patched     %change
0       66.144      64.932     -1.832
16      100.874     100.491    -0.380
32      104.645     104.536    -0.104
64      114.415     109.193    -4.564
128     145.422     130.458    -10.290
256     273.761     209.919    -23.320
512     1359.896    616.295    -54.681
1024    7183.765    2857.086   -60.229

-60% means 60% reduction in latency due to the patch.

As others have already found, performance improvements become
substantial as both partition count and join depth increase. The patch
seems to have minimal impact at low partition counts and low join
complexity -- base cases (e.g., 0–32 partitions, 3-way joins) are
essentially unchanged, which is good to see.

I haven’t measured memory increase from the patch, but let me know if
that has already been evaluated and shown not to be a showstopper.

I also noticed using perf that create_join_clause() is a hotspot when
running without the patch, especially at high partition counts (> 500)
and the more join relations.

Let me know if my methodology seems off or if the results look reasonable.

--
Thanks, Amit Langote



pgsql-hackers by date:

Previous
From: Shubham Khanna
Date:
Subject: Re: Adding a '--clean-publisher-objects' option to 'pg_createsubscriber' utility.
Next
From: Jakub Wartak
Date:
Subject: Re: BitmapHeapScan streaming read user and prelim refactoring