Home > mailing lists

Lazy JIT IR code generation to increase JIT speed with partitions - Mailing list pgsql-hackers

From	Luc Vlaming
Subject	Lazy JIT IR code generation to increase JIT speed with partitions
Date	December 28, 2020 08:44:26
Msg-id	244ee08c-5e26-45d1-8d10-d7b4d16b08ae@swarm64.com Whole thread Raw
Responses	Re: Lazy JIT IR code generation to increase JIT speed with partitions
List	pgsql-hackers

Tree view

Hi,

I would like to propose a small patch to the JIT machinery which makes 
the IR code generation lazy. The reason for postponing the generation of 
the IR code is that with partitions we get an explosion in the number of 
JIT functions generated as many child tables are involved, each with 
their own JITted functions, especially when e.g. partition-aware 
joins/aggregates are enabled. However, only a fraction of those 
functions is actually executed because the Parallel Append node 
distributes the workers among the nodes. With the attached patch we get 
a lazy generation which makes that this is no longer a problem.

For benchmarks I have in TPC-H and TPC-DS like queries with partitioning 
by hash seen query runtimes increase by 20+ seconds even on the simpler 
queries. Also I created a small benchmark to reproduce the case easily 
(see attached sql file):

without patch, using 7 launched workers:
- without jit: ~220ms
- with jit: ~1880ms
without patch, using 50 launched workers:
- without jit: ~280ms
- with jit: ~3400ms

with patch, using 7 launched workers:
- without jit: ~220ms
- with jit: ~590ms

with patch, using 50 launched workers:
- without jit: ~280ms
- with jit: ~530ms

Thoughts?

With Regards,
Luc Vlaming
Swarm64

Attachment

pgsql-hackers by date:

From: Masahiko Sawada
Date: 28 December 2020, 08:43:26
Subject: Re: Add table AM 'tid_visible'

From: Masahiko Sawada
Date: 28 December 2020, 08:48:26
Subject: Re: Parallel Full Hash Join

Lazy JIT IR code generation to increase JIT speed with partitions - Mailing list pgsql-hackers

Attachment

Previous

Next