Home > mailing lists

Re: Parallel INSERT (INTO ... SELECT ...) - Mailing list pgsql-hackers

From	Greg Nancarrow
Subject	Re: Parallel INSERT (INTO ... SELECT ...)
Date	February 4, 2021 15:05:17
Msg-id	CAJcOf-fb3ZZf+bx-EwKREcOASksk5Tqw0ncxtYr4Pq+M=wjJmA@mail.gmail.com Whole thread Raw
In response to	RE: Parallel INSERT (INTO ... SELECT ...) ("tsunakawa.takay@fujitsu.com" <tsunakawa.takay@fujitsu.com>)
List	pgsql-hackers

Tree view

On Thu, Feb 4, 2021 at 11:56 AM tsunakawa.takay@fujitsu.com
<tsunakawa.takay@fujitsu.com> wrote:
>
> >
> > So, the results indicate that after the patch we touch more buffers
> > during planning which I think is because of accessing the partition
> > information, and during execution, the patch touches fewer buffers for
> > the same reason. But why this can reduce the time with patch? I think
> > this needs some investigation.
>
> I guess another factor other than shared buffers is relcache and catcache.  The patched version loads those cached
entriesfor all partitions of the insert target table during the parallel-safety check in planning, while the unpatched
versionhas to gradually build those cache entries during execution.  How can wee confirm its effect? 
>

I believe that we can confirm its effect by invalidating relcache and
catcache, in both the patched and unpatched versions, just after the
parallel-safety checks are performed in the planner, and then running
tests and comparing the performance.

So that's exactly what I did (adding a call to
InvalidateSystemCaches() just after the parallel-safety checks in the
planner).
I found that then the unpatched version always performed better than
the patched version for tests inserting 1000 records into a table with
100,200,500 and 1000 partitions.
Looking at the breakdown of the timing for each Insert, the Planning
Time was always significantly more for the patched version (expected,
because it does extra checks), but the Execution Time was very similar
for both the patched and unpatched versions.

Regards,
Greg Nancarrow
Fujitsu Australia

pgsql-hackers by date:

From: Amit Kapila
Date: 04 February 2021, 14:33:37
Subject: Re: DROP TABLE can crash the replication sync worker

From: Dilip Kumar
Date: 04 February 2021, 15:08:21
Subject: Re: Is Recovery actually paused?

Re: Parallel INSERT (INTO ... SELECT ...) - Mailing list pgsql-hackers

Previous

Next