Home > mailing lists

Re: Further pg_upgrade analysis for many tables - Mailing list pgsql-hackers

From	Jeff Janes
Subject	Re: Further pg_upgrade analysis for many tables
Date	November 14, 2012 20:40:45
Msg-id	CAMkU=1zj1hNnmUGk+s4i2qf3NM-ihZYdBbMg_h2ifwDt44eoTQ@mail.gmail.com Whole thread Raw
In response to	Re: Further pg_upgrade analysis for many tables (Tom Lane <tgl@sss.pgh.pa.us>)
Responses	Re: Further pg_upgrade analysis for many tables (Tom Lane <tgl@sss.pgh.pa.us>)
List	pgsql-hackers

Tree view

On Thu, Nov 8, 2012 at 9:50 PM, Tom Lane <tgl@sss.pgh.pa.us> wrote:
> Jeff Janes <jeff.janes@gmail.com> writes:
>> Are sure the server you are dumping out of is head?
>
> I experimented a bit with dumping/restoring 16000 tables matching
> Bruce's test case (ie, one serial column apiece).  The pg_dump profile
> seems fairly flat, without any easy optimization targets.  But
> restoring the dump script shows a rather interesting backend profile:
>
> samples  %        image name               symbol name
> 30861    39.6289  postgres                 AtEOXact_RelationCache
> 9911     12.7268  postgres                 hash_seq_search
...
>
> The hash_seq_search time is probably mostly associated with
> AtEOXact_RelationCache, which is run during transaction commit and scans
> the relcache hashtable looking for tables created in the current
> transaction.  So that's about 50% of the runtime going into that one
> activity.
>
> There are at least three ways we could whack that mole:
>
> * Run the psql script in --single-transaction mode, as I was mumbling
> about the other day.  If we were doing AtEOXact_RelationCache only once,
> rather than once per CREATE TABLE statement, it wouldn't be a problem.
> Easy but has only a narrow scope of applicability.
>
> * Keep a separate list (or data structure of your choice) so that
> relcache entries created in the current xact could be found directly
> rather than having to scan the whole relcache.  That'd add complexity
> though, and could perhaps be a net loss for cases where the relcache
> isn't so bloated.

Maybe a static list that can overflow, like the ResourceOwner/Lock
table one recently added.  The overhead of that should be very low.

Are the three places where "need_eoxact_work = true;" the only places
where things need to be added to the new structure?  It seems like
there is no need to remove things from the list, because the things
done in AtEOXact_RelationCache are idempotent.


> * Limit the size of the relcache (eg by aging out
> not-recently-referenced entries) so that we aren't incurring O(N^2)
> costs for scripts touching N tables.  Again, this adds complexity and
> could be counterproductive in some scenarios.

I made the crude hack of just dumping the relcache whenever it was
>1000 at eox.  The time to load 100,000 tables went from 62 minutes
without the patch to 12 minutes with it.  (loading with "-1 -f" took
23 minutes).

The next quadratic behavior is in init_sequence.

Cheers,

Jeff

diff --git a/src/backend/utils/cache/relcache.c
b/src/backend/utils/cache/relcache.c
index 8c9ebe0..3941c98 100644
--- a/src/backend/utils/cache/relcache.c
+++ b/src/backend/utils/cache/relcache.c
@@ -2260,6 +2260,8 @@ AtEOXact_RelationCache(bool isCommit)               )               return;

+        if (hash_get_num_entries(RelationIdCache)>1000)
{RelationCacheInvalidate();}

pgsql-hackers by date:

From: Bruce Momjian
Date: 14 November 2012, 20:22:16
Subject: Re: Enabling Checksums

From: Bruce Momjian
Date: 14 November 2012, 20:55:04
Subject: Re: WIP checksums patch

Re: Further pg_upgrade analysis for many tables - Mailing list pgsql-hackers

Previous

Next