Re: Improving connection scalability: GetSnapshotData() - Mailing list pgsql-hackers

From Tom Lane
Subject Re: Improving connection scalability: GetSnapshotData()
Msg-id 1006917.1597602624@sss.pgh.pa.us
In response to Re: Improving connection scalability: GetSnapshotData()  (Andres Freund <andres@anarazel.de>)
Responses Re: Improving connection scalability: GetSnapshotData()  (Andres Freund <andres@anarazel.de>)
Re: Improving connection scalability: GetSnapshotData()  (Tom Lane <tgl@sss.pgh.pa.us>)
List pgsql-hackers
Andres Freund <andres@anarazel.de> writes:
> 690 successful runs later, it didn't trigger for me :(. Seems pretty
> clear that there's another variable than pure chance, otherwise it seems
> like that number of runs should have hit the issue, given the number of
> bf hits vs bf runs.

It seems entirely likely that there's a timing component in this, for
instance autovacuum coming along at just the right time.  It's not too
surprising that some machines would be more prone to show that than
others.  (Note peripatus is FreeBSD, which we've already learned has
significantly different kernel scheduler behavior than Linux.)

> My current plan is to push a bit of additional instrumentation to
> help narrow down the issue.

+1

            regards, tom lane


