Home > mailing lists

Re: Issue with the PRNG used by Postgres - Mailing list pgsql-hackers

From	Andres Freund
Subject	Re: Issue with the PRNG used by Postgres
Date	April 11 22:52:11
Msg-id	20240411195211.u3e74ejcww7wpqnn@awork3.anarazel.de Whole thread Raw
In response to	Re: Issue with the PRNG used by Postgres (Robert Haas <robertmhaas@gmail.com>)
Responses	Re: Issue with the PRNG used by Postgres Re: Issue with the PRNG used by Postgres
List	pgsql-hackers

Tree view

Hi,

On 2024-04-11 15:24:28 -0400, Robert Haas wrote:
> On Wed, Apr 10, 2024 at 9:53 PM Tom Lane <tgl@sss.pgh.pa.us> wrote:
> > Maybe we should rip out the whole mechanism and hard-wire
> > spins_per_delay at 1000 or so.
> 
> Or, rip out the whole, whole mechanism and just don't PANIC.

I continue believe that that'd be a quite bad idea.

My suspicion is that most of the false positives are caused by lots of signals
interrupting the pg_usleep()s. Because we measure the number of delays, not
the actual time since we've been waiting for the spinlock, signals
interrupting pg_usleep() trigger can very significantly shorten the amount of
time until we consider a spinlock stuck.  We should fix that.

> To believe that the PANIC is the right idea, we have to suppose that
> we have stuck-spinlock bugs that people actually hit, but that those
> people don't hit them often enough to care, as long as the system
> resets when the spinlock gets stuck, instead of hanging. I can't
> completely rule out the existence of either such bugs or such people,
> but I'm not aware of having encountered them.

I don't think that's a fair description of the situation. It supposes that the
alternative to the PANIC is that the problem is detected and resolved some
other way. But, depending on the spinlock, the problem will not be detected by
automated checks for the system being up. IME you end up with a system that's
degraded in a complicated hard to understand way, rather than one that's just
down.

Greetings,

Andres Freund

pgsql-hackers by date:

From: Robert Haas
Date: 11 April, 22:24:28
Subject: Re: Issue with the PRNG used by Postgres

From: Corey Huinker
Date: 11 April, 22:54:07
Subject: Re: Statistics Import and Export

Re: Issue with the PRNG used by Postgres - Mailing list pgsql-hackers

Previous

Next