Home > mailing lists

Re: BUG #15290: Stuck Parallel Index Scan query - Mailing list pgsql-bugs

From	Victor Yegorov
Subject	Re: BUG #15290: Stuck Parallel Index Scan query
Date	July 23, 2018 14:42:52
Msg-id	CAGnEboi_50VyFM02y7z9gpgVkdFWx5JMieeSDTaJ88qmqFBvoQ@mail.gmail.com Whole thread
In response to	Re: BUG #15290: Stuck Parallel Index Scan query (Thomas Munro <thomas.munro@enterprisedb.com>)
Responses	Re: BUG #15290: Stuck Parallel Index Scan query
List	pgsql-bugs

Tree view

пн, 23 июл. 2018 г. в 11:47, Thomas Munro <thomas.munro@enterprisedb.com>:

On Mon, Jul 23, 2018 at 7:57 PM, Victor Yegorov <vyegorov@gmail.com> wrote:
> - `ERROR: canceling statement due to conflict with recovery`, happened
> right when our problematic query started, same user

Ok, so that would explain how the master was cancelled. In 2877's
stack we see that it was aborting here:

Right:

ERROR: canceling statement due to conflict with recovery

DETAIL: User was holding shared buffer pin for too long.

#11 0x00007f539697ba5e in PostgresMain (argc=1,
argv=argv@entry=0x7f5398d1bbc8, dbname=0x7f5398d1bb98 "coub",
username=0x7f5398d1bbb0 "app") at
/build/postgresql-10-U6N320/postgresql-10-10.4/build/../src/backend/tcop/postgres.c:3879

That line calls AbortCurrentTransaction(), just after the call to
EmitErrorReport() that wrote something in your log. Andres's theory
(interrupts 'held') seems promising... perhaps there could be a bug
where parallel index scans leak a share-locked page or something like
that. I tried to reproduce this a bit, but no cigar so far. I wonder
if there could be something about your bloated index that reaches
buggy behaviour...

If you happen to have a core file for a worker that is waiting in
ConditionVariableSleep(), or it happens again, you'd be able to see if
an LWLock is causing this by printing num_held_lwlocks.

No, we do not have core files around. And so far I was not able to reproduce this situation.
I will keep monitoring. In case I'll hit it again — what else (except for num_held_lwlocks)
should I check for?

Victor Yegorov

pgsql-bugs by date:

From: Thomas Munro
Date: 23 July 2018, 11:46:33
Subject: Re: BUG #15290: Stuck Parallel Index Scan query

From: Moshe Jacobson
Date: 23 July 2018, 22:13:41
Subject: Re: pg_restore: All GRANTs on table fail when any one role is missing

Re: BUG #15290: Stuck Parallel Index Scan query - Mailing list pgsql-bugs

Previous

Next