Home > mailing lists

Re: Elusive segfault with 9.3.5 & query cancel - Mailing list pgsql-hackers

From	Peter Geoghegan
Subject	Re: Elusive segfault with 9.3.5 & query cancel
Date	December 6, 2014 01:11:25
Msg-id	CAM3SWZQHJu8x0aTeqE6g99AJsM1iG0Bb9Y8TH-gAjSA77oghbA@mail.gmail.com Whole thread Raw
In response to	Re: Elusive segfault with 9.3.5 & query cancel (Josh Berkus <josh@agliodbs.com>)
Responses	Re: Elusive segfault with 9.3.5 & query cancel (Jim Nasby <Jim.Nasby@BlueTreble.com>)
List	pgsql-hackers

Tree view

On Fri, Dec 5, 2014 at 1:29 PM, Josh Berkus <josh@agliodbs.com> wrote:
>> We made some changes which decreased query cancel (optimizing queries,
>> turning on hot_standby_feedback) and we haven't seen a segfault since
>> then.  As far as the user is concerned, this solves the problem, so I'm
>> never going to get a trace or a core dump file.
>
> Forgot a major piece of evidence as to why I think this is related to
> query cancel:  in each case, the segfault was preceeded by a
> multi-backend query cancel 3ms to 30ms beforehand.  It is possible that
> the backend running the query which segfaulted might have been the only
> backend *not* cancelled due to query conflict concurrently.
> Contradicting this, there are other multi-backend query cancels in the
> logs which do NOT produce a segfault.

I wonder if it would be useful to add additional instrumentation so
that even without a core dump, there was some cursory information
about the nature of a segfault.

Yes, doing something with a SIGSEGV handler is very scary, and there
are major portability concerns (e.g.
https://bugs.ruby-lang.org/issues/9654), but I believe it can be made
robust on Linux. For what it's worth, this open source project offers
that kind of functionality in the form of a library:
https://github.com/vmarkovtsev/DeathHandler

-- 
Peter Geoghegan

pgsql-hackers by date:

From: Josh Berkus
Date: 06 December 2014, 00:29:43
Subject: Re: Elusive segfault with 9.3.5 & query cancel

From: Adam Brightwell
Date: 06 December 2014, 01:21:22
Subject: Re: Role Attribute Bitmask Catalog Representation

Re: Elusive segfault with 9.3.5 & query cancel - Mailing list pgsql-hackers

Previous

Next