Thread: pg_test_fsync crashes on systems with POSIX signal handling

pg_test_fsync crashes on systems with POSIX signal handling

From
Tom Lane
Date:
On my old HPUX box:

$ ./pg_test_fsync
2 seconds per test
Direct I/O is not supported on this platform.

Compare file sync methods using one 8kB write:
(in wal_sync_method preference order, except fdatasync
is Linux's default)       open_datasync                     165.122 ops/sec (  6056 microsecs/op)       fdatasync
               Alarm call
 
$ echo $?
142        -- that's SIGALRM

The reason it's failing is that according to the traditional (not BSD)
definition of signal(2), the signal handler is reset to SIG_DFL when the
signal is delivered.  So the second occurrence of SIGALRM doesn't call
the signal handler but just crashes the process.

The quick-and-dirty fix for this is to just copy pqsignal() into
pg_test_fsync, and use that instead of calling signal() directly.
I wonder though if we shouldn't move that function into libpgport.
Thoughts?
        regards, tom lane



Re: pg_test_fsync crashes on systems with POSIX signal handling

From
Bruce Momjian
Date:
On Fri, Mar 15, 2013 at 03:05:54PM -0400, Tom Lane wrote:
> On my old HPUX box:
> 
> $ ./pg_test_fsync
> 2 seconds per test
> Direct I/O is not supported on this platform.
> 
> Compare file sync methods using one 8kB write:
> (in wal_sync_method preference order, except fdatasync
> is Linux's default)
>         open_datasync                     165.122 ops/sec (  6056 microsecs/op)
>         fdatasync                       Alarm call
> $ echo $?
> 142        -- that's SIGALRM
> 
> The reason it's failing is that according to the traditional (not BSD)
> definition of signal(2), the signal handler is reset to SIG_DFL when the
> signal is delivered.  So the second occurrence of SIGALRM doesn't call
> the signal handler but just crashes the process.
> 
> The quick-and-dirty fix for this is to just copy pqsignal() into
> pg_test_fsync, and use that instead of calling signal() directly.
> I wonder though if we shouldn't move that function into libpgport.
> Thoughts?

Well, the Win32 signal handler is already in port, so moving the Unix
one seems to make sense, i.e. the comment above pgsignal says:
/* Win32 signal handling is in backend/port/win32/signal.c */

--  Bruce Momjian  <bruce@momjian.us>        http://momjian.us EnterpriseDB
http://enterprisedb.com
 + It's impossible for everything to be true. +



Re: pg_test_fsync crashes on systems with POSIX signal handling

From
Tom Lane
Date:
Bruce Momjian <bruce@momjian.us> writes:
> On Fri, Mar 15, 2013 at 03:05:54PM -0400, Tom Lane wrote:
>> The quick-and-dirty fix for this is to just copy pqsignal() into
>> pg_test_fsync, and use that instead of calling signal() directly.
>> I wonder though if we shouldn't move that function into libpgport.
>> Thoughts?

> Well, the Win32 signal handler is already in port, so moving the Unix
> one seems to make sense, i.e. the comment above pgsignal says:
>     /* Win32 signal handling is in backend/port/win32/signal.c */

Done, though it was a bit more painful than I expected --- I seem to
have guessed completely wrong about where the portability hazards were.
Good thing we have a buildfarm.
        regards, tom lane