Re: serverlog rotation/functions - Mailing list pgsql-hackers

From Andreas Pflug
Subject Re: serverlog rotation/functions
Date
Msg-id 40F85368.2090302@pse-consulting.de
Whole thread Raw
In response to Re: serverlog rotation/functions  (Bruce Momjian <pgman@candle.pha.pa.us>)
Responses Re: serverlog rotation/functions
Re: serverlog rotation/functions
List pgsql-hackers
Bruce Momjian wrote:
>>Ok, no limit (but a default maximum of 50k remains). And since it's 
>>superuser only, he hopefully knows what he does.
> 
> 
> Huh?  Why have a default maximum?

Just for convenience. Both start and size are optional parameters, but 
with start=0 and size=50000. Well, it's a very special function anyway, 
so we could require the user to supply all parameters. I'll remove it.

> 
> 
> Ah, I think it needs to close that as soon as it starts.  Don't other
> subprocesses do that?  That shared memory is very fragile and we don't
> want an errant pointer poking in there.

The result of an errant pointer writing to that shred mem would be

1) wrong pid for SysLogger, so it can't be signalled to rotate from backends
2) wrong timestamp, so backends don't know the latest logfile.

Nothing particularly crash prone really.


> This seems clean and fast enough to me:
> 
>     SELECT filename
>     FROM pg_dir_ls('/var/log')
>     ORDER BY 1 DESC
>     LIMIT 1

For a logfile listing function, this would look

SELECT MAX(startdate)  FROM pg_logfile_ls()


> 
> Considering that any query from a client is going to have to go through
> the parser and be executed, an 'ls' in a directory just isn't a
> measurable performance hit.
> 
> If you want run a test that does an 'ls' and one that doesn't to see
> that there is no measurable performance difference.
> 
So while a simple PG_RETURN_TIMESTAMP(logfiletimestamp) compared to a 
lengthy setof returning function is drastically faster, this appears 
much less drastic with parser overhead.

> I would not worry about the clock going backward.  PostgreSQL would have
> enough problems with timestamp columns moving backward that the file log
> times are the least of our problems.

I see, so the admin is in trouble anyway (what about PITR? Data column 
deviations appear harmless compared to restoration based on timestamps).

> 
> 
> You can't know that you aren't reading corrupt data if you read shared
> memory without a lock.  What if the write is happening as you read?

I thought about this quite a while.

If the shmem fields aren't written atomically (one is 32bit, one 64 bit, 
probably on dword boundaries so writing will happen at least processor 
bus wide, do we support any 16 bit processor?) the corruption 
consequences as above apply. In the case of the timestamp, the high word 
will rarely change anyway, only every 2^32 seconds...

Concurrent access on the logger pid would mean to call 
pg_logfile_rotate() while a killed logger is being restarted, which is 
creating a new logfile then anyway. This would send a SIGINT into outer 
space, maybe to the bgwriter triggering a checkpoint, or the postmaster 
shutting it down (gracefully, still unwanted).

BTW, the consequences of a trigger flag in shmem would be less because 
all that could happen was a log rotation (which appends to existing 
files, just in case syslogger died in the milliseconds after a rotation).

> 
> The only clean solution I can think of is to write an operating system
> file that contains the current log filename and read from that.  I
> believe such writes are atomic.  But again, this seems like overkill to
> me.

Ah wait.
Digging further behind SIGUSR1 I now *do* see a solution without pid in 
shmem, using SendPostmasterSignal. Well, a little hint from gurus would 
have helped...

I'll convert to this, *dropping* all shmem.


Regards,
Andreas


pgsql-hackers by date:

Previous
From: Simon Riggs
Date:
Subject: Re: Point in Time Recovery
Next
From: Simon Riggs
Date:
Subject: Re: Point in Time Recovery