Thread: Streaming replication, some small issues

Streaming replication, some small issues

From

Heikki Linnakangas

Date:

08 December 2009, 07:31:20

A couple of small issues spotted while reviewing the streaming
replication patch:

- Because sentPtr is initialized to zeros, GetOldestWALSendPointer will
return zero before a just-launched WAL sender has sent its first
message. That can lead to WAL files that are still needed by another
standby to be deleted prematurely.

- If a WAL file is not found in the master for some reason, standby goes
into an infinite loop retrying it:

ERROR:  could not read xlog records: FATAL:  could not open file
"pg_xlog/000000010000000000000000" (log file 0, segment 0): No such file
or directory
ERROR:  could not read xlog records: FATAL:  could not open file
"pg_xlog/000000010000000000000000" (log file 0, segment 0): No such file
or directory
ERROR:  could not read xlog records: FATAL:  could not open file
"pg_xlog/000000010000000000000000" (log file 0, segment 0): No such file
or directory

...
- It's possible to shut down master, change max_wal_senders to 0,
restart and do an operation like CLUSTER which then skips WAL-logging.
Then shutdown, change max_wal_senders back to non-zero. All this while
the standby is running. Leads to a corrupt standby.


I've also pushed a couple of small cosmetic changes to replication
branch at git://git.postgresql.org/git/users/heikki/postgres.git

I'll continue reviewing...

--  Heikki Linnakangas EnterpriseDB   http://www.enterprisedb.com

Re: Streaming replication, some small issues

From

Fujii Masao

Date:

08 December 2009, 11:20:25

On Tue, Dec 8, 2009 at 5:30 PM, Heikki Linnakangas
<heikki.linnakangas@enterprisedb.com> wrote:
> A couple of small issues spotted while reviewing the streaming
> replication patch:

Thanks for the review!

> - Because sentPtr is initialized to zeros, GetOldestWALSendPointer will
> return zero before a just-launched WAL sender has sent its first
> message. That can lead to WAL files that are still needed by another
> standby to be deleted prematurely.

Oops! I fixed that (in my git repository, see the bottom of this mail).

> - If a WAL file is not found in the master for some reason, standby goes
> into an infinite loop retrying it:
>
> ERROR:  could not read xlog records: FATAL:  could not open file
> "pg_xlog/000000010000000000000000" (log file 0, segment 0): No such file
> or directory

http://archives.postgresql.org/pgsql-hackers/2009-09/msg01393.php
>> walreceiver shouldn't die on connection error, just to be restarted by
>> startup process. Can we add error handling a la bgwriter and have a
>> retry loop within walreceiver?

As the result of your current and previous comment, you mean that
walreceiver should always retry connecting to the primary after
a connection error occurs in PQgetXLogData/PQputXLogRecPtr, and
exit after the other errors occur? Though I'm not sure whether
we can determine the error type precisely.

> - It's possible to shut down master, change max_wal_senders to 0,
> restart and do an operation like CLUSTER which then skips WAL-logging.
> Then shutdown, change max_wal_senders back to non-zero. All this while
> the standby is running. Leads to a corrupt standby.

I've regarded this case as a restriction. But, how do you think
we should cope with it?

1. Restriction: only documentation is required?
2. Needs safe guard: - forbid the primary to perform such operations while the   standby is running? - emit PANIC error
onthe standby if the primary which lost sync   restarts? 
3. Full solution: automatic resync mechanism is required?

> I've also pushed a couple of small cosmetic changes to replication
> branch at git://git.postgresql.org/git/users/heikki/postgres.git

Your changes seem good.

I pulled and merged your changes into my repository:
  git://git.postgresql.org/git/users/fujii/postgres.git  branch: replication

And, I pushed the capability of replication of a backup history file
into the repository.

> I'll continue reviewing...

Thanks a lot!

Regards,

--
Fujii Masao
NIPPON TELEGRAPH AND TELEPHONE CORPORATION
NTT Open Source Software Center

Re: Streaming replication, some small issues

From

Greg Stark

Date:

08 December 2009, 12:10:21

On Tue, Dec 8, 2009 at 8:30 AM, Heikki Linnakangas
<heikki.linnakangas@enterprisedb.com> wrote:
> - It's possible to shut down master, change max_wal_senders to 0,
> restart and do an operation like CLUSTER which then skips WAL-logging.
> Then shutdown, change max_wal_senders back to non-zero. All this while
> the standby is running. Leads to a corrupt standby.

The same thing is possible with archived logs as well, no?

I suspect we should have a WAL record to say "unlogged operation
performed here" which a standby database would recognize and throw a
large warning up. The only reason I say warning is because it might be
reasonable if the relation is some temporary ETL table which isn't
needed in the standby. Perhaps if we note the relation affected we
could throw an error iff the standby is activated with the relation
still existing.

-- 
greg

Re: Streaming replication, some small issues

From

Heikki Linnakangas

Date:

08 December 2009, 12:12:34

Greg Stark wrote:
> On Tue, Dec 8, 2009 at 8:30 AM, Heikki Linnakangas
> <heikki.linnakangas@enterprisedb.com> wrote:
>> - It's possible to shut down master, change max_wal_senders to 0,
>> restart and do an operation like CLUSTER which then skips WAL-logging.
>> Then shutdown, change max_wal_senders back to non-zero. All this while
>> the standby is running. Leads to a corrupt standby.
> 
> The same thing is possible with archived logs as well, no?

Yeah, I think you're right.

> I suspect we should have a WAL record to say "unlogged operation
> performed here" which a standby database would recognize and throw a
> large warning up.

+1. Seems like a very simple solution.

--  Heikki Linnakangas EnterpriseDB   http://www.enterprisedb.com

Re: Streaming replication, some small issues

From

Fujii Masao

Date:

09 December 2009, 00:01:35

On Tue, Dec 8, 2009 at 9:05 PM, Heikki Linnakangas
<heikki.linnakangas@enterprisedb.com> wrote:
>> I suspect we should have a WAL record to say "unlogged operation
>> performed here" which a standby database would recognize and throw a
>> large warning up.
>
> +1. Seems like a very simple solution.

Sounds good. This is not just a problem of SR, so I'll implement it
as self-contained feature later.

Design:
- If relation is not temp and archiving (and streaming replication) is enabled, we log the "unlogged OP" record
includingrelfilenode of the relation.
 

- If "unlogged OP" record is found during archive recovery, we register its relfilenode to the hashtable which tracks
maybecorrupted relations. If the registered relfilenode is brandnew, we emit warning. Also, the log record indicating
"DROPTABLE" etc is found, we remove its relfilenode from the hashtable.
 

- When restartpoint occurs, we write all the registered relfilenodes to the flat file.

- At the end of archive recovery, if there is relfilenode in the hashtable, we emit FATAL error to prevent the server
frombeing brought up. XXX: But this might be too conservative. I believe that some people want to complete archive
recoveryeven if a relation is corrupted, and drop that relation after the server has been activated. So I'm going to
providenew recovery.conf parameter specifying whether to let archive recovery fail when some relations might be
corrupted.

Thought? Am I missing something?

Regards,

-- 
Fujii Masao
NIPPON TELEGRAPH AND TELEPHONE CORPORATION
NTT Open Source Software Center

Re: Streaming replication, some small issues

From

Tom Lane

Date:

09 December 2009, 00:12:43

Fujii Masao <masao.fujii@gmail.com> writes:
> Thought? Am I missing something?

This seems terribly overdesigned.  Just emit a warning when you see
the "unlogged op" record and have done.
        regards, tom lane

Re: Streaming replication, some small issues

From

Fujii Masao

Date:

09 December 2009, 00:52:12

On Wed, Dec 9, 2009 at 10:12 AM, Tom Lane <tgl@sss.pgh.pa.us> wrote:
> Fujii Masao <masao.fujii@gmail.com> writes:
>> Thought? Am I missing something?
>
> This seems terribly overdesigned.  Just emit a warning when you see
> the "unlogged op" record and have done.

Sounds quite simple. OK, I'll do so.

Regards,

--
Fujii Masao
NIPPON TELEGRAPH AND TELEPHONE CORPORATION
NTT Open Source Software Center

Re: Streaming replication, some small issues

From

Fujii Masao

Date:

09 December 2009, 08:25:19

On Wed, Dec 9, 2009 at 10:51 AM, Fujii Masao <masao.fujii@gmail.com> wrote:
> On Wed, Dec 9, 2009 at 10:12 AM, Tom Lane <tgl@sss.pgh.pa.us> wrote:
>> Fujii Masao <masao.fujii@gmail.com> writes:
>>> Thought? Am I missing something?
>>
>> This seems terribly overdesigned.  Just emit a warning when you see
>> the "unlogged op" record and have done.
>
> Sounds quite simple. OK, I'll do so.

Here is the patch:

- Write an XLOG UNLOGGED record in WAL if WAL-logging is skipped for only
  the reason that WAL archiving is not enabled and such record has not been
  written yet.

- Cause archive recovery to end if an XLOG UNLOGGED record is found during
  it.


I add this patch to the CommitFest 2010-01.

Regards,

--
Fujii Masao
NIPPON TELEGRAPH AND TELEPHONE CORPORATION
NTT Open Source Software Center

Attachment

log_unlogged_op.patch

Re: Streaming replication, some small issues

From

Fujii Masao

Date:

11 December 2009, 07:04:09

On Tue, Dec 8, 2009 at 5:30 PM, Heikki Linnakangas
<heikki.linnakangas@enterprisedb.com> wrote:
> - If a WAL file is not found in the master for some reason, standby goes
> into an infinite loop retrying it:
>
> ERROR:  could not read xlog records: FATAL:  could not open file
> "pg_xlog/000000010000000000000000" (log file 0, segment 0): No such file
> or directory

I also fixed this problem.
 git://git.postgresql.org/git/users/fujii/postgres.git branch: replication

Regards,

--
Fujii Masao
NIPPON TELEGRAPH AND TELEPHONE CORPORATION
NTT Open Source Software Center