Thread: automatic restore point

automatic restore point

From

"Yotsunaga, Naoki"

Date:

26 June 2018, 04:17:31

Hi, I'm a newbie to the hackers but I'd like to propose the "automatic restore point" feature. 
This feature automatically create backup label just before making a huge change to DB. It's useful when this change is
accidentalcase.
 

The following is a description of "automatic restore point".
【Background】
  When DBA's operation failure, for example DBA accidently drop table, the database is restored from the file system
backupand recovered by using time or transaction ID. The transaction ID is identified from WAL.
 
  But below are the following problems in using time or transaction ID.
   -Time
   ・Need to memorize the time of failure operation.
     (It is possible to identify the time from WAL. But it takes time and effort to identify the time.)
   ・Difficult to specify detail point.
   -Transaction ID
   ・It takes time and effort to identify the transaction ID.
  
  In order to solve the above problem, 
  I'd like propose a feature to implement automatic recording function of recovery point.
  
【Feature Description】
  In PostgreSQL, there is a backup control function "pg_create_restore_point()".
  User can create a named point for performing restore by using "pg_create_restore_point()".
  And user can recover by using the named point.
  So, execute "pg_create_restore_point()" automatically before executing the following command to create a point for
performingrestore(recovery point).
 
  The name of recovery point is the date and time when the command was executed.
  In this operation, target resource (database name, table name) and recovery point name are output as a message to
PostgreSQLserver log.
 
  
  - Commands wherein this feature can be appended  
   ・TRUNCATE
　 ・DROP
   ・DELETE(Without WHERE clause)
　 ・UPDATE(Without WHERE clause)
　 ・COPY FROM
　 
【How to use】
　 1) When executing the above command, identify the command and recovery point name that matches the resource
indicatingthe operation failure from the server log.
 
　    
　    ex)Message for executing TRUNCATE at 2018/6/1 12:30:30 (database name:testdb, table name:testtb)
　       set recovery point. operation = 'truncate'
　       database = 'testdb' relation = 'testtb' recovery_point_name = '2018-06-01-12:30:30'

   2) Implement PostgreSQL document '25 .3.4.Recovering Using a Continuous Archive Backup.'
     ※Set "recovery_target_name = 'recovery_point name'" at recovery.conf.

【Setting file】
  Set postgres.conf.
  auto_create_restore_point = on # Switch on/off automatic recording function of recovery point. The default value is
'off'.

So what do you think about it? Do you think is it useful?

Also, when recovering with the current specification, tables other than the returned table also return to the state of
thespecified recovery point. 
 
So, I’m looking for ways to recover only specific tables. Do you have any ideas?

------
Naoki Yotsunaga

Re: automatic restore point

From

"David G. Johnston"

Date:

26 June 2018, 04:33:40

On Mon, Jun 25, 2018 at 6:17 PM, Yotsunaga, Naoki <yotsunaga.naoki@jp.fujitsu.com> wrote:

So what do you think about it? Do you think is it useful?

The cost/benefit ratio seems low...

Also, when recovering with the current specification, tables other than the returned table also return to the state of the specified recovery point.
So, I’m looking for ways to recover only specific tables. Do you have any ideas?

...and this lowers it even further.

I'd rather spend effort making the initial execution of said commands less likely. Something like:

TRUNCATE table YES_I_REALLY_WANT_TO_DO_THIS;

which will fail if you don't add the keyword "YES_I..." to the end of the command and the system was setup to require it.

Or, less annoyingly:

BEGIN;

SET LOCAL perform_dangerous_action = true; --can we require local?

TRUNCATE table;

COMMIT;

David J.

Re: automatic restore point

From

Isaac Morland

Date:

26 June 2018, 06:01:06

On 25 June 2018 at 21:33, David G. Johnston <david.g.johnston@gmail.com> wrote:

On Mon, Jun 25, 2018 at 6:17 PM, Yotsunaga, Naoki <yotsunaga.naoki@jp.fujitsu.com> wrote:

So what do you think about it? Do you think is it useful?

I'd rather spend effort making the initial execution of said commands less likely. Something like:

TRUNCATE table YES_I_REALLY_WANT_TO_DO_THIS;

I think an optional setting making DELETE and UPDATE without a WHERE clause illegal would be handy. Obviously this would have to be optional for backward compatibility. Perhaps even just a GUC setting, with the intent being that one would set it in .psqlrc so that omitting the WHERE clause at the command line would just be a syntax error. If one actually does need to affect the whole table one can just say WHERE TRUE. For applications, which presumably have their SQL queries tightly controlled and pre-written anyway, this would most likely not be particularly useful.

Re: automatic restore point

From

Rui DeSousa

Date:

26 June 2018, 07:04:59

Why not use auto commit off in the session or .psqlrc file or begin and then use rollback?  \set AUTOCOMMIT off

What would be nice is if a syntax error didn’t abort the transaction when auto commit is off — being a bad typist.

Re: automatic restore point

From

Justin Pryzby

Date:

26 June 2018, 07:37:13

On Tue, Jun 26, 2018 at 12:04:59AM -0400, Rui DeSousa wrote:
> Why not use auto commit off in the session or .psqlrc file or begin and then use rollback?  \set AUTOCOMMIT off
> 
> What would be nice is if a syntax error didn’t abort the transaction when auto commit is off — being a bad typist.

I think you'll get that behavior with ON_ERROR_ROLLBACK.

Justin

Re: automatic restore point

From

Rui DeSousa

Date:

26 June 2018, 07:44:13

> On Jun 26, 2018, at 12:37 AM, Justin Pryzby <pryzby@telsasoft.com> wrote:
> 
> I think you'll get that behavior with ON_ERROR_ROLLBACK.
> 

Awesome. Thanks!

Re: automatic restore point

From

Michael Paquier

Date:

26 June 2018, 08:16:00

On Mon, Jun 25, 2018 at 11:01:06PM -0400, Isaac Morland wrote:
> I think an optional setting making DELETE and UPDATE without a WHERE clause
> illegal would be handy. Obviously this would have to be optional for
> backward compatibility. Perhaps even just a GUC setting, with the intent
> being that one would set it in .psqlrc so that omitting the WHERE clause at
> the command line would just be a syntax error. If one actually does need to
> affect the whole table one can just say WHERE TRUE. For applications, which
> presumably have their SQL queries tightly controlled and pre-written
> anyway, this would most likely not be particularly useful.

There was a patch doing exactly that which was discussed last year:
https://commitfest.postgresql.org/13/948/
https://www.postgresql.org/message-id/20160721045746.GA25043@fetter.org
What was proposed was rather limiting though, see my messages on the
thread.  Using a hook, that's simple enough to develop an extension
which does that.
--
Michael

Attachment

signature.asc

Re: automatic restore point

From

Michael Paquier

Date:

26 June 2018, 08:30:35

On Tue, Jun 26, 2018 at 01:17:31AM +0000, Yotsunaga, Naoki wrote:
> The following is a description of "automatic restore point".
> 【Background】
>   When DBA's operation failure, for example DBA accidently drop table,
> the database is restored from the file system backup and recovered by
> using time or transaction ID. The transaction ID is identified from
> WAL.
>
>   In order to solve the above problem,
>   I'd like propose a feature to implement automatic recording function
>   of recovery point.

There is also recovery_target_lsn which is new as of v10.  This
parameter is way better than having to track down time or XID, which is
a reason why I developped it.  Please note that this is also one of the
reasons why it is possible to delay WAL replays on standbys, so as an
operator has room to fix such operator errors.  Having of course cold
backups with a proper WAL archive and a correct retention policy never
hurts.

> 【Setting file】
>   Set postgres.conf.
>   auto_create_restore_point = on # Switch on/off automatic recording
>   function of recovery point. The default value is 'off'.
>
> So what do you think about it? Do you think is it useful?

So basically what you are looking for here is a way to enforce a restore
point to be created depending on a set of pre-defined conditions?  How
would you define and choose those?

> Also, when recovering with the current specification, tables other
> than the returned table also return to the state of the specified
> recovery point.
> So, I’m looking for ways to recover only specific tables. Do you have
> any ideas?

Why not using the utility hook which filters out for commands you'd
like to forbid, in this case TRUNCATE or a DROP TABLE on a given
relation?  Or why not simply using an event trigger at your application
level so as you can actually *prevent* the error to happen first?  With
the last option you don't have to write C code, but this would not
filter TRUNCATE.  In short, what you propose looks over-complicated to
me and there are options on the table which allow the problem you are
trying to solve to not happen at all.  You could also use the utility
hook to log or register somewhere hte XID/time/LSN associated to a given
command and then use it as your restore point.  This could also happen
out of core.
--
Michael

Attachment

signature.asc

RE: automatic restore point

From

"Yotsunaga, Naoki"

Date:

03 July 2018, 04:06:31

Hi. Thanks for comments.

Explanation of the background of the function proposal was inadequate.
So, I explain again.

I assume the following situation.
User needs to make a quick, seemingly simple fix to an important production database. User composes the query, gives it
anonce-over, and lets it run. Seconds later user realizes that user forgot the WHERE clause, dropped the wrong table,
ormade another serious mistake, and interrupts the query, but the damage has been done.

Also user did not record the time and did not look at a lsn position.

Certainly, I thought about reducing the possibility of executing the wrong command, but I thought that the possibility
couldnot be completely eliminated.

So I proposed the “automatic restore point”.
With this function, user can recover quickly and reliably even if you perform a failure operation.

> I'd rather spend effort making the initial execution of said commands less likely.  
  I think that the function to prohibit DELETE and UPDATE without a WHERE clause in the later response is good way.
  But I think that it is impossible to completely eliminate the failure of the other commands.
  For example, drop the wrong table.

-----
Naoki Yotsunaga

-----Original Message-----
From: Michael Paquier [mailto:michael@paquier.xyz] 
Sent: Tuesday, June 26, 2018 2:16 PM
To: Isaac Morland <isaac.morland@gmail.com>
Cc: David G. Johnston <david.g.johnston@gmail.com>; Yotsunaga, Naoki/四ツ永 直輝 <yotsunaga.naoki@jp.fujitsu.com>; Postgres
hackers<pgsql-hackers@postgresql.org>

Subject: Re: automatic restore point

On Mon, Jun 25, 2018 at 11:01:06PM -0400, Isaac Morland wrote:
> I think an optional setting making DELETE and UPDATE without a WHERE 
> clause illegal would be handy. Obviously this would have to be 
> optional for backward compatibility. Perhaps even just a GUC setting, 
> with the intent being that one would set it in .psqlrc so that 
> omitting the WHERE clause at the command line would just be a syntax 
> error. If one actually does need to affect the whole table one can 
> just say WHERE TRUE. For applications, which presumably have their SQL 
> queries tightly controlled and pre-written anyway, this would most likely not be particularly useful.

There was a patch doing exactly that which was discussed last year:
https://commitfest.postgresql.org/13/948/
https://www.postgresql.org/message-id/20160721045746.GA25043@fetter.org
What was proposed was rather limiting though, see my messages on the thread.  Using a hook, that's simple enough to
developan extension which does that.

--
Michael

RE: automatic restore point

From

"Yotsunaga, Naoki"

Date:

03 July 2018, 04:07:41

Hi. Thanks for comments.

>There is also recovery_target_lsn which is new as of v10.
 In this method, it is necessary to look at a lsn position before operating.
 But I assume the user who did not look it before operating.
 So I think that this method is not appropriate.

> So basically what you are looking for here is a way to enforce a restore point to be created depending on a set of
pre-definedconditions?  
 
>How would you define and choose those?
 I understand that I was asked how to set up a command to apply this function. 
 Ex) DROP = on 
     TRUNCATE = off
 Is my interpretation right?
 If my interpretation is correct, all the above commands will be applied.
 When this function is turned on, this function works when all the above commands are executed.

-------
Naoki Yotsynaga
-----Original Message-----
From: Michael Paquier [mailto:michael@paquier.xyz] 
Sent: Tuesday, June 26, 2018 2:31 PM
To: Yotsunaga, Naoki/四ツ永 直輝 <yotsunaga.naoki@jp.fujitsu.com>
Cc: Postgres hackers <pgsql-hackers@postgresql.org>
Subject: Re: automatic restore point

On Tue, Jun 26, 2018 at 01:17:31AM +0000, Yotsunaga, Naoki wrote:
> The following is a description of "automatic restore point".
> 【Background】
>   When DBA's operation failure, for example DBA accidently drop table, 
> the database is restored from the file system backup and recovered by 
> using time or transaction ID. The transaction ID is identified from 
> WAL.
>   
>   In order to solve the above problem, 
>   I'd like propose a feature to implement automatic recording function
>   of recovery point.

There is also recovery_target_lsn which is new as of v10.  This parameter is way better than having to track down time
orXID, which is a reason why I developped it.  Please note that this is also one of the reasons why it is possible to
delayWAL replays on standbys, so as an operator has room to fix such operator errors.  Having of course cold backups
witha proper WAL archive and a correct retention policy never hurts.
 

> 【Setting file】
>   Set postgres.conf.
>   auto_create_restore_point = on # Switch on/off automatic recording
>   function of recovery point. The default value is 'off'.
> 
> So what do you think about it? Do you think is it useful?

So basically what you are looking for here is a way to enforce a restore point to be created depending on a set of
pre-definedconditions?  How would you define and choose those?
 

> Also, when recovering with the current specification, tables other 
> than the returned table also return to the state of the specified 
> recovery point.
> So, I’m looking for ways to recover only specific tables. Do you have 
> any ideas?

Why not using the utility hook which filters out for commands you'd like to forbid, in this case TRUNCATE or a DROP
TABLEon a given relation?  Or why not simply using an event trigger at your application level so as you can actually
*prevent*the error to happen first?  With the last option you don't have to write C code, but this would not filter
TRUNCATE. In short, what you propose looks over-complicated to me and there are options on the table which allow the
problemyou are trying to solve to not happen at all.  You could also use the utility hook to log or register somewhere
hteXID/time/LSN associated to a given command and then use it as your restore point.  This could also happen out of
core.
--
Michael

Re: automatic restore point

From

Michael Paquier

Date:

03 July 2018, 04:16:50

On Tue, Jul 03, 2018 at 01:07:41AM +0000, Yotsunaga, Naoki wrote:
>> There is also recovery_target_lsn which is new as of v10.
>  In this method, it is necessary to look at a lsn position before operating.
>  But I assume the user who did not look it before operating.
>  So I think that this method is not appropriate.

You should avoid top-posting on the mailing lists, this breaks the
consistency of the thread.

>> So basically what you are looking for here is a way to enforce a
>> restore point to be created depending on a set of pre-defined
>> conditions?  How would you define and choose those?
>
> I understand that I was asked how to set up a command to apply this function.
>  Ex) DROP = on
>      TRUNCATE = off
>  Is my interpretation right?
>  If my interpretation is correct, all the above commands will be
>  applied.
>  When this function is turned on, this function works when all the
>  above commands are executed.

Yeah, but based on which factors are you able to define that such
conditions are enough to say that this feature is fully-compliant with
user's need, and how can you be sure that this is not going to result in
an additional maintenance burden if you need to define a new set of
conditions in the future.  For example an operator has issued a costly
ALTER TABLE which causes a full table rewrite, which could be also an
operation that you would like to prevent.  Having a set of GUCs which
define such low-level behavior is not really user-friendly.
--
Michael

Attachment

signature.asc

Re: automatic restore point

From

Michael Paquier

Date:

03 July 2018, 04:21:59

On Tue, Jul 03, 2018 at 01:06:31AM +0000, Yotsunaga, Naoki wrote:
>> I'd rather spend effort making the initial execution of said commands
>> less likely.
>
> I think that the function to prohibit DELETE and UPDATE without a
> WHERE clause in the later response is good way.

This has popped up already in the lists in the past.

> But I think that it is impossible to completely eliminate the failure
> of the other commands.  For example, drop the wrong table.

This kind of thing is heavily application-dependent.  For example, you
would likely not care if an operator, who has newly-joined the team in
charge of the maintenance of this data, drops unfortunately a table
which includes logs from 10 years back, and you would very likely care
about a table dropped which has user's login data.  My point is that you
need to carefully design the shape of the configuration you would use,
so as any application's admin would be able to cope with it, for example
allowing exclusion filters with regular expressions could be a good idea
to dig into.  And also you need to think about it so as it is backward
compatible.
--
Michael

Attachment

signature.asc

Re: automatic restore point

From

Jaime Casanova

Date:

03 July 2018, 05:05:57

On Mon, 2 Jul 2018 at 20:07, Yotsunaga, Naoki
<yotsunaga.naoki@jp.fujitsu.com> wrote:
>
> Hi. Thanks for comments.
>
> Explanation of the background of the function proposal was inadequate.
> So, I explain again.
>
> I assume the following situation.
> User needs to make a quick, seemingly simple fix to an important production database. User composes the query, gives
itan once-over, and lets it run. Seconds later user realizes that user forgot the WHERE clause, dropped the wrong
table,or made another serious mistake, and interrupts the query, but the damage has been done. 
> Also user did not record the time and did not look at a lsn position.
>

Thinking on Michael's suggestion of using event triggers, you can
create an event trigger to run pg_create_restore_point() on DROP,
here's a simple example of how that should like:
https://www.postgresql.org/docs/current/static/functions-event-triggers.html

You can also create a normal trigger BEFORE TRUNCATE to create a
restore point just before running the TRUNCATE command.

Those would run *on the background* (you don't need to call them
manually), you can use them right now, won't affect performance for
people not wanting this "functionality".

BTW, Michael's suggestion also included the idea of recording
xid/time/lsn which can be done through triggers too

--
Jaime Casanova                      www.2ndQuadrant.com
PostgreSQL Development, 24x7 Support, Remote DBA, Training & Services

RE: automatic restore point

From

"Yotsunaga, Naoki"

Date:

06 July 2018, 11:05:03

>-----Original Message-----
>From: Michael Paquier [mailto:michael@paquier.xyz] 
>Sent: Tuesday, July 3, 2018 10:22 AM

>This kind of thing is heavily application-dependent.  For example, you would likely not care if an operator, who has
newly-joinedthe team in >charge of the maintenance of this data, drops unfortunately a table which includes logs from
10years back, and you would very likely care >about a table dropped which has user's login data.  My point is that you
needto carefully design the shape of the configuration you would use, >so as any application's admin would be able to
copewith it, for example allowing exclusion filters with regular expressions could be a good >idea to dig into.  And
alsoyou need to think about it so as it is backward compatible.
 

Thanks for comments.

Does that mean that the application (user) is interested in which table?
For example, there are two tables A. It is ok even if one table disappears, but it is troubled if another table B
disappears.So, when the table B is dropped, automatic restore point works. In the table A, automatic restore point does
notwork.
 
So, it is difficult to implement that automatic restore point in postgresql by default.
Is my interpretation right?

---
Naoki Yotsunaga

RE: automatic restore point

From

"Yotsunaga, Naoki"

Date:

06 July 2018, 11:10:44

>-----Original Message-----
>From: Jaime Casanova [mailto:jaime.casanova@2ndquadrant.com] 
>Sent: Tuesday, July 3, 2018 11:06 AM

>Thinking on Michael's suggestion of using event triggers, you can create an event >trigger to run
pg_create_restore_point()on DROP, here's a simple example of how >that should like:
 
>https://www.postgresql.org/docs/current/static/functions-event-triggers.html

>You can also create a normal trigger BEFORE TRUNCATE to create a restore point just >before running the TRUNCATE
command.

Thanks for comments.
I was able to understand.

---
Naoki Yotsunaga

RE: automatic restore point

From

"Yotsunaga, Naoki"

Date:

11 July 2018, 09:11:01

>-----Original Message-----
>From: Yotsunaga, Naoki [mailto:yotsunaga.naoki@jp.fujitsu.com] 
>Sent: Friday, July 6, 2018 5:05 PM

>Does that mean that the application (user) is interested in which table? 
>For example, there are two tables A. It is ok even if one table disappears, but it is troubled if another table B
disappears.So, when the table B is dropped, automatic restore point works. In the table A, automatic restore point does
notwork. 
 
>So, it is difficult to implement that automatic restore point in postgresql by default. 
>Is my interpretation right?

I want to hear about the following in addition to the previous comment.
What would you do if your customer dropped the table and asked you to restore it?
Everyone is thinking what to do to avoid operation failure, but I’m thinking about after the user’s failure.
What I mean is that not all users will set up in advance.
For example, if you make the settings described in the manual, you will not drop the table by operation failure.
However,not all users do that setting.
 
For such users, I think that it is necessary to have a function to easily restore data after failing operation without
settinganything in advance.
 
So I proposed this function.

---
Naoki Yotsunaga

Re: automatic restore point

From

Michael Paquier

Date:

11 July 2018, 09:33:36

On Wed, Jul 11, 2018 at 06:11:01AM +0000, Yotsunaga, Naoki wrote:
> I want to hear about the following in addition to the previous
> comment.   What would you do if your customer dropped the table and asked you to
> restore it?

I can think of 4 reasons on top of my mind:
1) Don't do that.
2) Implement safe-guards using utility hooks or event triggers.
3) Have a delayed standby if you don't believe that your administrators
are skilled enough in case.
4) Have backups and a WAL archive.

> Everyone is thinking what to do to avoid operation failure, but I’m
> thinking about after the user’s failure.
> What I mean is that not all users will set up in advance.
> For example, if you make the settings described in the manual, you
> will not drop the table by operation failure. However, not all users
> do that setting.
> For such users, I think that it is necessary to have a function to
> easily restore data after failing operation without setting anything
> in advance. So I proposed this function.

Well, if you put in place correct measures from the start you would not
have problems.  It seems to me that there is no point in implementing
something which is a solution for a very narrow case, where the user has
shot his own foot to begin with.  Having backups anyway is mandatory by
the way, standby replicas are not backups.
--
Michael

Attachment

signature.asc

RE: automatic restore point

From

"Yotsunaga, Naoki"

Date:

13 July 2018, 11:16:00

>-----Original Message-----
>From: Michael Paquier [mailto:michael@paquier.xyz] 
>Sent: Wednesday, July 11, 2018 3:34 PM

>Well, if you put in place correct measures from the start you would not have problems.  
>It seems to me that there is no point in implementing something which is a solution for a very narrow case, where the
userhas shot his own foot to begin with. 
 
>Having backups anyway is mandatory by the way, standby replicas are not backups.

I think that the Undo function of AWS and Oracle's Flashback function are to save such users, and it is a function to
preventhuman error.
 
So, how about postgres implementing such a function?
 
Also, as an approach to achieving the goal, I thought about outputting lsn to the server log when a specific command
wasexecuted.
 
 
I do not think the source code of postgres will be complicated when implementing this function.
Do you feel it is too complicated?

-------
Naoki Yotsunaga

Re: automatic restore point

From

Michael Paquier

Date:

13 July 2018, 15:18:46

On Fri, Jul 13, 2018 at 08:16:00AM +0000, Yotsunaga, Naoki wrote:
> Do you feel it is too complicated?

In short, yes.
--
Michael

Attachment

signature.asc

RE: automatic restore point

From

"Yotsunaga, Naoki"

Date:

31 August 2018, 11:14:44

-----Original Message-----
From: Yotsunaga, Naoki [mailto:yotsunaga.naoki@jp.fujitsu.com] 
Sent: Tuesday, June 26, 2018 10:18 AM
To: Postgres hackers <pgsql-hackers@postgresql.org>
Subject: automatic restore point

Hi, I attached a patch to output the LSN before execution to the server log when executing a specific command and
accidentallyerasing data.

A detailed background has been presented before.
In short explain: After the DBA's operation failure and erases the data, it is necessary to perform PITR immediately.
Since it is not possible to easily obtain information for doing the current PITR, I would like to solve it.

The specification has changed from the first proposal.
-Target command
 DROP TABLE
 TRUNCATE

-Setting file
 postgresql.conf
 log_recovery_points = on #default value is 'off'. When the switch is turned on, LSN is output to the server log when
DROPTABLE, TRUNCATE is executed.

-How to use
1) When executing the above command, identify the command and recovery point that matches the resource indicating the
operationfailure from the server log.    

ex) LOG:  recovery_point_lsn: 0/201BB70
　　　　   STATEMENT:  drop table test ;
 2) Implement PostgreSQL document '25 .3.4.Recovering Using a Continuous Archive Backup.'
    *Set "recovery_target_lsn = 'recovery_point_lsn'" at recovery.conf.

Although there was pointed out that the source becomes complicated in the past, we could add the function by adding
about20 steps.

What do you think about it? Do you think is it useful?
------
Naoki Yotsunaga

Attachment

automatic_restore_point.patch

Re: automatic restore point

From

Pavel Stehule

Date:

31 August 2018, 12:49:56

2018-08-31 10:14 GMT+02:00 Yotsunaga, Naoki <yotsunaga.naoki@jp.fujitsu.com>:

-----Original Message-----
From: Yotsunaga, Naoki [mailto:yotsunaga.naoki@jp.fujitsu.com]
Sent: Tuesday, June 26, 2018 10:18 AM
To: Postgres hackers <pgsql-hackers@postgresql.org>
Subject: automatic restore point

Hi, I attached a patch to output the LSN before execution to the server log when executing a specific command and accidentally erasing data.

A detailed background has been presented before.
In short explain: After the DBA's operation failure and erases the data, it is necessary to perform PITR immediately.
Since it is not possible to easily obtain information for doing the current PITR, I would like to solve it.

The specification has changed from the first proposal.
-Target command
DROP TABLE
TRUNCATE

-Setting file
postgresql.conf
log_recovery_points = on #default value is 'off'. When the switch is turned on, LSN is output to the server log when DROP TABLE, TRUNCATE is executed.

-How to use
1) When executing the above command, identify the command and recovery point that matches the resource indicating the operation failure from the server log.
ex) LOG: recovery_point_lsn: 0/201BB70
　　　　 STATEMENT: drop table test ;
2) Implement PostgreSQL document '25 .3.4.Recovering Using a Continuous Archive Backup.'
*Set "recovery_target_lsn = 'recovery_point_lsn'" at recovery.conf.

Although there was pointed out that the source becomes complicated in the past, we could add the function by adding about 20 steps.

What do you think about it? Do you think is it useful?

I think it is useful and simple.

Regards

Pavel

------
Naoki Yotsunaga

RE: automatic restore point

From

"Yotsunaga, Naoki"

Date:

06 September 2018, 03:16:14

-----Original Message-----
From: Yotsunaga, Naoki [mailto:yotsunaga.naoki@jp.fujitsu.com] 
Sent: Tuesday, June 26, 2018 10:18 AM
To: Postgres hackers <pgsql-hackers@postgresql.org>
Subject: automatic restore point

>Hi, I attached a patch to output the LSN before execution to the server log >when executing a specific command and
accidentallyerasing data.
 

Since there was an error in the attached patch, I attached the modified patch.

------
Naoki Yotsunaga

Attachment

automatic_restore_point_v2.patch

Re: automatic restore point

From

Peter Eisentraut

Date:

28 September 2018, 22:13:17

On 06/09/2018 02:16, Yotsunaga, Naoki wrote:
> -----Original Message-----
> From: Yotsunaga, Naoki [mailto:yotsunaga.naoki@jp.fujitsu.com] 
> Sent: Tuesday, June 26, 2018 10:18 AM
> To: Postgres hackers <pgsql-hackers@postgresql.org>
> Subject: automatic restore point
> 
>> Hi, I attached a patch to output the LSN before execution to the server log >when executing a specific command and
accidentallyerasing data.

> 
> Since there was an error in the attached patch, I attached the modified patch.

I think this should be done using event triggers.  Right now, you just
have it hardcoded to TRUNCATE and DROP TABLE, which seems somewhat
arbitrary.  With event triggers, you have the full flexibility to do
what you want.  You can pick which commands to apply it to, you can log
the LSN, you can create restore points, etc.

-- 
Peter Eisentraut              http://www.2ndQuadrant.com/
PostgreSQL Development, 24x7 Support, Remote DBA, Training & Services

Re: automatic restore point

From

Michael Paquier

Date:

30 September 2018, 15:45:10

On Fri, Sep 28, 2018 at 09:13:17PM +0200, Peter Eisentraut wrote:
> I think this should be done using event triggers.  Right now, you just
> have it hardcoded to TRUNCATE and DROP TABLE, which seems somewhat
> arbitrary.  With event triggers, you have the full flexibility to do
> what you want.  You can pick which commands to apply it to, you can log
> the LSN, you can create restore points, etc.

I still unfortunately don't see what this patch brings more that you
cannot do.  Event triggers are particularly useful in this prospective,
so I am marking the patch as rejected.
--
Michael

Attachment

signature.asc

Re: automatic restore point

From

Alvaro Herrera

Date:

01 October 2018, 06:34:23

On 2018-Sep-30, Michael Paquier wrote:

> On Fri, Sep 28, 2018 at 09:13:17PM +0200, Peter Eisentraut wrote:
> > I think this should be done using event triggers.  Right now, you just
> > have it hardcoded to TRUNCATE and DROP TABLE, which seems somewhat
> > arbitrary.  With event triggers, you have the full flexibility to do
> > what you want.  You can pick which commands to apply it to, you can log
> > the LSN, you can create restore points, etc.
> 
> I still unfortunately don't see what this patch brings more that you
> cannot do.  Event triggers are particularly useful in this prospective,
> so I am marking the patch as rejected.

I don't see it as clear cut as all that ... particularly considering
that a useful event trigger runs *after* the DDL command in question has
already written all its WAL, so such a restore point would be completely
useless.  (Or are ddl_command_start event triggers useful enough?)

-- 
Álvaro Herrera                https://www.2ndQuadrant.com/
PostgreSQL Development, 24x7 Support, Remote DBA, Training & Services

Re: automatic restore point

From

Alvaro Herrera

Date:

01 October 2018, 06:35:46

On 2018-Jul-02, Jaime Casanova wrote:

> You can also create a normal trigger BEFORE TRUNCATE to create a
> restore point just before running the TRUNCATE command.

On every single table?

-- 
Álvaro Herrera                https://www.2ndQuadrant.com/
PostgreSQL Development, 24x7 Support, Remote DBA, Training & Services

Re: automatic restore point

From

Peter Eisentraut

Date:

01 October 2018, 17:43:09

On 01/10/2018 05:34, Alvaro Herrera wrote:
> I don't see it as clear cut as all that ... particularly considering
> that a useful event trigger runs *after* the DDL command in question has
> already written all its WAL, so such a restore point would be completely
> useless.  (Or are ddl_command_start event triggers useful enough?)

The following appears to work:

CREATE FUNCTION evt_automatic_restart_point() RETURNS event_trigger
LANGUAGE plpgsql
AS $$
BEGIN
  PERFORM pg_create_restore_point(tg_tag);
END
$$;

CREATE EVENT TRIGGER automatic_restart_point ON ddl_command_start
    EXECUTE PROCEDURE evt_automatic_restart_point();

Are there any concerns about this?

-- 
Peter Eisentraut              http://www.2ndQuadrant.com/
PostgreSQL Development, 24x7 Support, Remote DBA, Training & Services

Re: automatic restore point

From

Alvaro Herrera

Date:

02 October 2018, 01:06:32

On 2018-Oct-01, Peter Eisentraut wrote:

> The following appears to work:
> 
> CREATE FUNCTION evt_automatic_restart_point() RETURNS event_trigger
> LANGUAGE plpgsql
> AS $$
> BEGIN
>   PERFORM pg_create_restore_point(tg_tag);
> END
> $$;
> 
> CREATE EVENT TRIGGER automatic_restart_point ON ddl_command_start
>     EXECUTE PROCEDURE evt_automatic_restart_point();
> 
> Are there any concerns about this?

Mumble.

Re-reading the implementation in standard_ProcessUtility, I wonder what
is PROCESS_UTILITY_QUERY_NONATOMIC -- there seems to be a maze through
SPI that determines whether this flag is set or not, which could affect
whether the event trigger is useful.  Are utilities executed through a
procedure detected by event triggers?  If so, then this mechanism seems
good enough to me.  But if there's a way to sneak utility commands (DROP
TABLE) without the event trigger being invoked, then no (and in that
case maybe it's just a bug in procedures and we can still not include
this patch).

(Grepping for "atomic" is unsurprisingly unhelpful.  I really wish we
didn't use plain words as struct member names ...)

On the TRUNCATE case it's a bit annoying that you can't do it with event
triggers -- you have to create individual regular triggers on truncate
for each table (so you probably need yet another event trigger that
creates such triggers).

-- 
Álvaro Herrera                https://www.2ndQuadrant.com/
PostgreSQL Development, 24x7 Support, Remote DBA, Training & Services

Re: automatic restore point

From

Peter Eisentraut

Date:

05 October 2018, 16:26:27

On 02/10/2018 00:06, Alvaro Herrera wrote:
> Re-reading the implementation in standard_ProcessUtility, I wonder what
> is PROCESS_UTILITY_QUERY_NONATOMIC -- there seems to be a maze through
> SPI that determines whether this flag is set or not, which could affect
> whether the event trigger is useful.  Are utilities executed through a
> procedure detected by event triggers?  If so, then this mechanism seems
> good enough to me.  But if there's a way to sneak utility commands (DROP
> TABLE) without the event trigger being invoked, then no (and in that
> case maybe it's just a bug in procedures and we can still not include
> this patch).

It looked for a moment that

    isCompleteQuery = (context <= PROCESS_UTILITY_QUERY)

in ProcessUtilitySlow() might be a problem, since that omits
PROCESS_UTILITY_QUERY_NONATOMIC, but it's not actually a problem, since
the commands that run this way (CALL and SET from PL/pgSQL) don't have
event triggers.  But anyway, I propose the attached patch to rephrase
that.  Also some tests that show it all works as expected.

> On the TRUNCATE case it's a bit annoying that you can't do it with event
> triggers -- you have to create individual regular triggers on truncate
> for each table (so you probably need yet another event trigger that
> creates such triggers).

I don't see why we couldn't add event triggers on TRUNCATE or other
commands such as DELETE.

-- 
Peter Eisentraut              http://www.2ndQuadrant.com/
PostgreSQL Development, 24x7 Support, Remote DBA, Training & Services

Attachment

Re: automatic restore point

From

Peter Eisentraut

Date:

10 October 2018, 21:09:23

On 05/10/2018 15:26, Peter Eisentraut wrote:
> It looked for a moment that
> 
>     isCompleteQuery = (context <= PROCESS_UTILITY_QUERY)
> 
> in ProcessUtilitySlow() might be a problem, since that omits
> PROCESS_UTILITY_QUERY_NONATOMIC, but it's not actually a problem, since
> the commands that run this way (CALL and SET from PL/pgSQL) don't have
> event triggers.  But anyway, I propose the attached patch to rephrase
> that.  Also some tests that show it all works as expected.

I have committed these to master.

-- 
Peter Eisentraut              http://www.2ndQuadrant.com/
PostgreSQL Development, 24x7 Support, Remote DBA, Training & Services