Thread: [HACKERS] Change in "policy" on dump ordering?
AFAICT in older versions only object types that absolutely had to wait for DO_POST_DATA_BOUNDARY would do so. More recently though, objects are being added after that (presumably because it's easier than renumbering everything in dbObjectTypePriority). Is this change a good or bad idea? Should there be an official guide for where new things go? -- Jim Nasby, Data Architect, Blue Treble Consulting, Austin TX Experts in Analytics, Data Architecture and PostgreSQL Data in Trouble? Get it in Treble! http://BlueTreble.com 855-TREBLE2 (855-873-2532)
On 2/21/17 14:58, Jim Nasby wrote: > AFAICT in older versions only object types that absolutely had to wait > for DO_POST_DATA_BOUNDARY would do so. More recently though, objects are > being added after that (presumably because it's easier than renumbering > everything in dbObjectTypePriority). Is there any specific assignment that you have concerns about? > Is this change a good or bad idea? Should there be an official guide for > where new things go? The comment above dbObjectTypePriority explains it, doesn't it? -- Peter Eisentraut http://www.2ndQuadrant.com/ PostgreSQL Development, 24x7 Support, Remote DBA, Training & Services
On 2/21/17 4:25 PM, Peter Eisentraut wrote: > On 2/21/17 14:58, Jim Nasby wrote: >> AFAICT in older versions only object types that absolutely had to wait >> for DO_POST_DATA_BOUNDARY would do so. More recently though, objects are >> being added after that (presumably because it's easier than renumbering >> everything in dbObjectTypePriority). > > Is there any specific assignment that you have concerns about? Originally, no, but reviewing the list again I'm kindof wondering about DO_DEFAULT_ACL, especially since the acl code in pg_dump looks at defaults as part of what removes the need to explicitly dump permissions. I'm also wondering if DO_POLICY could potentially affect matviews? Actually, I think matviews really need to be the absolute last thing. What if you had a matview that referenced publications or subscriptions? I'm guessing that would be broken right now. >> Is this change a good or bad idea? Should there be an official guide for >> where new things go? > > The comment above dbObjectTypePriority explains it, doesn't it? Not really; it just makes reference to needing to be in-sync with pg_dump.c. My concern is that clearly people went to lengths in the past to put everything possible before DO_PRE_DATA_BOUNDARY (ie, text search and FDW) but most recently added stuff has gone after DO_POST_DATA_BOUNDARY, even though there's no reason it couldn't be pre-data. That's certainly a change, and I suspect it's not intentional (other than it's obviously less work to stick stuff at the end, but that could be fixed by having an array of the actual enum values and just having pg_dump sort that when it starts). -- Jim Nasby, Data Architect, Blue Treble Consulting, Austin TX Experts in Analytics, Data Architecture and PostgreSQL Data in Trouble? Get it in Treble! http://BlueTreble.com 855-TREBLE2 (855-873-2532)
On 2/22/17 00:55, Jim Nasby wrote: > Originally, no, but reviewing the list again I'm kindof wondering about > DO_DEFAULT_ACL, especially since the acl code in pg_dump looks at > defaults as part of what removes the need to explicitly dump > permissions. I'm also wondering if DO_POLICY could potentially affect > matviews? I'm not sure about the details of these, but I know that there are reasons why the permissions stuff is pretty late in the dump in general. > Actually, I think matviews really need to be the absolute last thing. > What if you had a matview that referenced publications or subscriptions? > I'm guessing that would be broken right now. I'm not sure what you have in mind here. Publications and subscriptions don't interact with materialized views, so the relative order doesn't really matter. > Not really; it just makes reference to needing to be in-sync with > pg_dump.c. My concern is that clearly people went to lengths in the past > to put everything possible before DO_PRE_DATA_BOUNDARY (ie, text search > and FDW) but most recently added stuff has gone after > DO_POST_DATA_BOUNDARY, even though there's no reason it couldn't be > pre-data. That's certainly a change, and I suspect it's not intentional I think the recent additions actually were intentional, although one could debate the intentions. ;-) -- Peter Eisentraut http://www.2ndQuadrant.com/ PostgreSQL Development, 24x7 Support, Remote DBA, Training & Services
On 2/22/17 8:00 AM, Peter Eisentraut wrote: >> Actually, I think matviews really need to be the absolute last thing. >> What if you had a matview that referenced publications or subscriptions? >> I'm guessing that would be broken right now. > I'm not sure what you have in mind here. Publications and subscriptions > don't interact with materialized views, so the relative order doesn't > really matter. CREATE MATERIALIZED VIEW tmv AS SELECT * FROM pg_subscription; SELECT 0 IOW, you can create matviews that depend on any other table/view/matview, but right now if the matview includes certain items it will mysteriously end up empty post-restore. -- Jim Nasby, Data Architect, Blue Treble Consulting, Austin TX Experts in Analytics, Data Architecture and PostgreSQL Data in Trouble? Get it in Treble! http://BlueTreble.com 855-TREBLE2 (855-873-2532)
On 2/22/17 10:14, Jim Nasby wrote: > CREATE MATERIALIZED VIEW tmv AS SELECT * FROM pg_subscription; > SELECT 0 > > IOW, you can create matviews that depend on any other > table/view/matview, but right now if the matview includes certain items > it will mysteriously end up empty post-restore. Yes, by that logic matview refresh should always be last. -- Peter Eisentraut http://www.2ndQuadrant.com/ PostgreSQL Development, 24x7 Support, Remote DBA, Training & Services
On 2/22/17 12:29 PM, Peter Eisentraut wrote: > On 2/22/17 10:14, Jim Nasby wrote: >> CREATE MATERIALIZED VIEW tmv AS SELECT * FROM pg_subscription; >> SELECT 0 >> >> IOW, you can create matviews that depend on any other >> table/view/matview, but right now if the matview includes certain items >> it will mysteriously end up empty post-restore. > > Yes, by that logic matview refresh should always be last. Patches for head attached. RLS was the first item added after DO_REFRESH_MATVIEW, which was added in 9.5. So if we want to treat this as a bug, they'd need to be patched as well, which is a simple matter of swapping 33 and 34. -- Jim Nasby, Data Architect, Blue Treble Consulting, Austin TX Experts in Analytics, Data Architecture and PostgreSQL Data in Trouble? Get it in Treble! http://BlueTreble.com 855-TREBLE2 (855-873-2532) -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers
Attachment
Jim, * Jim Nasby (Jim.Nasby@BlueTreble.com) wrote: > On 2/22/17 12:29 PM, Peter Eisentraut wrote: > >On 2/22/17 10:14, Jim Nasby wrote: > >>CREATE MATERIALIZED VIEW tmv AS SELECT * FROM pg_subscription; > >>SELECT 0 > >> > >>IOW, you can create matviews that depend on any other > >>table/view/matview, but right now if the matview includes certain items > >>it will mysteriously end up empty post-restore. > > > >Yes, by that logic matview refresh should always be last. > > Patches for head attached. > > RLS was the first item added after DO_REFRESH_MATVIEW, which was > added in 9.5. So if we want to treat this as a bug, they'd need to > be patched as well, which is a simple matter of swapping 33 and 34. Can you clarify what misbehavior there is with RLS that would warrent this being a bug..? I did consider where in the dump I thought policies should go, though I may certainly have overlooked something. Thanks! Stephen
Hi, On Wed, Feb 22, 2017 at 05:24:49PM -0600, Jim Nasby wrote: > On 2/22/17 12:29 PM, Peter Eisentraut wrote: > >On 2/22/17 10:14, Jim Nasby wrote: > >>CREATE MATERIALIZED VIEW tmv AS SELECT * FROM pg_subscription; > >>SELECT 0 > >> > >>IOW, you can create matviews that depend on any other > >>table/view/matview, but right now if the matview includes certain items > >>it will mysteriously end up empty post-restore. > > > >Yes, by that logic matview refresh should always be last. Glad to hear - I vaguely remember this coming up in a different thread some time ago, and I though you (Peter) had reservations about moving things past after the ACL step, but I couldn't find this thread now anymore, only https://www.postgresql.org/message-id/11166.1424357659%40sss.pgh.pa.us > Patches for head attached. Yay. > diff --git a/src/bin/pg_dump/pg_dump_sort.c b/src/bin/pg_dump/pg_dump_sort.c > index ea643397ba..708a47f3cb 100644 > --- a/src/bin/pg_dump/pg_dump_sort.c > +++ b/src/bin/pg_dump/pg_dump_sort.c > @@ -26,6 +26,9 @@ static const char *modulename = gettext_noop("sorter"); > * Sort priority for database object types. > * Objects are sorted by type, and within a type by name. > * > + * Because materialized views can potentially reference system views, > + * DO_REFRESH_MATVIEW should always be the last thing on the list. > + * I think this comment is overly specific: any materialized view that references a view or table in a different schema (pg_catalog or not) will likely not refresh on pg_restore AIUI, so singling out system views doesn't look right to me. Michael -- Michael Banck Projektleiter / Senior Berater Tel.: +49 2166 9901-171 Fax: +49 2166 9901-100 Email: michael.banck@credativ.de credativ GmbH, HRB Mönchengladbach 12080 USt-ID-Nummer: DE204566209 Trompeterallee 108, 41189 Mönchengladbach Geschäftsführung: Dr. Michael Meskes, Jörg Folz, Sascha Heuer
On 2/22/17 5:38 PM, Michael Banck wrote: >> diff --git a/src/bin/pg_dump/pg_dump_sort.c b/src/bin/pg_dump/pg_dump_sort.c >> index ea643397ba..708a47f3cb 100644 >> --- a/src/bin/pg_dump/pg_dump_sort.c >> +++ b/src/bin/pg_dump/pg_dump_sort.c >> @@ -26,6 +26,9 @@ static const char *modulename = gettext_noop("sorter"); >> * Sort priority for database object types. >> * Objects are sorted by type, and within a type by name. >> * >> + * Because materialized views can potentially reference system views, >> + * DO_REFRESH_MATVIEW should always be the last thing on the list. >> + * > I think this comment is overly specific: any materialized view that > references a view or table in a different schema (pg_catalog or not) > will likely not refresh on pg_restore AIUI, so singling out system views > doesn't look right to me. This isn't a matter of excluded schemas. The problem is that if you had a matview that referenced a system view for something that was restored after DO_REFRESH_MATVIEW (such as subscriptions) then the view would be inaccurate after the restore. Stephen, hopefully that answers your question as well. :) -- Jim Nasby, Data Architect, Blue Treble Consulting, Austin TX Experts in Analytics, Data Architecture and PostgreSQL Data in Trouble? Get it in Treble! http://BlueTreble.com 855-TREBLE2 (855-873-2532)
Hi, I've found the (AIUI) previous discussion about this, it's Bug #13907: https://www.postgresql.org/message-id/flat/20160202161407.2778.24659%40wrigleys.postgresql.org#20160202161407.2778.24659@wrigleys.postgresql.org On Wed, Feb 22, 2017 at 05:24:49PM -0600, Jim Nasby wrote: > On 2/22/17 12:29 PM, Peter Eisentraut wrote: > >On 2/22/17 10:14, Jim Nasby wrote: > >>CREATE MATERIALIZED VIEW tmv AS SELECT * FROM pg_subscription; > >>SELECT 0 > >> > >>IOW, you can create matviews that depend on any other > >>table/view/matview, but right now if the matview includes certain items > >>it will mysteriously end up empty post-restore. > > > >Yes, by that logic matview refresh should always be last. In https://www.postgresql.org/message-id/9af4bc32-7e55-a21d-47e7-608582a8c48d%402ndquadrant.com you (Peter) wrote: "The reason that ACLs are restored last is that they could contain owner self-revokes. So anything that you run after the ACLs could fail because of that. I think a more complete fix would be to split up the ACL restores into the general part, which you would run right after the object is restored, and the owner revokes, which you would run last." > Patches for head attached. FWIW, Keven Grittner had proposed a more involved patch in https://www.postgresql.org/message-id/CACjxUsNmpQDL58zRm3EFS9atqGT8%2BX_2%2BFOCXpYBwWZw5wgi-A%40mail.gmail.com Michael -- Michael Banck Projektleiter / Senior Berater Tel.: +49 2166 9901-171 Fax: +49 2166 9901-100 Email: michael.banck@credativ.de credativ GmbH, HRB Mönchengladbach 12080 USt-ID-Nummer: DE204566209 Trompeterallee 108, 41189 Mönchengladbach Geschäftsführung: Dr. Michael Meskes, Jörg Folz, Sascha Heuer
On 2/22/17 18:24, Jim Nasby wrote: > On 2/22/17 12:29 PM, Peter Eisentraut wrote: >> On 2/22/17 10:14, Jim Nasby wrote: >>> CREATE MATERIALIZED VIEW tmv AS SELECT * FROM pg_subscription; >>> SELECT 0 >>> >>> IOW, you can create matviews that depend on any other >>> table/view/matview, but right now if the matview includes certain items >>> it will mysteriously end up empty post-restore. >> >> Yes, by that logic matview refresh should always be last. > > Patches for head attached. > > RLS was the first item added after DO_REFRESH_MATVIEW, which was added > in 9.5. So if we want to treat this as a bug, they'd need to be patched > as well, which is a simple matter of swapping 33 and 34. I wonder whether we should emphasize this change by assigning DO_REFRESH_MATVIEW a higher number, like 100? -- Peter Eisentraut http://www.2ndQuadrant.com/ PostgreSQL Development, 24x7 Support, Remote DBA, Training & Services
On 3/1/17 08:36, Peter Eisentraut wrote: > On 2/22/17 18:24, Jim Nasby wrote: >>> Yes, by that logic matview refresh should always be last. >> >> Patches for head attached. >> >> RLS was the first item added after DO_REFRESH_MATVIEW, which was added >> in 9.5. So if we want to treat this as a bug, they'd need to be patched >> as well, which is a simple matter of swapping 33 and 34. > > I wonder whether we should emphasize this change by assigning > DO_REFRESH_MATVIEW a higher number, like 100? Since there wasn't any interest in that idea, I have committed Jim's patch as is. -- Peter Eisentraut http://www.2ndQuadrant.com/ PostgreSQL Development, 24x7 Support, Remote DBA, Training & Services
On 3/4/17 11:49 AM, Peter Eisentraut wrote: >> I wonder whether we should emphasize this change by assigning >> DO_REFRESH_MATVIEW a higher number, like 100? > Since there wasn't any interest in that idea, I have committed Jim's > patch as is. Thanks. Something else that seems somewhat useful would be to have the sort defined by an array of the ENUM values in the correct order, and then have the code do the mechanical map generation. I'm guessing the only reasonable way to make that work would be to have some kind of a last item indicator value, so you know how many values were in the ENUM. Maybe there's a better way to do that... -- Jim Nasby, Chief Data Architect, OpenSCG
Hi, On Sat, Mar 04, 2017 at 02:49:36PM -0500, Peter Eisentraut wrote: > On 3/1/17 08:36, Peter Eisentraut wrote: > > On 2/22/17 18:24, Jim Nasby wrote: > >>> Yes, by that logic matview refresh should always be last. > >> > >> Patches for head attached. > >> > >> RLS was the first item added after DO_REFRESH_MATVIEW, which was added > >> in 9.5. So if we want to treat this as a bug, they'd need to be patched > >> as well, which is a simple matter of swapping 33 and 34. > > > > I wonder whether we should emphasize this change by assigning > > DO_REFRESH_MATVIEW a higher number, like 100? > > Since there wasn't any interest in that idea, I have committed Jim's > patch as is. Would this be a candidate for backpatching, or is the behaviour change in pg_dump trumping the issues it solves? Michael -- Michael Banck Projektleiter / Senior Berater Tel.: +49 2166 9901-171 Fax: +49 2166 9901-100 Email: michael.banck@credativ.de credativ GmbH, HRB Mönchengladbach 12080 USt-ID-Nummer: DE204566209 Trompeterallee 108, 41189 Mönchengladbach Geschäftsführung: Dr. Michael Meskes, Jörg Folz, Sascha Heuer
On 3/6/17 03:33, Michael Banck wrote: > Would this be a candidate for backpatching, or is the behaviour change > in pg_dump trumping the issues it solves? Unless someone literally has a materialized view on pg_policy, it wouldn't make a difference, so I'm not very keen on bothering to backpatch this. -- Peter Eisentraut http://www.2ndQuadrant.com/ PostgreSQL Development, 24x7 Support, Remote DBA, Training & Services
* Peter Eisentraut (peter.eisentraut@2ndquadrant.com) wrote: > On 3/6/17 03:33, Michael Banck wrote: > > Would this be a candidate for backpatching, or is the behaviour change > > in pg_dump trumping the issues it solves? > > Unless someone literally has a materialized view on pg_policy, it > wouldn't make a difference, so I'm not very keen on bothering to > backpatch this. Agreed. Thanks! Stephen
Stephen Frost <sfrost@snowman.net> writes: > * Peter Eisentraut (peter.eisentraut@2ndquadrant.com) wrote: >> On 3/6/17 03:33, Michael Banck wrote: >>> Would this be a candidate for backpatching, or is the behaviour change >>> in pg_dump trumping the issues it solves? >> Unless someone literally has a materialized view on pg_policy, it >> wouldn't make a difference, so I'm not very keen on bothering to >> backpatch this. > Agreed. So actually, the problem with Jim's patch is that it doesn't fix the problem. pg_dump's attempts to REFRESH matviews will still fail in common cases, because they still come out before GRANTs, because pg_dump treats ACLs as a completely independent thing to be done last. This was noted as far back as 2015 (in a thread previously linked from this thread), and it's also the cause of Jordan Gigov's current complaint at https://www.postgresql.org/message-id/CA%2BnBocAmQ%2BOPNSKUzaaLa-6eGYVw5KqexWJaRoGvrvLyDir9gg%40mail.gmail.com Digging around in the archives, I find that Kevin had already proposed a fix in https://www.postgresql.org/message-id/flat/20160202161407.2778.24659%40wrigleys.postgresql.org which I didn't particularly care for, and apparently nobody else did either. But we really oughta do *something*. The main problem with Kevin's fix, after thinking about it more, is that it shoves matview refresh commands into the same final processing phase where ACLs are done, which means that in a parallel restore they will not be done in parallel. That seems like a pretty serious objection, although maybe not so serious that we'd be willing to accept a major rewrite in the back branches to avoid it. I'm wondering at this point about having restore create a fake DO_ACLS object (fake in the sense that it isn't in the dump file) that would participate normally in the dependency sort, and which we'd give a priority before matview refreshes but after everything else. "Restore" of that object would perform the same operation we do now of running through the whole TOC and emitting grants/revokes. So it couldn't be parallelized in itself (at least not without an additional batch of work) but it could be treated as an indivisible parallelized task, and then the matview refreshes could be parallelizable tasks after that. There's also Peter's proposal of splitting up GRANTs from REVOKEs and putting only the latter at the end. I'm not quite convinced that that's a good idea but it certainly merits consideration. regards, tom lane
I wrote: > The main problem with Kevin's fix, after thinking about it more, is that > it shoves matview refresh commands into the same final processing phase > where ACLs are done, which means that in a parallel restore they will not > be done in parallel. That seems like a pretty serious objection, although > maybe not so serious that we'd be willing to accept a major rewrite in the > back branches to avoid it. > I'm wondering at this point about having restore create a fake DO_ACLS > object (fake in the sense that it isn't in the dump file) that would > participate normally in the dependency sort, and which we'd give a > priority before matview refreshes but after everything else. "Restore" > of that object would perform the same operation we do now of running > through the whole TOC and emitting grants/revokes. So it couldn't be > parallelized in itself (at least not without an additional batch of work) > but it could be treated as an indivisible parallelized task, and then the > matview refreshes could be parallelizable tasks after that. > There's also Peter's proposal of splitting up GRANTs from REVOKEs and > putting only the latter at the end. I'm not quite convinced that that's > a good idea but it certainly merits consideration. After studying things for awhile, I've concluded that that last option is probably not workable. ACL items contain a blob of SQL that would be tricky to pull apart, and is both version and options dependent, and contains ordering dependencies that seem likely to defeat any desire to put the REVOKEs last anyway. Instead, I've prepared the attached draft patch, which addresses the problem by teaching pg_backup_archiver.c to process TOC entries in three separate passes, "main" then ACLs then matview refreshes. It's survived light testing but could doubtless use further review. Another way we could attack this is to adopt something similar to the PRE_DATA_BOUNDARY/POST_DATA_BOUNDARY mechanism; that is, invent more dummy section boundary objects, add dependencies sufficient to constrain all TOC objects to be before or after the appropriate boundaries, and then let the dependency sort go at it. But I think that way is probably more expensive than this one, and it doesn't have any real advantage if there's not a potential for circular dependencies that need to be broken. If somebody else wants to try drafting a patch like that, I won't stand in the way, but I don't wanna do so. Not clear where we want to go from here. Should we try to get this into next month's minor releases, or review it in September's commitfest and back-patch after that? regards, tom lane diff --git a/src/bin/pg_dump/pg_backup_archiver.h b/src/bin/pg_dump/pg_backup_archiver.h index 6123859..3687687 100644 *** a/src/bin/pg_dump/pg_backup_archiver.h --- b/src/bin/pg_dump/pg_backup_archiver.h *************** typedef enum *** 203,208 **** --- 203,230 ---- OUTPUT_OTHERDATA /* writing data as INSERT commands */ } ArchiverOutput; + /* + * For historical reasons, ACL items are interspersed with everything else in + * a dump file's TOC; typically they're right after the object they're for. + * However, we need to restore data before ACLs, as otherwise a read-only + * table (ie one where the owner has revoked her own INSERT privilege) causes + * data restore failures. On the other hand, matview REFRESH commands should + * come out after ACLs, as otherwise non-superuser-owned matviews might not + * be able to execute. (If the permissions at the time of dumping would not + * allow a REFRESH, too bad; we won't fix that for you.) These considerations + * force us to make three passes over the TOC, restoring the appropriate + * subset of items in each pass. We assume that the dependency sort resulted + * in an appropriate ordering of items within each subset. + */ + typedef enum + { + RESTORE_PASS_MAIN = 0, /* Main pass (most TOC item types) */ + RESTORE_PASS_ACL, /* ACL item types */ + RESTORE_PASS_REFRESH /* Matview REFRESH items */ + + #define RESTORE_PASS_LAST RESTORE_PASS_REFRESH + } RestorePass; + typedef enum { REQ_SCHEMA = 0x01, /* want schema */ *************** struct _archiveHandle *** 329,334 **** --- 351,357 ---- int noTocComments; ArchiverStage stage; ArchiverStage lastErrorStage; + RestorePass restorePass; /* used only during parallel restore */ struct _tocEntry *currentTE; struct _tocEntry *lastErrorTE; }; diff --git a/src/bin/pg_dump/pg_backup_archiver.c b/src/bin/pg_dump/pg_backup_archiver.c index f461692..4cfb71c 100644 *** a/src/bin/pg_dump/pg_backup_archiver.c --- b/src/bin/pg_dump/pg_backup_archiver.c *************** static ArchiveHandle *_allocAH(const cha *** 58,64 **** SetupWorkerPtrType setupWorkerPtr); static void _getObjectDescription(PQExpBuffer buf, TocEntry *te, ArchiveHandle *AH); ! static void _printTocEntry(ArchiveHandle *AH, TocEntry *te, bool isData, bool acl_pass); static char *replace_line_endings(const char *str); static void _doSetFixedOutputState(ArchiveHandle *AH); static void _doSetSessionAuth(ArchiveHandle *AH, const char *user); --- 58,64 ---- SetupWorkerPtrType setupWorkerPtr); static void _getObjectDescription(PQExpBuffer buf, TocEntry *te, ArchiveHandle *AH); ! static void _printTocEntry(ArchiveHandle *AH, TocEntry *te, bool isData); static char *replace_line_endings(const char *str); static void _doSetFixedOutputState(ArchiveHandle *AH); static void _doSetSessionAuth(ArchiveHandle *AH, const char *user); *************** static void _selectTablespace(ArchiveHan *** 71,76 **** --- 71,77 ---- static void processEncodingEntry(ArchiveHandle *AH, TocEntry *te); static void processStdStringsEntry(ArchiveHandle *AH, TocEntry *te); static teReqs _tocEntryRequired(TocEntry *te, teSection curSection, RestoreOptions *ropt); + static RestorePass _tocEntryRestorePass(TocEntry *te); static bool _tocEntryIsACL(TocEntry *te); static void _disableTriggersIfNecessary(ArchiveHandle *AH, TocEntry *te); static void _enableTriggersIfNecessary(ArchiveHandle *AH, TocEntry *te); *************** static OutputContext SaveOutput(ArchiveH *** 86,98 **** static void RestoreOutput(ArchiveHandle *AH, OutputContext savedContext); static int restore_toc_entry(ArchiveHandle *AH, TocEntry *te, bool is_parallel); ! static void restore_toc_entries_prefork(ArchiveHandle *AH); ! static void restore_toc_entries_parallel(ArchiveHandle *AH, ParallelState *pstate, TocEntry *pending_list); - static void restore_toc_entries_postfork(ArchiveHandle *AH, TocEntry *pending_list); static void par_list_header_init(TocEntry *l); static void par_list_append(TocEntry *l, TocEntry *te); static void par_list_remove(TocEntry *te); static TocEntry *get_next_work_item(ArchiveHandle *AH, TocEntry *ready_list, ParallelState *pstate); --- 87,104 ---- static void RestoreOutput(ArchiveHandle *AH, OutputContext savedContext); static int restore_toc_entry(ArchiveHandle *AH, TocEntry *te, bool is_parallel); ! static void restore_toc_entries_prefork(ArchiveHandle *AH, ! TocEntry *pending_list); ! static void restore_toc_entries_parallel(ArchiveHandle *AH, ! ParallelState *pstate, ! TocEntry *pending_list); ! static void restore_toc_entries_postfork(ArchiveHandle *AH, TocEntry *pending_list); static void par_list_header_init(TocEntry *l); static void par_list_append(TocEntry *l, TocEntry *te); static void par_list_remove(TocEntry *te); + static void move_to_ready_list(TocEntry *pending_list, TocEntry *ready_list, + RestorePass pass); static TocEntry *get_next_work_item(ArchiveHandle *AH, TocEntry *ready_list, ParallelState *pstate); *************** RestoreArchive(Archive *AHX) *** 625,644 **** AH->currSchema = NULL; } - /* - * In serial mode, we now process each non-ACL TOC entry. - * - * In parallel mode, turn control over to the parallel-restore logic. - */ if (parallel_mode) { ParallelState *pstate; TocEntry pending_list; par_list_header_init(&pending_list); /* This runs PRE_DATA items and then disconnects from the database */ ! restore_toc_entries_prefork(AH); Assert(AH->connection == NULL); /* ParallelBackupStart() will actually fork the processes */ --- 631,648 ---- AH->currSchema = NULL; } if (parallel_mode) { + /* + * In parallel mode, turn control over to the parallel-restore logic. + */ ParallelState *pstate; TocEntry pending_list; par_list_header_init(&pending_list); /* This runs PRE_DATA items and then disconnects from the database */ ! restore_toc_entries_prefork(AH, &pending_list); Assert(AH->connection == NULL); /* ParallelBackupStart() will actually fork the processes */ *************** RestoreArchive(Archive *AHX) *** 652,679 **** } else { for (te = AH->toc->next; te != AH->toc; te = te->next) ! (void) restore_toc_entry(AH, te, false); ! } ! /* ! * Scan TOC again to output ownership commands and ACLs ! */ ! for (te = AH->toc->next; te != AH->toc; te = te->next) ! { ! AH->currentTE = te; ! /* Both schema and data objects might now have ownership/ACLs */ ! if ((te->reqs & (REQ_SCHEMA | REQ_DATA)) != 0) { ! /* Show namespace if available */ ! if (te->namespace) ! ahlog(AH, 1, "setting owner and privileges for %s \"%s.%s\"\n", ! te->desc, te->namespace, te->tag); ! else ! ahlog(AH, 1, "setting owner and privileges for %s \"%s\"\n", ! te->desc, te->tag); ! _printTocEntry(AH, te, false, true); } } --- 656,706 ---- } else { + /* + * In serial mode, process everything in three phases: normal items, + * then ACLs, then matview refresh items. We might be able to skip + * one or both extra phases in some cases, eg data-only restores. + */ + bool haveACL = false; + bool haveRefresh = false; + for (te = AH->toc->next; te != AH->toc; te = te->next) ! { ! if ((te->reqs & (REQ_SCHEMA | REQ_DATA)) == 0) ! continue; /* ignore if not to be dumped at all */ ! switch (_tocEntryRestorePass(te)) ! { ! case RESTORE_PASS_MAIN: ! (void) restore_toc_entry(AH, te, false); ! break; ! case RESTORE_PASS_ACL: ! haveACL = true; ! break; ! case RESTORE_PASS_REFRESH: ! haveRefresh = true; ! break; ! } ! } ! if (haveACL) { ! for (te = AH->toc->next; te != AH->toc; te = te->next) ! { ! if ((te->reqs & (REQ_SCHEMA | REQ_DATA)) != 0 && ! _tocEntryRestorePass(te) == RESTORE_PASS_ACL) ! (void) restore_toc_entry(AH, te, false); ! } ! } ! ! if (haveRefresh) ! { ! for (te = AH->toc->next; te != AH->toc; te = te->next) ! { ! if ((te->reqs & (REQ_SCHEMA | REQ_DATA)) != 0 && ! _tocEntryRestorePass(te) == RESTORE_PASS_REFRESH) ! (void) restore_toc_entry(AH, te, false); ! } } } *************** restore_toc_entry(ArchiveHandle *AH, Toc *** 720,729 **** AH->currentTE = te; /* Work out what, if anything, we want from this entry */ ! if (_tocEntryIsACL(te)) ! reqs = 0; /* ACLs are never restored here */ ! else ! reqs = te->reqs; /* * Ignore DATABASE entry unless we should create it. We must check this --- 747,753 ---- AH->currentTE = te; /* Work out what, if anything, we want from this entry */ ! reqs = te->reqs; /* * Ignore DATABASE entry unless we should create it. We must check this *************** restore_toc_entry(ArchiveHandle *AH, Toc *** 744,760 **** defnDumped = false; ! if ((reqs & REQ_SCHEMA) != 0) /* We want the schema */ { ! /* Show namespace if available */ if (te->namespace) ahlog(AH, 1, "creating %s \"%s.%s\"\n", te->desc, te->namespace, te->tag); else ahlog(AH, 1, "creating %s \"%s\"\n", te->desc, te->tag); ! ! _printTocEntry(AH, te, false, false); defnDumped = true; if (strcmp(te->desc, "TABLE") == 0) --- 768,786 ---- defnDumped = false; ! /* ! * If it has a schema component that we want, then process that ! */ ! if ((reqs & REQ_SCHEMA) != 0) { ! /* Show namespace in log message if available */ if (te->namespace) ahlog(AH, 1, "creating %s \"%s.%s\"\n", te->desc, te->namespace, te->tag); else ahlog(AH, 1, "creating %s \"%s\"\n", te->desc, te->tag); ! _printTocEntry(AH, te, false); defnDumped = true; if (strcmp(te->desc, "TABLE") == 0) *************** restore_toc_entry(ArchiveHandle *AH, Toc *** 810,816 **** } /* ! * If we have a data component, then process it */ if ((reqs & REQ_DATA) != 0) { --- 836,842 ---- } /* ! * If it has a data component that we want, then process that */ if ((reqs & REQ_DATA) != 0) { *************** restore_toc_entry(ArchiveHandle *AH, Toc *** 826,832 **** */ if (AH->PrintTocDataPtr != NULL) { ! _printTocEntry(AH, te, true, false); if (strcmp(te->desc, "BLOBS") == 0 || strcmp(te->desc, "BLOB COMMENTS") == 0) --- 852,858 ---- */ if (AH->PrintTocDataPtr != NULL) { ! _printTocEntry(AH, te, true); if (strcmp(te->desc, "BLOBS") == 0 || strcmp(te->desc, "BLOB COMMENTS") == 0) *************** restore_toc_entry(ArchiveHandle *AH, Toc *** 914,920 **** { /* If we haven't already dumped the defn part, do so now */ ahlog(AH, 1, "executing %s %s\n", te->desc, te->tag); ! _printTocEntry(AH, te, false, false); } } --- 940,946 ---- { /* If we haven't already dumped the defn part, do so now */ ahlog(AH, 1, "executing %s %s\n", te->desc, te->tag); ! _printTocEntry(AH, te, false); } } *************** _tocEntryRequired(TocEntry *te, teSectio *** 2944,2950 **** --- 2970,2998 ---- } /* + * Identify which pass we should restore this TOC entry in. + * + * See notes with the RestorePass typedef in pg_backup_archiver.h. + */ + static RestorePass + _tocEntryRestorePass(TocEntry *te) + { + /* "ACL LANGUAGE" was a crock emitted only in PG 7.4 */ + if (strcmp(te->desc, "ACL") == 0 || + strcmp(te->desc, "ACL LANGUAGE") == 0 || + strcmp(te->desc, "DEFAULT ACL") == 0) + return RESTORE_PASS_ACL; + if (strcmp(te->desc, "MATERIALIZED VIEW DATA") == 0) + return RESTORE_PASS_REFRESH; + return RESTORE_PASS_MAIN; + } + + /* * Identify TOC entries that are ACLs. + * + * Note: it seems worth duplicating some code here to avoid a hard-wired + * assumption that these are exactly the same entries that we restore during + * the RESTORE_PASS_ACL phase. */ static bool _tocEntryIsACL(TocEntry *te) *************** _getObjectDescription(PQExpBuffer buf, T *** 3364,3386 **** type); } static void ! _printTocEntry(ArchiveHandle *AH, TocEntry *te, bool isData, bool acl_pass) { RestoreOptions *ropt = AH->public.ropt; - /* ACLs are dumped only during acl pass */ - if (acl_pass) - { - if (!_tocEntryIsACL(te)) - return; - } - else - { - if (_tocEntryIsACL(te)) - return; - } - /* * Avoid dumping the public schema, as it will already be created ... * unless we are using --clean mode (and *not* --create mode), in which --- 3412,3429 ---- type); } + /* + * Emit the SQL commands to create the object represented by a TOC entry + * + * This now also includes issuing an ALTER OWNER command to restore the + * object's ownership, if wanted. But note that the object's permissions + * will remain at default, until the matching ACL TOC entry is restored. + */ static void ! _printTocEntry(ArchiveHandle *AH, TocEntry *te, bool isData) { RestoreOptions *ropt = AH->public.ropt; /* * Avoid dumping the public schema, as it will already be created ... * unless we are using --clean mode (and *not* --create mode), in which *************** _printTocEntry(ArchiveHandle *AH, TocEnt *** 3567,3573 **** * If it's an ACL entry, it might contain SET SESSION AUTHORIZATION * commands, so we can no longer assume we know the current auth setting. */ ! if (acl_pass) { if (AH->currUser) free(AH->currUser); --- 3610,3616 ---- * If it's an ACL entry, it might contain SET SESSION AUTHORIZATION * commands, so we can no longer assume we know the current auth setting. */ ! if (_tocEntryIsACL(te)) { if (AH->currUser) free(AH->currUser); *************** replace_line_endings(const char *str) *** 3597,3602 **** --- 3640,3648 ---- return result; } + /* + * Write the file header for a custom-format archive + */ void WriteHead(ArchiveHandle *AH) { *************** dumpTimestamp(ArchiveHandle *AH, const c *** 3772,3787 **** /* * Main engine for parallel restore. * ! * Work is done in three phases. ! * First we process all SECTION_PRE_DATA tocEntries, in a single connection, ! * just as for a standard restore. Second we process the remaining non-ACL ! * steps in parallel worker children (threads on Windows, processes on Unix), ! * each of which connects separately to the database. Finally we process all ! * the ACL entries in a single connection (that happens back in ! * RestoreArchive). */ static void ! restore_toc_entries_prefork(ArchiveHandle *AH) { bool skipped_some; TocEntry *next_work_item; --- 3818,3831 ---- /* * Main engine for parallel restore. * ! * Parallel restore is done in three phases. In this first phase, ! * we'll process all SECTION_PRE_DATA TOC entries that are allowed to be ! * processed in the RESTORE_PASS_MAIN pass. (In practice, that's all ! * PRE_DATA items other than ACLs.) Entries we can't process now are ! * added to the pending_list for later phases to deal with. */ static void ! restore_toc_entries_prefork(ArchiveHandle *AH, TocEntry *pending_list) { bool skipped_some; TocEntry *next_work_item; *************** restore_toc_entries_prefork(ArchiveHandl *** 3799,3821 **** * about showing all the dependencies of SECTION_PRE_DATA items, so we do * not risk trying to process them out-of-order. * * Note: as of 9.2, it should be guaranteed that all PRE_DATA items appear * before DATA items, and all DATA items before POST_DATA items. That is * not certain to be true in older archives, though, so this loop is coded * to not assume it. */ skipped_some = false; for (next_work_item = AH->toc->next; next_work_item != AH->toc; next_work_item = next_work_item->next) { ! /* NB: process-or-continue logic must be the inverse of loop below */ if (next_work_item->section != SECTION_PRE_DATA) { /* DATA and POST_DATA items are just ignored for now */ if (next_work_item->section == SECTION_DATA || next_work_item->section == SECTION_POST_DATA) { skipped_some = true; - continue; } else { --- 3843,3873 ---- * about showing all the dependencies of SECTION_PRE_DATA items, so we do * not risk trying to process them out-of-order. * + * Stuff that we can't do immediately gets added to the pending_list. + * Note: we don't yet filter out entries that aren't going to be restored. + * They might participate in dependency chains connecting entries that + * should be restored, so we treat them as live until we actually process + * them. + * * Note: as of 9.2, it should be guaranteed that all PRE_DATA items appear * before DATA items, and all DATA items before POST_DATA items. That is * not certain to be true in older archives, though, so this loop is coded * to not assume it. */ + AH->restorePass = RESTORE_PASS_MAIN; skipped_some = false; for (next_work_item = AH->toc->next; next_work_item != AH->toc; next_work_item = next_work_item->next) { ! bool do_now = true; ! if (next_work_item->section != SECTION_PRE_DATA) { /* DATA and POST_DATA items are just ignored for now */ if (next_work_item->section == SECTION_DATA || next_work_item->section == SECTION_POST_DATA) { + do_now = false; skipped_some = true; } else { *************** restore_toc_entries_prefork(ArchiveHandl *** 3826,3843 **** * comment's dependencies are satisfied, so skip it for now. */ if (skipped_some) ! continue; } } ! ahlog(AH, 1, "processing item %d %s %s\n", ! next_work_item->dumpId, ! next_work_item->desc, next_work_item->tag); ! (void) restore_toc_entry(AH, next_work_item, false); ! /* there should be no touch of ready_list here, so pass NULL */ ! reduce_dependencies(AH, next_work_item, NULL); } /* --- 3878,3912 ---- * comment's dependencies are satisfied, so skip it for now. */ if (skipped_some) ! do_now = false; } } ! /* ! * Also skip items that need to be forced into later passes. We need ! * not set skipped_some in this case, since by assumption no main-pass ! * items could depend on these. ! */ ! if (_tocEntryRestorePass(next_work_item) != RESTORE_PASS_MAIN) ! do_now = false; ! if (do_now) ! { ! /* OK, restore the item and update its dependencies */ ! ahlog(AH, 1, "processing item %d %s %s\n", ! next_work_item->dumpId, ! next_work_item->desc, next_work_item->tag); ! (void) restore_toc_entry(AH, next_work_item, false); ! ! /* there should be no touch of ready_list here, so pass NULL */ ! reduce_dependencies(AH, next_work_item, NULL); ! } ! else ! { ! /* Nope, so add it to pending_list */ ! par_list_append(pending_list, next_work_item); ! } } /* *************** restore_toc_entries_prefork(ArchiveHandl *** 3863,3951 **** /* * Main engine for parallel restore. * ! * Work is done in three phases. ! * First we process all SECTION_PRE_DATA tocEntries, in a single connection, ! * just as for a standard restore. This is done in restore_toc_entries_prefork(). ! * Second we process the remaining non-ACL steps in parallel worker children ! * (threads on Windows, processes on Unix), these fork off and set up their ! * connections before we call restore_toc_entries_parallel_forked. ! * Finally we process all the ACL entries in a single connection (that happens ! * back in RestoreArchive). */ static void restore_toc_entries_parallel(ArchiveHandle *AH, ParallelState *pstate, TocEntry *pending_list) { - bool skipped_some; TocEntry ready_list; TocEntry *next_work_item; ahlog(AH, 2, "entering restore_toc_entries_parallel\n"); /* ! * Initialize the lists of ready items, the list for pending items has ! * already been initialized in the caller. After this setup, the pending ! * list is everything that needs to be done but is blocked by one or more ! * dependencies, while the ready list contains items that have no ! * remaining dependencies. Note: we don't yet filter out entries that ! * aren't going to be restored. They might participate in dependency ! * chains connecting entries that should be restored, so we treat them as ! * live until we actually process them. */ par_list_header_init(&ready_list); ! skipped_some = false; ! for (next_work_item = AH->toc->next; next_work_item != AH->toc; next_work_item = next_work_item->next) ! { ! /* NB: process-or-continue logic must be the inverse of loop above */ ! if (next_work_item->section == SECTION_PRE_DATA) ! { ! /* All PRE_DATA items were dealt with above */ ! continue; ! } ! if (next_work_item->section == SECTION_DATA || ! next_work_item->section == SECTION_POST_DATA) ! { ! /* set this flag at same point that previous loop did */ ! skipped_some = true; ! } ! else ! { ! /* SECTION_NONE items must be processed if previous loop didn't */ ! if (!skipped_some) ! continue; ! } ! ! if (next_work_item->depCount > 0) ! par_list_append(pending_list, next_work_item); ! else ! par_list_append(&ready_list, next_work_item); ! } /* * main parent loop * * Keep going until there is no worker still running AND there is no work ! * left to be done. */ - ahlog(AH, 1, "entering main parallel loop\n"); ! while ((next_work_item = get_next_work_item(AH, &ready_list, pstate)) != NULL || ! !IsEveryWorkerIdle(pstate)) { if (next_work_item != NULL) { /* If not to be restored, don't waste time launching a worker */ ! if ((next_work_item->reqs & (REQ_SCHEMA | REQ_DATA)) == 0 || ! _tocEntryIsACL(next_work_item)) { ahlog(AH, 1, "skipping item %d %s %s\n", next_work_item->dumpId, next_work_item->desc, next_work_item->tag); ! par_list_remove(next_work_item); reduce_dependencies(AH, next_work_item, &ready_list); ! continue; } --- 3932,3991 ---- /* * Main engine for parallel restore. * ! * Parallel restore is done in three phases. In this second phase, ! * we process entries by dispatching them to parallel worker children ! * (processes on Unix, threads on Windows), each of which connects ! * separately to the database. Inter-entry dependencies are respected, ! * and so is the RestorePass multi-pass structure. When we can no longer ! * make any entries ready to process, we exit. Normally, there will be ! * nothing left to do; but if there is, the third phase will mop up. */ static void restore_toc_entries_parallel(ArchiveHandle *AH, ParallelState *pstate, TocEntry *pending_list) { TocEntry ready_list; TocEntry *next_work_item; ahlog(AH, 2, "entering restore_toc_entries_parallel\n"); /* ! * The pending_list contains all items that we need to restore. Move all ! * items that are available to process immediately into the ready_list. ! * After this setup, the pending list is everything that needs to be done ! * but is blocked by one or more dependencies, while the ready list ! * contains items that have no remaining dependencies and are OK to ! * process in the current restore pass. */ par_list_header_init(&ready_list); ! AH->restorePass = RESTORE_PASS_MAIN; ! move_to_ready_list(pending_list, &ready_list, AH->restorePass); /* * main parent loop * * Keep going until there is no worker still running AND there is no work ! * left to be done. Note invariant: at top of loop, there should always ! * be at least one worker available to dispatch a job to. */ ahlog(AH, 1, "entering main parallel loop\n"); ! for (;;) { + /* Look for an item ready to be dispatched to a worker */ + next_work_item = get_next_work_item(AH, &ready_list, pstate); if (next_work_item != NULL) { /* If not to be restored, don't waste time launching a worker */ ! if ((next_work_item->reqs & (REQ_SCHEMA | REQ_DATA)) == 0) { ahlog(AH, 1, "skipping item %d %s %s\n", next_work_item->dumpId, next_work_item->desc, next_work_item->tag); ! /* Drop it from ready_list, and update its dependencies */ par_list_remove(next_work_item); reduce_dependencies(AH, next_work_item, &ready_list); ! /* Loop around to see if anything else can be dispatched */ continue; } *************** restore_toc_entries_parallel(ArchiveHand *** 3953,3966 **** next_work_item->dumpId, next_work_item->desc, next_work_item->tag); par_list_remove(next_work_item); DispatchJobForTocEntry(AH, pstate, next_work_item, ACT_RESTORE, mark_restore_job_done, &ready_list); } else { ! /* at least one child is working and we have nothing ready. */ } /* --- 3993,4026 ---- next_work_item->dumpId, next_work_item->desc, next_work_item->tag); + /* Remove it from ready_list, and dispatch to some worker */ par_list_remove(next_work_item); DispatchJobForTocEntry(AH, pstate, next_work_item, ACT_RESTORE, mark_restore_job_done, &ready_list); } + else if (IsEveryWorkerIdle(pstate)) + { + /* + * Nothing is ready and no worker is running, so we're done with + * the current pass or maybe with the whole process. + */ + if (AH->restorePass == RESTORE_PASS_LAST) + break; /* No more parallel processing is possible */ + + /* Advance to next restore pass */ + AH->restorePass++; + /* That probably allows some stuff to be made ready */ + move_to_ready_list(pending_list, &ready_list, AH->restorePass); + /* Loop around to see if anything's now ready */ + continue; + } else { ! /* ! * We have nothing ready, but at least one child is working, so ! * wait for some subjob to finish. ! */ } /* *************** restore_toc_entries_parallel(ArchiveHand *** 3980,3988 **** --- 4040,4060 ---- next_work_item ? WFW_ONE_IDLE : WFW_GOT_STATUS); } + /* There should now be nothing in ready_list. */ + Assert(ready_list.par_next == &ready_list); + ahlog(AH, 1, "finished main parallel loop\n"); } + /* + * Main engine for parallel restore. + * + * Parallel restore is done in three phases. In this third phase, + * we mop up any remaining TOC entries by processing them serially. + * This phase normally should have nothing to do, but if we've somehow + * gotten stuck due to circular dependencies or some such, this provides + * at least some chance of completing the restore successfully. + */ static void restore_toc_entries_postfork(ArchiveHandle *AH, TocEntry *pending_list) { *************** restore_toc_entries_postfork(ArchiveHand *** 4002,4010 **** _doSetFixedOutputState(AH); /* ! * Make sure there is no non-ACL work left due to, say, circular ! * dependencies, or some other pathological condition. If so, do it in the ! * single parent connection. */ for (te = pending_list->par_next; te != pending_list; te = te->par_next) { --- 4074,4083 ---- _doSetFixedOutputState(AH); /* ! * Make sure there is no work left due to, say, circular dependencies, or ! * some other pathological condition. If so, do it in the single parent ! * connection. We don't sweat about RestorePass ordering; it's likely we ! * already violated that. */ for (te = pending_list->par_next; te != pending_list; te = te->par_next) { *************** restore_toc_entries_postfork(ArchiveHand *** 4012,4019 **** te->dumpId, te->desc, te->tag); (void) restore_toc_entry(AH, te, false); } - - /* The ACLs will be handled back in RestoreArchive. */ } /* --- 4085,4090 ---- *************** par_list_remove(TocEntry *te) *** 4073,4078 **** --- 4144,4179 ---- /* + * Move all immediately-ready items from pending_list to ready_list. + * + * Items are considered ready if they have no remaining dependencies and + * they belong in the current restore pass. (See also reduce_dependencies, + * which applies the same logic one-at-a-time.) + */ + static void + move_to_ready_list(TocEntry *pending_list, TocEntry *ready_list, + RestorePass pass) + { + TocEntry *te; + TocEntry *next_te; + + for (te = pending_list->par_next; te != pending_list; te = next_te) + { + /* must save list link before possibly moving te to other list */ + next_te = te->par_next; + + if (te->depCount == 0 && + _tocEntryRestorePass(te) == pass) + { + /* Remove it from pending_list ... */ + par_list_remove(te); + /* ... and add to ready_list */ + par_list_append(ready_list, te); + } + } + } + + /* * Find the next work item (if any) that is capable of being run now. * * To qualify, the item must have no remaining dependencies *************** reduce_dependencies(ArchiveHandle *AH, T *** 4457,4464 **** { TocEntry *otherte = AH->tocsByDumpId[te->revDeps[i]]; otherte->depCount--; ! if (otherte->depCount == 0 && otherte->par_prev != NULL) { /* It must be in the pending list, so remove it ... */ par_list_remove(otherte); --- 4558,4574 ---- { TocEntry *otherte = AH->tocsByDumpId[te->revDeps[i]]; + Assert(otherte->depCount > 0); otherte->depCount--; ! ! /* ! * It's ready if it has no remaining dependencies and it belongs in ! * the current restore pass. However, don't move it if it has not yet ! * been put into the pending list. ! */ ! if (otherte->depCount == 0 && ! _tocEntryRestorePass(otherte) == AH->restorePass && ! otherte->par_prev != NULL) { /* It must be in the pending list, so remove it ... */ par_list_remove(otherte); -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers
On Tue, Jul 25, 2017 at 10:45 PM, Tom Lane <tgl@sss.pgh.pa.us> wrote: > Instead, I've prepared the attached draft patch, which addresses the > problem by teaching pg_backup_archiver.c to process TOC entries in > three separate passes, "main" then ACLs then matview refreshes. > It's survived light testing but could doubtless use further review. What worries me a bit is the possibility that something might depend on a matview having already been refreshed. I cannot immediately think of a case whether such a thing happens that you won't dismiss as wrong-headed, but how sure are we that none such will arise? I mean, a case that would actually break is if you had a CHECK constraint or a functional index that contained a function which referenced a materialized view for some validation or transformation purpose. Then it wouldn't be formally immutable, of course. But maybe we can imagine that some other case not involving lying could exist, or come to exist. -- Robert Haas EnterpriseDB: http://www.enterprisedb.com The Enterprise PostgreSQL Company
Robert Haas <robertmhaas@gmail.com> writes: > On Tue, Jul 25, 2017 at 10:45 PM, Tom Lane <tgl@sss.pgh.pa.us> wrote: >> Instead, I've prepared the attached draft patch, which addresses the >> problem by teaching pg_backup_archiver.c to process TOC entries in >> three separate passes, "main" then ACLs then matview refreshes. > What worries me a bit is the possibility that something might depend > on a matview having already been refreshed. I cannot immediately > think of a case whether such a thing happens that you won't dismiss as > wrong-headed, but how sure are we that none such will arise? Um, there are already precisely zero guarantees about that. pg_dump has always been free to order these actions in any way that satisfies the dependencies it knows about. It's certainly possible to break it by introducing hidden dependencies within CHECK conditions. But that's always been true, with or without materialized views, and we've always dismissed complaints about it with "sorry, we won't support that". (I don't think the trigger case is such a problem, because we restore data before creating any triggers.) Meanwhile, we have problems that bite people who aren't doing anything stranger than having a matview owned by a non-superuser. How do you propose to fix that without reordering pg_dump's actions? Lastly, the proposed patch has the advantage that it acts at the restore stage, not when a dump is being prepared. Since it isn't affecting what goes into dump archives, it doesn't tie our hands if we think of some better idea later. regards, tom lane
But why should a superuser need the ACL to be applied before being allowed access? If you make the permission-checking function check if the user is a superuser before looking for per-user grants, wouldn't that solve the issue?
On 26 July 2017 at 16:30, Tom Lane <tgl@sss.pgh.pa.us> wrote:
Robert Haas <robertmhaas@gmail.com> writes:
> On Tue, Jul 25, 2017 at 10:45 PM, Tom Lane <tgl@sss.pgh.pa.us> wrote:
>> Instead, I've prepared the attached draft patch, which addresses the
>> problem by teaching pg_backup_archiver.c to process TOC entries in
>> three separate passes, "main" then ACLs then matview refreshes.
> What worries me a bit is the possibility that something might depend
> on a matview having already been refreshed. I cannot immediately
> think of a case whether such a thing happens that you won't dismiss as
> wrong-headed, but how sure are we that none such will arise?
Um, there are already precisely zero guarantees about that. pg_dump
has always been free to order these actions in any way that satisfies
the dependencies it knows about.
It's certainly possible to break it by introducing hidden dependencies
within CHECK conditions. But that's always been true, with or without
materialized views, and we've always dismissed complaints about it with
"sorry, we won't support that". (I don't think the trigger case is
such a problem, because we restore data before creating any triggers.)
Meanwhile, we have problems that bite people who aren't doing anything
stranger than having a matview owned by a non-superuser. How do you
propose to fix that without reordering pg_dump's actions?
Lastly, the proposed patch has the advantage that it acts at the restore
stage, not when a dump is being prepared. Since it isn't affecting what
goes into dump archives, it doesn't tie our hands if we think of some
better idea later.
regards, tom lane
Jordan Gigov <coladict@gmail.com> writes: > But why should a superuser need the ACL to be applied before being allowed > access? If you make the permission-checking function check if the user is a > superuser before looking for per-user grants, wouldn't that solve the issue? The superuser's permissions are not relevant, because the materialized view is run with the permissions of its owner, not the superuser. We are not going to consider changing that, either, because it would open trivial-to-exploit security holes (any user could set up a trojan horse matview and just wait for the next pg_upgrade or dump/restore). regards, tom lane
On Wed, Jul 26, 2017 at 9:30 AM, Tom Lane <tgl@sss.pgh.pa.us> wrote: > Meanwhile, we have problems that bite people who aren't doing anything > stranger than having a matview owned by a non-superuser. How do you > propose to fix that without reordering pg_dump's actions? Obviously changing the order is essential. What I wasn't sure about was whether a hard division into phases was a good idea. The advantage of the dependency mechanism is that, at least in theory, you can get things into any order you need by sticking the right dependencies in there. Your description made it sound like you'd hard-coded matview entries to the end rather than relying on dependencies, which could be a problem if something later turns up where we don't want them all the way at the end. -- Robert Haas EnterpriseDB: http://www.enterprisedb.com The Enterprise PostgreSQL Company
I thought of a quite different way that we might attack this problem, but I'm not sure if it's worth pursuing or not. The idea is basically that we should get rid of the existing kluge for pushing ACLs to the end altogether, and instead rely on dependency sorting to make things work. This'd require some significant surgery on pg_dump, but it seems doable: * Make ACLs have actual DumpableObject representations that participate in the topological sort. * Add more explicit dependencies between these objects and other ones. For example, we'd fix the matview-permissions problem by adding dependencies not only on the tables a matview references, but on their ACLs. Another case is that we'd want to ensure that a table's ACL comes out after the table data, so as to solve the original problem that the current behavior was meant to deal with, ie allowing restore of tables even if they've been made read-only by revoking the owner's INSERT privilege. But I think that case would best be dealt with by forcing table ACLs to be post-data objects. (Otherwise they might come out in the middle "data" section of a restore, which is likely to break some client assumptions somewhere.) Another variant of that is that table ACLs probably need to come out after CREATE TRIGGER and foreign-key constraints, in case an owner has revoked their own TRIGGER or REFERENCES privilege. This seems potentially doable, but I sure don't see it coming out small enough to be a back-patchable fix; nor would it make things work for existing archive files, only new dumps. In fact, if we don't want to break parallel restore for existing dump files, we'd probably still have to implement the postpone-all-ACLs rule when dealing with an old dump file. (But maybe we could blow off that case, and just say you might have to not use parallel restore with old dumps that contain self-revoke ACLs?) The bigger issue is whether there's some failure case this would cause that I'm missing altogether. Thoughts? regards, tom lane
On Wed, Jul 26, 2017 at 2:41 PM, Tom Lane <tgl@sss.pgh.pa.us> wrote: > The bigger issue is whether there's some failure case this would cause > that I'm missing altogether. Thoughts? I think dependencies are fundamentally the right model for this sort of problem. I can't imagine what could go wrong that wouldn't amount to a failure to insert all of the right dependencies, and thus be fixable by inserting whatever was missing. It's possible I am lacking in imagination, so take that opinion for what it's worth. -- Robert Haas EnterpriseDB: http://www.enterprisedb.com The Enterprise PostgreSQL Company
Robert Haas <robertmhaas@gmail.com> writes: > On Wed, Jul 26, 2017 at 2:41 PM, Tom Lane <tgl@sss.pgh.pa.us> wrote: >> The bigger issue is whether there's some failure case this would cause >> that I'm missing altogether. Thoughts? > I think dependencies are fundamentally the right model for this sort > of problem. I can't imagine what could go wrong that wouldn't amount > to a failure to insert all of the right dependencies, and thus be > fixable by inserting whatever was missing. I tend to agree. One fairly obvious risk factor is that we end up with circular dependencies, but in that case we have conflicting requirements and we're gonna have to decide what to do about them no matter what. Another potential issue is pg_dump performance; this could result in a large increase in the number of DumpableObjects and dependency links. The topological sort is O(N log N), so we shouldn't hit any serious big-O problems, but even a more-or-less-linear slowdown might be problematic for some people. On the whole though, I believe pg_dump is mostly limited by its server queries, and that would probably still be true. But the big point: even if we had the code for this ready-to-go, I'd be hesitant to inject it into v10 at this point, let alone back-patch. If we go down this path we won't be seeing a fix for the matview ordering problem before v11 (at best). Maybe that's acceptable considering it's been there for several years already, but it's annoying. So I'm thinking we should consider the multi-pass patch I posted as a short-term, backpatchable workaround, which we could hope to replace with dependency logic later. regards, tom lane
Am Donnerstag, den 27.07.2017, 15:52 -0400 schrieb Tom Lane: > So I'm thinking we should consider the multi-pass patch I posted as > a short-term, backpatchable workaround, which we could hope to > replace with dependency logic later. +1, it would be really nice if this could be fixed in the back-branches before v11. Michael -- Michael Banck Projektleiter / Senior Berater Tel.: +49 2166 9901-171 Fax: +49 2166 9901-100 Email: michael.banck@credativ.de credativ GmbH, HRB Mönchengladbach 12080 USt-ID-Nummer: DE204566209 Trompeterallee 108, 41189 Mönchengladbach Geschäftsführung: Dr. Michael Meskes, Jörg Folz, Sascha Heuer
Michael Banck <michael.banck@credativ.de> writes: > Am Donnerstag, den 27.07.2017, 15:52 -0400 schrieb Tom Lane: >> So I'm thinking we should consider the multi-pass patch I posted as >> a short-term, backpatchable workaround, which we could hope to >> replace with dependency logic later. > +1, it would be really nice if this could be fixed in the back-branches > before v11. Done. regards, tom lane