Re: [PATCH] pg_upgrade fails when postgres/template1 isn't in default tablespace - Mailing list pgsql-hackers
From | Bruce Momjian |
---|---|
Subject | Re: [PATCH] pg_upgrade fails when postgres/template1 isn't in default tablespace |
Date | |
Msg-id | 20150810173552.GD6582@momjian.us Whole thread Raw |
In response to | [PATCH] pg_upgrade fails when postgres/template1 isn't in default tablespace (Marti Raudsepp <marti@juffo.org>) |
Responses |
Re: [PATCH] pg_upgrade fails when postgres/template1 isn't
in default tablespace
|
List | pgsql-hackers |
On Fri, Jun 19, 2015 at 07:10:40PM +0300, Marti Raudsepp wrote: > Hi list Sorry I am just getting this report. Thanks to the people who "stalled" for me. > One of my databases failed to upgrade successfully and produced this error in > the copying phase: > > error while copying relation "pg_catalog.pg_largeobject" ("/srv/ssd/ > PG_9.3_201306121/1/12023" to "/PG_9.4_201409291/1/12130"): No such file or > directory As with all good bug reports, there are multiple bugs here. ;-) Notice the second path has an invalid prefix: "/PG_9.4_201409291/1/12130" --- obviously something is seriously wrong. The failure of pg_dumpall to move 'postgres' and 'template1' databases to the new tablespace cannot explain that, i.e. I could understand pg_upgrade looking for pg_largeobject in the default data directory, or in the new one, but not in a path that doesn't even exist. pg_upgrade tracks old and new tablespaces separately, so there is obviously something wrong. And I found it, patch attached. The pg_upgrade code was testing for the tablespace of the _old_ object, then assigning the old and _new_ tablespaces based on that. The first patch fixes that, and should be backpatched to all supported pg_upgrade versions. > Turns out this happens when either the "postgres" or "template1" databases have > been moved to a non-default tablespace. pg_dumpall does not dump attributes > (such as tablespace) for these databases; pg_upgrade queries the new cluster > about the tablespace for these relations and builds a broken destination path > for the copy/link operation. > > The least bad solution seems to be generating ALTER DATBASE SET TABLESPACE > commands for these from pg_dumpall. Previously a --globals-only dump didn't > generate psql \connect commands, but you can't run SET TABLESPACE from within > the same database you're connected to. So to move "postgres", it needs to > connect to "template1" and vice versa. That seems fine for the purpose of > pg_upgrade which can assume a freshly created cluster with both databases > intact. Yes, seems like a good solution. > I have implemented a proof of concept patch for this. Currently I'm only > tackling the binary upgrade failure and not general pg_dumpall. > > Alternatively, we could allow SET TABLESPACE in the current database, which > seems less ugly to me. A code comment says "Obviously can't move the tables of > my own database", but it's not obvious to me why. If I'm the only connected > backend, it seems that any caches and open files could be invalidated. But I > don't know how big of an undertaking that would be. Your submitted patch, attached, also looks good, and should be backpatched. My only question is whether this should be for all runs of pg_dumpall, not just in binary upgrade mode. Comments? Once we agree on this, I will apply these to all back branches and run tests by moving template1 before the upgrade to make sure it works for all PG versions. > pg_upgrade) misses: > * Nothing at all is dumped for the template0 database, although ACLs, settings > and the tablespace can be changed by the user Uh, we _assume_ no one is connecting to template0, but you are right people can change things _about_ template0. It would be nice to preserve those. > * template1 & postgres databases retain ACLs and settings, but not attributes > like TABLESPACE or CONNECTION LIMIT. Other attributes like LC_COLLATE and > LC_CTYPE can't be changed without recreating the DB, but those don't matter > for the pg_upgrade case anyway. > > It seems those would be good material for another patch? Agreed. I am not sure how we have gotten this far with so few complaints about this problem. -- Bruce Momjian <bruce@momjian.us> http://momjian.us EnterpriseDB http://enterprisedb.com + Everyone has their own god. +
Attachment
pgsql-hackers by date: