OK I have it fixed;; just for anyone who's interested - the error was in the base backup procedure.
When switched to plain "rsync -az" - it works like a charm.
Most probably, the fault was I assumed that you can use the rsync --update option when doing base backup.
You cannot, especially when time sync on both servers is not accurate. In my case, destination server clock was few minutes in future.
So the pg_clog was broken due to this. Which means a completely corrupted database.
thanks Stephen & Andres for your responses.
Hi.
There is a large (>5T) database on PostgreSQL 9.0.23.
I would like to setup new WAL-shipping standby.
https://www.postgresql.org/docs/9.0/warm-standby.html
On my way I find unexpected issues. Here's the story, in short:
1. WAL archiving to remote archive is setup & verified
2. base backup is transferred directly to new server using
pg_start_backup + rsync + pg_stop_backup.
3. recovery.conf is created
4. Server is started and consumes all the remaining WAL segments
accumulated in the archive - finishing with optimistic message LOG:
consistent recovery state reached at 9FC1/112BEE10.
5. When I go to postgres on the standby and try to connect system
"postgres" database psql: FATAL: could not open file "global/11819":
No such file or directory
I guessed the OID refereds to the pg_authid, but other system tables
might be affected too.
What could be wrong here?
Thanks!