Home > mailing lists

Re: Beginner Question:Why it always make sure that the postgres better than common csv file storage in disaster recovery? - Mailing list pgsql-general

From	Tom Lane
Subject	Re: Beginner Question:Why it always make sure that the postgres better than common csv file storage in disaster recovery?
Date	July 4, 2022 03:37:05
Msg-id	1449790.1656905825@sss.pgh.pa.us Whole thread Raw
In response to	Beginner Question:Why it always make sure that the postgres better than common csv file storage in disaster recovery? (Wen Yi <chuxuec@outlook.com>)
List	pgsql-general

Tree view

Wen Yi <chuxuec@outlook.com> writes:
> Since it's all built on top of the file system,why it always make sure 
> that the postgres better than common csv file storage in disaster 
> recovery?

Sure, Postgres cannot be any more reliable than the filesystem it's
sitting on top of (nor the physical storage underneath that, etc etc).

However, if you're comparing to some program that just writes a
flat file in CSV format or the like, that program is probably
not even *trying* to offer reliable storage.  Some things that
are likely missing:

* POSIX-compatible file systems promise nothing about the durability
of data that hasn't been successfully fsync'd.  You need to issue
fsync's, and you need a plan about what to do if you crash between
writing some data and getting an fsync confirmation, because maybe
those bits are safely down on disk, or maybe they aren't, or maybe
just some of them are.

* If you did crash partway through an update, you'd like some
assurances that the user-visible state after recovery will be
what it was before starting the failed update.  That CSV-using
program probably isn't even trying to do that.  Getting back
to a consistent state after a crash typically involves some
scheme along the lines of replaying a write-ahead log.

* None of this is worth anything if you can't even tell the
difference between good data and bad data.  CSV is pretty low
on redundancy --- not as bad as some formats, sure, but it's far
from checkable.

There's more to it than that, but if there's not any attention
to crash recovery then it's not what I'd call a database.  The
filesystem alone won't promise much here.

            regards, tom lane

pgsql-general by date:

From: Adrian Klaver
Date: 04 July 2022, 03:31:57
Subject: Re: Beginner Question:Why it always make sure that the postgres better than common csv file storage in disaster recovery?

From: Bogdan Siara
Date: 04 July 2022, 08:33:45
Subject: Postgresql 13.7 hangs down

Re: Beginner Question:Why it always make sure that the postgres better than common csv file storage in disaster recovery? - Mailing list pgsql-general

Previous

Next