Thread: PostgreSQL is extremely slow on Windows

PostgreSQL is extremely slow on Windows

From

"Vig, Sandor (G/FI-2)"

Date:

23 February 2005, 06:28:39

Hi,

I've downloaded the latest release (PostgreSQL 8.0) for windows.
Installation was OK, but I have tried to restore a database.
It had more than ~100.000 records. Usually I use PostgreSQL
under Linux, and it used to be done under 10 minutes.

Under W2k und XP it took 3 hours(!) Why is it so slow????

The commands I used:

Under Linux: (duration: 1 minute)
    pg_dump -D databasename > databasename.db

Under Windows: (duration: 3 - 3.5 hours(!))
    psql databasename < databasename.db >nul

It seemed to me, that only 20-30 transactions/sec were
writen to the database.

I need to write scripts for automatic (sheduled) database
backup and recovery.

Help anyone?

Bye,

Vig Sándor


The information transmitted is intended only for the person or entity to
which it is addressed and may contain confidential and/or privileged
material. Any review, retransmission, dissemination or other use of, or
taking of any action in reliance upon, this information by persons or
entities other than the intended recipient is prohibited. If you received
this in error, please contact the sender and delete the material from any
computer.

Re: PostgreSQL is extremely slow on Windows

From

"Vig, Sandor (G/FI-2)"

Date:

23 February 2005, 10:41:44

Hi,

I changed fsync to false. It took 8 minutes to restore the full database.
That is 26 times faster than before. :-/ (aprox. 200 tps)
With background writer it took 12 minutes. :-(

The funny thing is, I had a VMWARE emulation on the same Windows mashine,
running Red Hat, with fsync turned on. It took also 8 minutes to finish.
Probably the Linux code is better + VMWARE optimises (physical) disk
access.(?)

It seems to me, I need 2 types of operating modes:
- For bulk loading (database restore) : fsync=false
- Normal operation fsync=true

Am I right? How can I do it "elegantly"?

I Think, it should be a "performance tuning guide" in the docomentation.
(not just explaning the settings) Playing with the settings could be quite
anoying.

Anyway, thanks for the tips.

Bye,
Vig Sándor



-----Original Message-----
From: Magnus Hagander [mailto:mha@sollentuna.net]
Sent: Tuesday, February 22, 2005 7:15 PM
To: Vig, Sandor (G/FI-2); pgsql-performance@postgresql.org
Subject: RE: [PERFORM] PostgreSQL is extremely slow on Windows



>I've downloaded the latest release (PostgreSQL 8.0) for windows.
>Installation was OK, but I have tried to restore a database.
>It had more than ~100.000 records. Usually I use PostgreSQL
>under Linux, and it used to be done under 10 minutes.
>
>Under W2k und XP it took 3 hours(!) Why is it so slow????
>
>The commands I used:
>
>Under Linux: (duration: 1 minute)
>    pg_dump -D databasename > databasename.db
>
>Under Windows: (duration: 3 - 3.5 hours(!))
>    psql databasename < databasename.db >nul
>
>It seemed to me, that only 20-30 transactions/sec were
>writen to the database.

20-30 transactionsi s about what you'll get on a single disk on Windows
today.
We have a patch in testing that will bring this up to about 80.
You can *never* get above 80 without using write cache, regardless of
your OS, if you have a single disk. You might want to look into wether
write cacheing is enabled on your linux box, and disable it. (unless you
are using RAID) A lot points towards write cache enabled on your system.

If you need the performance that equals the one with write cache on, you
can set fsync=off. But then you will lose the guarantee that your
machine will survive an unclean shutdown or crash. I would strongly
advice against it on a production system - same goes for running with
write cache!

//Magnus

The information transmitted is intended only for the person or entity to
which it is addressed and may contain confidential and/or privileged
material. Any review, retransmission, dissemination or other use of, or
taking of any action in reliance upon, this information by persons or
entities other than the intended recipient is prohibited. If you received
this in error, please contact the sender and delete the material from any
computer.

Re: PostgreSQL is extremely slow on Windows

From

"Magnus Hagander"

Date:

23 February 2005, 14:50:59

> > You can *never* get above 80 without using write cache,
> regardless of
> > your OS, if you have a single disk.
>
> Why? Even with, say, a 15K RPM disk? Or the ability to
> fsync() multiple concurrently-committing transactions at once?

Uh. What I meant was a single *IDE* disk. Sorry. Been too deep into
helping ppl with IDE disks lately to remember that SCSI can be a lot
faster :-) And we're talking about restore of a dump, so it's a single
session.

(Strictly, that shuld be a 7200rpm IDE disk. I don't know if any others
are common, though)

//mha

Re: PostgreSQL is extremely slow on Windows

From

"Magnus Hagander"

Date:

23 February 2005, 14:55:12

> Hi,
>
> I changed fsync to false. It took 8 minutes to restore the
> full database.
> That is 26 times faster than before. :-/ (aprox. 200 tps)
> With background writer it took 12 minutes. :-(

That seems reasonable.


> The funny thing is, I had a VMWARE emulation on the same
> Windows mashine, running Red Hat, with fsync turned on. It
> took also 8 minutes to finish.
> Probably the Linux code is better + VMWARE optimises (physical) disk
> access.(?)

Vmware makes fsync() into a no-op. It will always cache the disk.
(This is vmware workstation. Their server products behave differntly, of
course)


> It seems to me, I need 2 types of operating modes:
> - For bulk loading (database restore) : fsync=false
> - Normal operation fsync=true

Yes, fsync=false is very good for bulk loading *IFF* you can live with
data loss in case you get a crash during load.


> Am I right? How can I do it "elegantly"?

You'll need to edit postgresql.conf and restart the server for this.


> I Think, it should be a "performance tuning guide" in the
> docomentation.
> (not just explaning the settings) Playing with the settings
> could be quite anoying.

There is some information on techdocs.postgresql.org you miht be
interested in.

//Magnus

Re: PostgreSQL is extremely slow on Windows

From

Neil Conway

Date:

23 February 2005, 22:36:58

Magnus Hagander wrote:
> Yes, fsync=false is very good for bulk loading *IFF* you can live with
> data loss in case you get a crash during load.

It's not merely data loss -- you could encounter potentially
unrecoverable database corruption.

There is a TODO item about allowing the delaying of WAL writes. If we
maintain the WAL invariant (that is, a WAL record describing a change
must hit disk before the change itself does) but simply don't flush the
WAL at transaction commit, we should be able to get better performance
without the risk of database corruption (so we would need to keep pages
modified by the committed transaction pinned in memory until the WAL has
been flushed, which might be done on a periodic basis).

Naturally, there is a risk of losing data in the period between
transaction commit and syncing the WAL, but no risk of database
corruption. This seems a reasonable approach to providing better
performance for people who don't need the strict guarantees provided by
fsync=true.

-Neil

Re: PostgreSQL is extremely slow on Windows

From

Tom Lane

Date:

23 February 2005, 22:56:50

Neil Conway <neilc@samurai.com> writes:
> There is a TODO item about allowing the delaying of WAL writes. If we
> maintain the WAL invariant (that is, a WAL record describing a change
> must hit disk before the change itself does) but simply don't flush the
> WAL at transaction commit, we should be able to get better performance
> without the risk of database corruption (so we would need to keep pages
> modified by the committed transaction pinned in memory until the WAL has
> been flushed, which might be done on a periodic basis).

That interlock already exists, in the form of the bufmgr LSN logic.

I think this "feature" might be as simple as

            XLogFlush(recptr);

becomes

            /* Must flush if we are deleting files... */
            if (PerCommitFlush || nrels > 0)
                XLogFlush(recptr);

in RecordTransactionCommit.

            regards, tom lane

Re: PostgreSQL is extremely slow on Windows

From

Bruce Momjian

Date:

24 February 2005, 21:25:50

Neil Conway wrote:
> Magnus Hagander wrote:
> > Yes, fsync=false is very good for bulk loading *IFF* you can live with
> > data loss in case you get a crash during load.
>
> It's not merely data loss -- you could encounter potentially
> unrecoverable database corruption.
>
> There is a TODO item about allowing the delaying of WAL writes. If we
> maintain the WAL invariant (that is, a WAL record describing a change
> must hit disk before the change itself does) but simply don't flush the
> WAL at transaction commit, we should be able to get better performance
> without the risk of database corruption (so we would need to keep pages
> modified by the committed transaction pinned in memory until the WAL has
> been flushed, which might be done on a periodic basis).
>
> Naturally, there is a risk of losing data in the period between
> transaction commit and syncing the WAL, but no risk of database
> corruption. This seems a reasonable approach to providing better
> performance for people who don't need the strict guarantees provided by
> fsync=true.

Right.  Just for clarity, you might lose the last 5 seconds of
transactions, but all transactsions would be completely committed or
aborted in your datbase.  Right now with fsync off you can get
transactions partially commited in your database, which is a serious
problem (think moving money from one account to another).

--
  Bruce Momjian                        |  http://candle.pha.pa.us
  pgman@candle.pha.pa.us               |  (610) 359-1001
  +  If your life is a hard drive,     |  13 Roberts Road
  +  Christ can be your backup.        |  Newtown Square, Pennsylvania 19073

Re: PostgreSQL is extremely slow on Windows

From

Greg Stark

Date:

25 February 2005, 05:02:42

Bruce Momjian <pgman@candle.pha.pa.us> writes:

> Right now with fsync off you can get transactions partially commited in your
> database, which is a serious problem (think moving money from one account to
> another).

It's worse than that. You can get a totally corrupted database. Things like
duplicated records (the before and after image of an update). Or indexes that
are out of sync with the table. This can cause strange inconsistent results
depending on the plan queries use, or outright database crashes.

--
greg