Home > mailing lists

Re: Major differences between oracle and postgres performance - what can I do ? - Mailing list pgsql-performance

From	Tom Lane
Subject	Re: Major differences between oracle and postgres performance - what can I do ?
Date	June 18, 2004 14:58:16
Msg-id	20412.1087570627@sss.pgh.pa.us Whole thread Raw
In response to	Re: Major differences between oracle and postgres performance - what can I do ? (Gary Cowell <gary_cowell@yahoo.co.uk>)
Responses	Re: Major differences between oracle and postgres performance - what can I do ?
List	pgsql-performance

Tree view

=?iso-8859-1?q?Gary=20Cowell?= <gary_cowell@yahoo.co.uk> writes:
>    ->  Sort  (cost=117865.77..119220.13 rows=541741
> width=132) (actual time=63623.417..66127.641
> rows=541741 loops=1)

This is clearly where the time is going.

> sort_mem = 16384

Probably not enough for this problem.  The estimated data size is
upwards of 60 meg (132 bytes * half a mil rows); allowing for per-row
overhead I suspect that you'd need sort_mem approaching 100 meg for
a fully-in-memory sort.  (Also I'd take the width=132 with a *big*
grain of salt, unless you have reason to know that it's accurate.)

The on-disk sorting algorithm that we use is designed to favor minimum
disk space consumption over speed.  It has a fairly nonrandom access
pattern that can be pretty slow if your disks don't have good seek-time
specs.

I don't know whether Oracle's performance advantage is because they're
not swapping the sort to disk at all, or because they use a different
on-disk sort method with a more sequential access pattern.

[... thinks for awhile ...]  It seems possible that they may use sort
code that knows it is performing a DISTINCT operation and discards
duplicates on sight.  Given that there are only 534 distinct values,
the sort would easily stay in memory if that were happening.

It would be interesting to compare Oracle and PG times for a straight
sort of half a million rows, without the DISTINCT part; that would
give us a clue whether they simply have much better sort technology,
or whether they have a special optimization for sort+unique.

            regards, tom lane

pgsql-performance by date:

From: Tom Lane
Date: 18 June 2004, 14:29:35
Subject: Re: *very* inefficient choice made by the planner (regarding

From: Tom Lane
Date: 18 June 2004, 15:12:17
Subject: Re: [BULK] Problems with vacuum!

Re: Major differences between oracle and postgres performance - what can I do ? - Mailing list pgsql-performance

Previous

Next