Re: Looking for ideas on how to speed up warehouse loading - Mailing list pgsql-performance

From CoL
Subject Re: Looking for ideas on how to speed up warehouse loading
Msg-id c69j1e$27jc$1@news.hub.org
In response to Looking for ideas on how to speed up warehouse loading  (Sean Shanny <shannyconsulting@earthlink.net>)
List pgsql-performance
hi,

Sean Shanny wrote, On 4/22/2004 23:56:
>
> SELECT t1.id, t2.url FROM referral_temp t2 LEFT OUTER JOIN d_referral t1
> ON t2.url = t1.referral_raw_url ORDER BY t1.id

An index on url (text) makes little sense here. Try using an md5 column
(char(32)) that contains the md5 hash of the url field, and join on those
columns instead. You can have a better index on this char(32) field.
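
A rough sketch of what that could look like, in case it helps. The column
name url_md5 and the index name are made up for illustration; the table
names and the url columns are taken from the quoted query. It assumes the
built-in md5() function is available in your PostgreSQL version. The extra
comparison on the raw url is my own optional addition, only there to rule
out hash collisions.

```sql
-- Hypothetical column url_md5 on the dimension table, filled from the raw url.
ALTER TABLE d_referral ADD COLUMN url_md5 char(32);
UPDATE d_referral SET url_md5 = md5(referral_raw_url);

-- Index the short fixed-width hash instead of the long text url.
CREATE INDEX d_referral_url_md5_idx ON d_referral (url_md5);

-- Same hypothetical column on the temp table that holds the new load.
ALTER TABLE referral_temp ADD COLUMN url_md5 char(32);
UPDATE referral_temp SET url_md5 = md5(url);

-- Join on the hash columns; the second condition is an optional
-- collision guard and can be dropped if you trust the hash alone.
SELECT t1.id, t2.url
FROM referral_temp t2
LEFT OUTER JOIN d_referral t1
  ON t2.url_md5 = t1.url_md5
 AND t2.url = t1.referral_raw_url
ORDER BY t1.id;
```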

Do not forget to analyze the tables after the data load. You can also
fine-tune your postgresql.conf, for example default_statistics_target for
better index statistics, among other settings.
Check these info pages:
http://www.varlena.com/varlena/GeneralBits/Tidbits/annotated_conf_e.html
http://www.varlena.com/varlena/GeneralBits/Tidbits/perf.html
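
For example, something along these lines after each load. The value 100 is
only an illustrative number, not a recommendation, and which column deserves
a higher target is an assumption on my part.

```sql
-- Refresh planner statistics once the load has finished.
ANALYZE referral_temp;
ANALYZE d_referral;

-- Optionally collect more detailed statistics for one column
-- (100 is an illustrative value), then re-analyze so it takes effect.
ALTER TABLE d_referral ALTER COLUMN referral_raw_url SET STATISTICS 100;
ANALYZE d_referral;

-- Or raise the default for the current session instead of editing
-- postgresql.conf (again, 100 is only an example).
SET default_statistics_target = 100;
```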

C.
