Re: Performance problem with 50,000,000 rows - Mailing list pgsql-general

From Mark Kirkwood
Subject Re: Performance problem with 50,000,000 rows
Date
Msg-id 01101117575900.01048@spikey.slithery.org
In response to Performance problem with 50,000,000 rows  (David Link <dlink@soundscan.com>)
List pgsql-general
Previously:
>
>Table sizes .....
>
>bk_inv  : 46,790,877
>bk_title:  2,311,710
>
>Query :
>
>select   i.isbn,
>         t.vendor,
>         i.store,
>         i.qty
>from     bk_inv i,
>         bk_title t
>where    i.isbn = t.isbn
>and      t.vendor = '01672708' ;
>

It might be worth putting an index on bk_title.vendor; then the scan of this
table will quickly find the appropriate rows (assuming vendor is selective).
The join to bk_inv can then use the bk_inv.isbn index as before, but
hopefully over a much smaller dataset, so the planner might be able to use a
nested loop from bk_title -> bk_inv instead of a huge merge join.
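
As a minimal sketch, something like the following (the index name is just
illustrative, and the EXPLAIN will show whether the planner actually picks
the new plan):

    -- index the selective column, then refresh planner statistics
    -- so the optimizer can take the new index into account
    create index bk_title_vendor_idx on bk_title (vendor);
    vacuum analyze bk_title;

    -- rerun the query under EXPLAIN to check the chosen plan
    explain
    select   i.isbn, t.vendor, i.store, i.qty
    from     bk_inv i, bk_title t
    where    i.isbn = t.isbn
    and      t.vendor = '01672708';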

It also might be worth playing with sort_mem and shared_buffers (if you have
not done so already), since you have a reasonable amount of memory.
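
For instance (the values below are only assumptions to adapt to your
machine): sort_mem can be raised per session with SET, while shared_buffers
lives in postgresql.conf and needs a server restart:

    -- raise per-sort memory for this session only
    -- (value in KB; the 7.x default is 512)
    set sort_mem = 32768;

    # in postgresql.conf (restart required; value is an assumption)
    # shared_buffers is counted in 8KB buffers, so 4096 is about 32MB
    shared_buffers = 4096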

good luck

Mark


