Home > mailing lists

Re: index questions - Mailing list pgsql-performance

From	Franco Bruno Borghesi
Subject	Re: index questions
Date	July 25, 2003 15:30:45
Msg-id	1059158034.1183.43.camel@taz Whole thread
In response to	Re: index questions (Josh Berkus <josh@agliodbs.com>)
List	pgsql-performance

Tree view

what you say is that the index is not effective because of its size, but it would still be used *if* the conditions are right... In this case, I care about performance, not space.

But what you say about the index not being good because 100% of rows match the condition confirms what I suspected.

Thanks for your help.

On Fri, 2003-07-25 at 14:28, Josh Berkus wrote:

Franco,

> CREATE INDEX sales_k1 ON sales(clientId, branchId, productId,
> employeeId, saleDate, price, qty);

A 7-column index is unlikely to be effective -- the index will be almost as 
large as the table.   Try indexing only the first 3-4 columns instead.  

> I want to make a function that returns the FIRS saleId of the sale that
> matches some conditions. I will
> always receive the Client Id, but not always the other arguments (sent
> as NULLs).

Well, keep in mind that your multi-column index will only be useful if all 
columns are queried starting from the left.  That is, the index will be 
ignored if you have a "where productId = x" without a "where branchid = y".

> CREATE OR REPLACE FUNCTION findSale(INTEGER, INTEGER, INTEGER, INTEGER)
> RETURNS INTEGER AS '
> DECLARE
>    a_clientId ALIAS FOR $1;
>    a_branchId ALIAS FOR $1;
>    a_productId ALIAS FOR $1;
>    a_employeeId ALIAS FOR $1;

Your aliases are wrong here.

>       branchId=coalesce(a_branchId, branchId) AND /*branchId is null?
> anything will be ok*/
>       productId=coalesce(a_productId, productId) AND /*productId is
> null? anything will be ok*/

On a very large table this will be very inefficient.   you'll be comparing the 
productid, for example, even if no productid is passed ... and the index 
won't do you any good because the planner should figure out that 100% of rows 
match the condition.  

Instead, I recommend that you build up a dynamic query as a string and then 
pass only the conditions sent by the user.  You can then EXECUTE the query 
and loop through it for a result.

Of course, YMMV.   My approach will require you to create more indexes which 
could be a problem if you have limited disk space.

Attachment

signature.asc

pgsql-performance by date:

From: Josh Berkus
Date: 25 July 2003, 14:29:48
Subject: Re: index questions

From: "Mendola Gaetano"
Date: 25 July 2003, 21:45:05
Subject: Wrong rows number expected

Re: index questions - Mailing list pgsql-performance

Attachment

Previous

Next