Re: using hash index when BETWEEN is specified - Mailing list pgsql-hackers

From Zdenek Kotala
Subject Re: using hash index when BETWEEN is specified
Date
Msg-id 48C7DA44.303@sun.com
Whole thread Raw
In response to Re: using hash index when BETWEEN is specified  (Hannu Krosing <hannu@2ndQuadrant.com>)
Responses Re: using hash index when BETWEEN is specified  (Tom Lane <tgl@sss.pgh.pa.us>)
List pgsql-hackers
Hannu Krosing napsal(a):
> On Wed, 2008-09-10 at 07:13 -0400, Robert Haas wrote:
>>>> I'm not planner guru but it seems to me that BETWEEN clause could be
>>>> rewritten as a IN clause for integer data types and small interval.
>>> Where should the line be drawn.
>>> Define small :)
>> When the estimated cost is lower?
> 
> You still need to draw a line for when to even try estimating the cost .
> 
> Will this be interval of 10 ? or 100 ? or 10000 ?

I think it depends of ration of unique integer number in a table and 
numbers of requested interval, number distribution and total number of rows.

For example if you have 10 distinct number and each has 100 occurrence 
then full scan is better (for between 1 and 5). But if each number 
occurs 100000x. Then using hash index should be effective.
    Zdenek


pgsql-hackers by date:

Previous
From: Markus Wanner
Date:
Subject: Re: Synchronous Log Shipping Replication
Next
From: "Marko Kreen"
Date:
Subject: Re: Base64 decode/encode performance