Re: Hash index use presently(?) discouraged since 2005: revive or bury it? - Mailing list pgsql-performance

From Stefan Keller
Subject Re: Hash index use presently(?) discouraged since 2005: revive or bury it?
Date
Msg-id CAFcOn2-1qLTQCTbqwqHxSm+WccD+uqD2AkqqwFdSUq8bTQtZ=Q@mail.gmail.com
Whole thread Raw
In response to Re: Hash index use presently(?) discouraged since 2005: revive or bury it?  (Tom Lane <tgl@sss.pgh.pa.us>)
Responses Re: Hash index use presently(?) discouraged since 2005: revive or bury it?
List pgsql-performance
2011/9/14 Tom Lane <tgl@sss.pgh.pa.us>:
> (...) I think that
> the current state of affairs is still what depesz said, namely that
> there might be cases where they'd be a win to use, except the lack of
> WAL support is a killer.  I imagine somebody will step up and do that
> eventually.

Should I open a ticket?

Stefan

2011/9/14 Tom Lane <tgl@sss.pgh.pa.us>:
> Peter Geoghegan <peter@2ndquadrant.com> writes:
>> On 14 September 2011 00:04, Stefan Keller <sfkeller@gmail.com> wrote:
>>> Has this been verified on a recent release? I can't believe that hash
>>> performs so bad over all these points. Theory tells me otherwise and
>>> http://en.wikipedia.org/wiki/Hash_table seems to be a success.
>
>> Hash indexes have been improved since 2005 - their performance was
>> improved quite a bit in 9.0. Here's a more recent analysis:
>
>> http://www.depesz.com/index.php/2010/06/28/should-you-use-hash-index/
>
> Yeah, looking into the git logs shows several separate major changes
> committed during 2008, including storing only the hash code not the
> whole indexed value (big win on wide values, and lets you index values
> larger than one index page, which doesn't work in btree).  I think that
> the current state of affairs is still what depesz said, namely that
> there might be cases where they'd be a win to use, except the lack of
> WAL support is a killer.  I imagine somebody will step up and do that
> eventually.
>
> The big picture though is that we're not going to remove hash indexes,
> even if they're nearly useless in themselves, because hash index
> opclasses provide the foundation for the system's knowledge of how to
> do the datatype-specific hashing needed for hash joins and hash
> aggregation.  And those things *are* big wins, even if hash indexes
> themselves never become so.
>
>                        regards, tom lane
>

pgsql-performance by date:

Previous
From: "Kevin Grittner"
Date:
Subject: Re: Migrated from 8.3 to 9.0 - need to update config (re-post)
Next
From: Greg Smith
Date:
Subject: Re: RAID Controller (HP P400) beat by SW-RAID?