Re: GSoC on WAL-logging hash indexes - Mailing list pgsql-hackers

From ktm@rice.edu
Subject Re: GSoC on WAL-logging hash indexes
Date
Msg-id 20140430125544.GG5746@aart.rice.edu
Whole thread Raw
In response to Re: GSoC on WAL-logging hash indexes  (Peter Geoghegan <pg@heroku.com>)
Responses Re: GSoC on WAL-logging hash indexes  (Peter Geoghegan <pg@heroku.com>)
List pgsql-hackers
On Wed, Apr 30, 2014 at 12:26:20AM -0700, Peter Geoghegan wrote:
> On Mon, Mar 3, 2014 at 8:12 AM, Robert Haas <robertmhaas@gmail.com> wrote:
> >> As a GSoC student, I will implement WAL recovery of hash indexes using the
> >> other index types' WAL code as a guide.
>
> Frankly, I'm skeptical of the idea that hash indexes will ever really
> be useful. I realize that that's a counter-intuitive conclusion, but
> there are many things we could do to improve B-Tree CPU costs to make
> them closer to those of hash indexes, without making them any less
> flexible. I myself would much rather work on that, and intend to.
>
> The O(1) cost seems attractive when you consider that that only
> requires that we read one index page from disk to service any given
> index scan, but in fact B-Trees almost always only require the same.
> They are of course also much more flexible. The concurrency
> characteristics B-Trees are a lot better understood. I sincerely
> suggest that we forget about conventional hash table type indexes. I
> fear they're a lost cause.
>
> --
> Peter Geoghegan
>
Hi Peter,

I do not think that CPU costs matter as much as the O(1) probe to
get a result value specifically for very large indexes/tables where
even caching the upper levels of a B-tree index would kill your
working set in memory. I know, I know, everyone has so much memory
and can just buy more... but this does matter. I also think that
development of hash indexes has been stalled waiting for WAL
logging. For example, hash indexes can almost trivially become
more space efficient as they grow in size by utilizing the page
number to represent the prefix bits of the hash value for a bucket.

My 2 cents.
Ken


pgsql-hackers by date:

Previous
From: Andrew Dunstan
Date:
Subject: Re: Considerer Harmful Considered Harmful
Next
From: Hannu Krosing
Date:
Subject: Re: "Considerer Harmful Considered Harmful" categorized as Mostly Harmless