Re: [HACKERS] A Better External Sort? - Mailing list pgsql-performance

From Gregory Maxwell
Subject Re: [HACKERS] A Better External Sort?
Date
Msg-id e692861c0509301907r7f1cd5b8h8b2b1f3a85321313@mail.gmail.com
Whole thread Raw
In response to Re: [HACKERS] A Better External Sort?  (Ron Peacetree <rjpeace@earthlink.net>)
List pgsql-performance
On 9/28/05, Ron Peacetree <rjpeace@earthlink.net> wrote:
> 2= We use my method to sort two different tables.  We now have these
> very efficient representations of a specific ordering on these tables.  A
> join operation can now be done using these Btrees rather than the
> original data tables that involves less overhead than many current
> methods.

If we want to make joins very fast we should implement them using RD
trees. For the example cases where a join against a very large table
will produce a much smaller output, a RD tree will provide pretty much
the optimal behavior at a very low memory cost.

On the subject of high speed tree code for in-core applications, you
should check out http://judy.sourceforge.net/ . The performance
(insert, remove, lookup, AND storage) is really quite impressive.
Producing cache friendly code is harder than one might expect, and it
appears the judy library has already done a lot of the hard work.
Though it is *L*GPLed, so perhaps that might scare some here away from
it. :) and good luck directly doing joins with a LC-TRIE. ;)

pgsql-performance by date:

Previous
From: Hannu Krosing
Date:
Subject: Re: [HACKERS] A Better External Sort?
Next
From: "ALÝ ÇELÝK"
Date:
Subject: Which one FreeBSD or Linux