Home > mailing lists

Re: Use "average field correlation per hard disk - Mailing list pgsql-hackers

From	Philip Warner
Subject	Re: Use "average field correlation per hard disk
Date	March 10, 2004 02:02:59
Msg-id	6.0.0.22.0.20040310165855.04817b30@203.8.195.10 Whole thread Raw
In response to	Re: Use "average field correlation per hard disk page" instead of global one? (Tom Lane <tgl@sss.pgh.pa.us>)
List	pgsql-hackers

Tree view

At 04:08 PM 10/03/2004, Tom Lane wrote:
>None at the moment, but I'm open to suggestions.  It seems like we might
>need different stats for equality probes than range probes.

What about my suggestion from August 2000:
    "There might be a way to side-step the issue here. I assume that    the index nodes contain a pointer to a record
ina file, which    has some kind of file position. By comparing the file positions    on one leaf node, and then
averagingthe node cluster values,    you might be able to get a pretty good idea of the *real* clustering."
 

I don't use the CLUSTER command, but I have clustered data and would like 
to be able to take advantage of the fact if possible. *If* the record 
pointers can be used to indicate closeness, then the same approach of 
randomly sampling index nodes would seem to work. Then again, maybe I don't 
know enough about the storage techniques...




----------------------------------------------------------------
Philip Warner                    |     __---_____
Albatross Consulting Pty. Ltd.   |----/       -  \
(A.B.N. 75 008 659 498)          |          /(@)   ______---_
Tel: (+61) 0500 83 82 81         |                 _________  \
Fax: (+61) 03 5330 3172          |                 ___________ |
Http://www.rhyme.com.au          |                /           \|                                 |    --________--
PGP key available upon request,  |  /
and from pgp.mit.edu:11371       |/

pgsql-hackers by date:

From: Tom Lane
Date: 10 March 2004, 01:08:28
Subject: Re: Use "average field correlation per hard disk page" instead of global one?

From: Tom Lane
Date: 10 March 2004, 02:13:24
Subject: Re: Timing of 'SELECT 1'

Re: Use "average field correlation per hard disk - Mailing list pgsql-hackers

Previous

Next