Home > mailing lists

Re: multi-column index - Mailing list pgsql-performance

From	Manfred Koizar
Subject	Re: multi-column index
Date	March 18, 2005 10:31:56
Msg-id	985l31t5jn1bvmhaihifslitbc940pdo5f@email.aon.at Whole thread Raw
In response to	multi-column index (Daniel Crisan <d.crisan@ibcp.fr>)
List	pgsql-performance

Tree view

On Thu, 17 Mar 2005 23:48:30 -0800, Ron Mayer
<rm_pg@cheapcomplexdevices.com> wrote:
>Would this also help estimates in the case where values in a table
>are tightly clustered, though not in strictly ascending or descending
>order?

No, I was just expanding the existing notion of correlation from single
columns to index tuples.

>For example, address data has many fields that are related
>to each other (postal codes, cities, states/provinces).

This looks like a case for cross-column statistics, though you might not
have meant it as such.  I guess what you're talking about can also be
described with a single column.  In a list like

  3 3 ... 3 1 1 ... 1 7 7 ... 7 4 4 ... 4 ...

equal items are "clustered" together but the values are not "correlated"
to their positions.  This would require a whole new column
characteristic, something like the probability that we find the same
value in adjacent heap tuples, or the number of different values we can
expect on one heap page.  The latter might even be easy to compute
during ANALYSE.

Servus
 Manfred

pgsql-performance by date:

From: Hannu Krosing
Date: 18 March 2005, 09:10:12
Subject: Re: One tuple per transaction

From: Manfred Koizar
Date: 18 March 2005, 10:40:07
Subject: Re: multi-column index

Re: multi-column index - Mailing list pgsql-performance

Previous

Next