Home > mailing lists

Index over only uncommon values in table - Mailing list pgsql-general

From	Steven Schlansker
Subject	Index over only uncommon values in table
Date	June 18, 2013 19:17:44
Msg-id	A0FF27CE-2ABC-43C2-88BE-3C284CD8E802@likeness.com Whole thread Raw
Responses	Re: Index over only uncommon values in table Re: Index over only uncommon values in table Re: Index over only uncommon values in table
List	pgsql-general

Tree view

Hi everyone,

I assume this is not easy with standard PG but I wanted to double check.

I have a column that has a very uneven distribution of values.  ~95% of the values will be the same, with some long
tailof another few dozens of values. 

I want to have an index over this value.  Queries that select the most common value will not use the index, because it
isa overwhelming percentage of the table.  This means that ~95% of the disk space and IOPS to maintain the index is
"wasted".

I cannot use a hardcoded partial index because:
1) The common value is not known at schema definition time, and may change (very slowly) over time.
2) JDBC uses prepared statements for everything, and the value to be selected is not known at statement prepare time,
soany partial indices are ignored (this is a really really obnoxious behavior and makes partial indices almost useless
combinedwith prepared statements, sadly…) 

The table size is expected to approach the 0.5 billion row mark within the next few months, hence my eagerness to save
evenseemingly small amounts of per-row costs. 

Curious if anyone has a good way to approach this problem.
Thanks,
Steven

pgsql-general by date:

From: John R Pierce
Date: 18 June 2013, 19:10:13
Subject: Re: earthdistance compass bearing

From: John R Pierce
Date: 18 June 2013, 19:23:47
Subject: Re: Index over only uncommon values in table

Index over only uncommon values in table - Mailing list pgsql-general

Previous

Next