
Re: vacuum, performance, and MVCC, and compression - Mailing list pgsql-hackers

From	PFC
Subject	Re: vacuum, performance, and MVCC, and compression
Date	June 26, 2006 12:41:46
Msg-id	op.tbrfaonicigqcu@apollo13
In response to	Re: vacuum, performance, and MVCC (Bruce Momjian <bruce@momjian.us>)
Responses	Re: vacuum, performance, and MVCC, and compression
List	pgsql-hackers


There has been some talk lately about compression. With a bit of lateral
thinking, I guess it could be used to contain the bloat induced by updates.
Of course, this is just my hypothesis.

Compression in indexes:
Instead of storing (value, tuple identifier) keys in the indexes, store
(value, [tuple identifier list]); i.e. all tuples which have the same
indexed value are referenced by the same index tuple, instead of having
one index tuple per actual tuple.

The length of the list would of course be limited to the space actually
available on an index page; if many rows have the same indexed value,
several index tuples would be generated so that index tuples fit on index
pages.

This would make the index smaller (more likely to fit in RAM) at the cost
of a little CPU overhead for index modifications, but would make index
scans use less CPU (no need to compare the indexed value for each
table tuple).
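
To make the idea concrete, here is a rough Python sketch of the grouping, not
actual index code. The names build_index_tuples, lookup and
max_tids_per_tuple are made up for illustration; max_tids_per_tuple is only a
stand-in for "whatever fits on an index page", and tuple identifiers are
modelled as (block, offset) pairs.

```python
from collections import defaultdict


def build_index_tuples(entries, max_tids_per_tuple):
    """Group (value, tid) pairs into (value, [tid, ...]) index tuples.

    max_tids_per_tuple is a hypothetical stand-in for the space available
    on an index page: a value with more tids than that is split across
    several index tuples.
    """
    if max_tids_per_tuple < 1:
        raise ValueError("max_tids_per_tuple must be at least 1")

    tids_by_value = defaultdict(list)
    for value, tid in entries:
        tids_by_value[value].append(tid)

    index_tuples = []
    for value in sorted(tids_by_value):
        tids = sorted(tids_by_value[value])
        # Emit one index tuple per chunk so that each one "fits on a page".
        for start in range(0, len(tids), max_tids_per_tuple):
            chunk = tids[start:start + max_tids_per_tuple]
            index_tuples.append((value, chunk))
    return index_tuples


def lookup(index_tuples, value):
    """Return every tid stored for value (linear scan, for clarity only)."""
    found = []
    for indexed_value, tids in index_tuples:
        # The value is compared once per index tuple, not once per row.
        if indexed_value == value:
            found.extend(tids)
    return found


entries = [("a", (1, 1)), ("b", (1, 2)), ("a", (2, 1)), ("a", (3, 4))]
index = build_index_tuples(entries, max_tids_per_tuple=2)
```

With a limit of 2, the three "a" rows end up in two index tuples instead of
three, and a lookup compares the key twice rather than three times.
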

Compression in data pages:
The article that circulated on the list suggested several types of
compression: offset, dictionary, etc. The point is that several row
versions on the same page can be compressed well because these versions
probably have similar column values.
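
As a toy illustration of why similar row versions compress well, here is a
Python sketch that stores the first version on a page in full and every
later version as the set of columns that differ from it. This is a simple
per-column delta that I am assuming for the example, not necessarily one of
the schemes from the article, and all names (delta_encode, compress_page,
etc.) are hypothetical.

```python
def delta_encode(base, version):
    """Return {column_index: value} for the columns where version != base."""
    if len(base) != len(version):
        raise ValueError("row versions must have the same number of columns")
    return {i: new for i, (old, new) in enumerate(zip(base, version)) if old != new}


def delta_decode(base, delta):
    """Rebuild a full row version from the base row and its delta."""
    row = list(base)
    for column_index, value in delta.items():
        row[column_index] = value
    return tuple(row)


def compress_page(versions):
    """Keep the first version whole; store the others as deltas against it."""
    base = versions[0]
    return base, [delta_encode(base, v) for v in versions[1:]]


def decompress_page(base, deltas):
    """Inverse of compress_page."""
    return [base] + [delta_decode(base, d) for d in deltas]


# Three versions of the same row, as successive updates might leave them.
versions = [
    (1, "alice", "paris", 10),
    (1, "alice", "paris", 11),
    (1, "alice", "lyon", 11),
]
base, deltas = compress_page(versions)
```

Here the two later versions shrink to one and two changed columns
respectively, which is the kind of saving one would hope for when an update
touches only a few columns.
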
Just a thought...
