Home > mailing lists

Re: Gsoc2012 idea, tablesample - Mailing list pgsql-hackers

From	Tom Lane
Subject	Re: Gsoc2012 idea, tablesample
Date	May 11, 2012 14:13:51
Msg-id	3460.1336745597@sss.pgh.pa.us Whole thread Raw
In response to	Re: Gsoc2012 idea, tablesample (Florian Pflug <fgp@phlo.org>)
List	pgsql-hackers

Tree view

Florian Pflug <fgp@phlo.org> writes:
> This all hinges on the ability to produce a sufficient accurate estimate of the
> TID density p_tup/p_tid, of course.

I think that's the least of its problems.  AFAICS this analysis ignores
(1) the problem that the TID space is nonuniform, ie we don't know how
many tuples in each page until we look;
(2) the problem that we don't know the overall number of tuples
beforehand.

I'm not sure that there is any way to deal with (1) fully without
examining every single page, but algorithms that assume that the TIDs
are numbered linearly are broken before they start.
        regards, tom lane

pgsql-hackers by date:

From: "Kevin Grittner"
Date: 11 May 2012, 14:03:42
Subject: Re: Gsoc2012 idea, tablesample

From: Bruce Momjian
Date: 11 May 2012, 14:16:10
Subject: Re: Draft release notes complete

Re: Gsoc2012 idea, tablesample - Mailing list pgsql-hackers

Previous

Next