Re: [HACKERS] CLUSTER command progress monitor - Mailing list pgsql-hackers

From	Robert Haas
Subject	Re: [HACKERS] CLUSTER command progress monitor
Date	November 21, 2017 23:55:23
Msg-id	CA+TgmoYQ_sF8oZjtfa8x0dVyjfSA_yFwEQCjEwQjv3G64zP6=w@mail.gmail.com
In response to	Re: [HACKERS] CLUSTER command progress monitor (Tom Lane <tgl@sss.pgh.pa.us>)
Responses	Re: [HACKERS] CLUSTER command progress monitor Re: [HACKERS] CLUSTER command progress monitor
List	pgsql-hackers

On Mon, Nov 20, 2017 at 12:25 PM, Tom Lane <tgl@sss.pgh.pa.us> wrote:
> Antonin Houska <ah@cybertec.at> writes:
>> Robert Haas <robertmhaas@gmail.com> wrote:
>>> These two phases overlap, though. I believe progress reporting for
>>> sorts is really hard.
>
>> Whatever complexity is hidden in the sort, cost_sort() should have taken it
>> into consideration when called via plan_cluster_use_sort(). Thus I think that
>> once we have both startup and total cost, the current progress of the sort
>> stage can be estimated from the current number of input and output
>> rows. Please remind me if my proposal appears to be too simplistic.
>
> Well, even if you assume that the planner's cost model omits nothing
> (which I wouldn't bet on), its result is only going to be as good as the
> planner's estimate of the number of rows to be sorted.  And, in cases
> where people actually care about progress monitoring, it's likely that
> the planner got that wrong, maybe horribly so.  I think it's a bad idea
> for progress monitoring to depend on the planner's estimates in any way
> whatsoever.

I agree.

I have been of the opinion all along that progress monitoring needs to
report facts, not theories.  The number of tuples read thus far is a
fact, and is fine to report for whatever value it may have to someone.
The number of tuples that will be read in the future is a theory, and
as you say, progress monitoring is most likely to be used in cases
where theory and practice ended up being very different.
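
The distinction above can be illustrated with a minimal sketch. This is
not PostgreSQL code: the ClusterProgress class, its counters, and the
estimated_percent() helper are hypothetical names chosen for
illustration, and the row counts are made up. It assumes a reporter
that exposes only counters it has actually observed, and contrasts that
with a percentage derived from a planner row estimate.

```python
from dataclasses import dataclass


@dataclass
class ClusterProgress:
    """Hypothetical progress record; names are illustrative, not PostgreSQL's."""

    tuples_read: int = 0     # fact: incremented as each tuple is scanned
    tuples_written: int = 0  # fact: incremented as each tuple is emitted

    def on_tuple_read(self) -> None:
        self.tuples_read += 1

    def on_tuple_written(self) -> None:
        self.tuples_written += 1

    def report(self) -> dict:
        # Only observed counters are exposed; nothing here depends on
        # an estimate of how many tuples remain.
        return {
            "tuples_read": self.tuples_read,
            "tuples_written": self.tuples_written,
        }


def estimated_percent(progress: ClusterProgress, planner_row_estimate: int) -> float:
    """The 'theory' variant: only as good as planner_row_estimate."""
    return 100.0 * progress.tuples_read / planner_row_estimate


progress = ClusterProgress()
for _ in range(5000):
    progress.on_tuple_read()

facts = progress.report()

# Assume the planner guessed 1000 rows for a table that holds many more.
misleading = estimated_percent(progress, planner_row_estimate=1000)
```

With a row estimate that is too low, the derived percentage runs past
100 while the raw counters remain correct, which is the case where
someone would be watching the progress report in the first place.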

-- 
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company
