Home > mailing lists

Re: Add index scan progress to pg_stat_progress_vacuum - Mailing list pgsql-hackers

From	Andres Freund
Subject	Re: Add index scan progress to pg_stat_progress_vacuum
Date	January 10, 2023 04:52:47
Msg-id	20230110015247.yspx3hkz2sk6p6u3@awork3.anarazel.de Whole thread Raw
In response to	Re: Add index scan progress to pg_stat_progress_vacuum ("Imseih (AWS), Sami" <simseih@amazon.com>)
Responses	Re: Add index scan progress to pg_stat_progress_vacuum
List	pgsql-hackers

Tree view

Hi,

On 2023-01-07 01:59:40 +0000, Imseih (AWS), Sami wrote:
> --- a/src/backend/access/nbtree/nbtree.c
> +++ b/src/backend/access/nbtree/nbtree.c
> @@ -998,6 +998,8 @@ btvacuumscan(IndexVacuumInfo *info, IndexBulkDeleteResult *stats,
>              if (info->report_progress)
>                  pgstat_progress_update_param(PROGRESS_SCAN_BLOCKS_DONE,
>                                               scanblkno);
> +            if (info->report_parallel_progress && (scanblkno % REPORT_PARALLEL_VACUUM_EVERY_PAGES) == 0)
> +                parallel_vacuum_update_progress(info->parallel_vacuum_state);
>          }
>      }

I still think it's wrong to need multiple pgstat_progress_*() calls within one
scan. If we really need it, it should be abstracted into a helper function
that wrapps all the vacuum progress stuff needed for an index.

> @@ -688,7 +703,13 @@ parallel_vacuum_process_all_indexes(ParallelVacuumState *pvs, int num_index_scan
>       */
>      if (nworkers > 0)
>      {
> -        /* Wait for all vacuum workers to finish */
> +        /*
> +         * Wait for all indexes to be vacuumed while
> +         * updating the parallel vacuum index progress,
> +         * and then wait for all workers to finish.
> +         */
> +        parallel_vacuum_wait_to_finish(pvs);
> +
>          WaitForParallelWorkersToFinish(pvs->pcxt);
>  
>          for (int i = 0; i < pvs->pcxt->nworkers_launched; i++)

I don't think it's a good idea to have two difference routines that wait for
workers to exit.  And certainly not when one of them basically just polls in a
regular interval as parallel_vacuum_wait_to_finish().

I think the problem here is that you're basically trying to work around the
lack of an asynchronous state update mechanism between leader and workers. The
workaround is to add a lot of different places that poll whether there has
been any progress. And you're not doing that by integrating with the existing
machinery for interrupt processing (i.e. CHECK_FOR_INTERRUPTS()), but by
developing a new mechanism.

I think your best bet would be to integrate with HandleParallelMessages().

Greetings,

Andres Freund

pgsql-hackers by date:

From: Peter Geoghegan
Date: 10 January 2023, 04:40:06
Subject: Re: Decoupling antiwraparound autovacuum from special rules around auto cancellation

From: Andres Freund
Date: 10 January 2023, 05:05:01
Subject: Re: postgres_fdw uninterruptible during connection establishment / ProcSignalBarrier

Re: Add index scan progress to pg_stat_progress_vacuum - Mailing list pgsql-hackers

Previous

Next