Re: Avoid detoast overhead when possible - Mailing list pgsql-hackers

From zhihuifan1213@163.com
Subject Re: Avoid detoast overhead when possible
Date
Msg-id 87a5qqhzka.fsf@163.com
Whole thread Raw
In response to Re: Avoid detoast overhead when possible  (Matthias van de Meent <boekewurm+postgres@gmail.com>)
Responses Re: Avoid detoast overhead when possible
List pgsql-hackers
Hi,

Matthias van de Meent <boekewurm+postgres@gmail.com> writes:

> On Mon, 4 Dec 2023 at 07:56, <zhihuifan1213@163.com> wrote:

> ..It would also add overhead when
> we write results to disk, such as spilling merge sorts, hash join
> spills, or CTE materializations.
>
> Could you find a way to reduce this memory and IO usage when the value
> is not going to be used immediately? Using the toast pointer at such
> points surely will be cheaper than storing the full value again and
> again.

I'm not sure I understand you correctly, I think the issue you raised
here is covered by the below design (not implemented in the patch).

"
However this patch just throws away almost all the benefits of toast, so
how can we draw a line between should vs should not do this code path?
IMO, we should only run the 'eagerly detoast' when we know that we will
have a FuncCall against the toast_col on **the current plan node**. I
think this information can be get from Qual and TargetList. If so, we
can set the slot->detoast_attrs accordingly.
"

Let's see an example of this:

SELECT f(t1.toastable_col) FROM t1 join t2 using(c);

Suppose it is using hash join and t1 should be hashed.  With the above
design, we will NOT detoast toastable_col at the scan of t1 or hash t1
since there is no one "funcall" access it in either SeqScan of t1 or
hash (t1). But when we do the projection on the joinrel, the detoast
would happen.

I'm still working on how to know if a toast_col will be detoast for a
given PlanState. If there is no design error, I think I can work out a
version tomorrow.

-- 
Best Regards
Andy Fan




pgsql-hackers by date:

Previous
From: Isaac Morland
Date:
Subject: Re: Should REINDEX be listed under DDL?
Next
From: Joe Conway
Date:
Subject: Re: Emitting JSON to file using COPY TO