Thread: First draft of back-branch release notes is done

First draft of back-branch release notes is done

From

Tom Lane

Date:

03 February 2023, 19:32:39

... at

https://git.postgresql.org/gitweb/?p=postgresql.git;a=commitdiff;h=f282b026787da69d88a35404cf62f1cc21cfbb7c

As usual, please send corrections/comments by Sunday.

            regards, tom lane

Re: First draft of back-branch release notes is done

From

"Jonathan S. Katz"

Date:

04 February 2023, 15:27:49

> On Feb 4, 2023, at 10:24 AM, Tom Lane <tgl@sss.pgh.pa.us> wrote:
>
> ... at
>
> https://git.postgresql.org/gitweb/?p=postgresql.git;a=commitdiff;h=f282b026787da69d88a35404cf62f1cc21cfbb7c
>
> As usual, please send corrections/comments by Sunday.

While reviewing for the release announcement, I noticed this (abbreviated as I’m on my mobile):

“Prevent clobbering of cached parsetrees…Bad things could happen if…”

While I chuckled over the phrasing, I’m left to wonder what the “bad things” are, in case I
need to check an older version to see if I’m susceptible to this bug.

Thanks,

Jonathan

Re: First draft of back-branch release notes is done

From

Alvaro Herrera

Date:

05 February 2023, 12:10:47

On 2023-Feb-03, Tom Lane wrote:

> ... at
> 
> https://git.postgresql.org/gitweb/?p=postgresql.git;a=commitdiff;h=f282b026787da69d88a35404cf62f1cc21cfbb7c
> 
> As usual, please send corrections/comments by Sunday.

      Fix edge-case data corruption in shared tuplestores (Dmitry Astapov)

      If the final chunk of a large tuple being written out to disk was
      exactly 32760 bytes, it would be corrupted due to a fencepost bug.
      This is a hazard for parallelized plans that require a tuplestore,
      such as parallel hash join.  The query would typically fail later
      with corrupted-data symptoms.

I think this sounds really scary, because people are going to think that
their stored data can get corrupted -- they don't necessarily know what
a "shared tuplestore" is.  Maybe "Avoid query failures in parallel hash
joins" as headline?

-- 
Álvaro Herrera        Breisgau, Deutschland  —  https://www.EnterpriseDB.com/
"The Gord often wonders why people threaten never to come back after they've
been told never to return" (www.actsofgord.com)

Re: First draft of back-branch release notes is done

From

Tom Lane

Date:

05 February 2023, 19:57:39

Alvaro Herrera <alvherre@alvh.no-ip.org> writes:
> On 2023-Feb-03, Tom Lane wrote:
>>       Fix edge-case data corruption in shared tuplestores (Dmitry Astapov)

> I think this sounds really scary, because people are going to think that
> their stored data can get corrupted -- they don't necessarily know what
> a "shared tuplestore" is.  Maybe "Avoid query failures in parallel hash
> joins" as headline?

Hmmm ... are we sure it *can't* lead to corruption of stored data,
if this happens during an INSERT or UPDATE plan?  I'll grant that
such a case seems pretty unlikely though, as the bogus data
retrieved from the tuplestore would have to not cause a failure
within the query before it can get written out.

Also, aren't shared tuplestores used in more places than just
parallel hash join?  I mentioned that as an example, not an
exhaustive list of trouble spots.

            regards, tom lane

Re: First draft of back-branch release notes is done

From

Tom Lane

Date:

05 February 2023, 20:01:33

"Jonathan S. Katz" <jkatz@postgresql.org> writes:
> On Feb 4, 2023, at 10:24 AM, Tom Lane <tgl@sss.pgh.pa.us> wrote:
>> “Prevent clobbering of cached parsetrees…Bad things could happen if…”

> While I chuckled over the phrasing, I’m left to wonder what the “bad things” are, in case I
> need to check an older version to see if I’m susceptible to this bug.

Fair.  I was trying to avoid committing to specific consequences.
The assertion failure seen in the original report (#17702) wouldn't
occur for typical users, but they might see crashes or "unexpected node
type" failures.  Maybe we can say that instead.

            regards, tom lane

Re: First draft of back-branch release notes is done

From

Tom Lane

Date:

05 February 2023, 20:09:23

I wrote:
> Alvaro Herrera <alvherre@alvh.no-ip.org> writes:
>> I think this sounds really scary, because people are going to think that
>> their stored data can get corrupted -- they don't necessarily know what
>> a "shared tuplestore" is.  Maybe "Avoid query failures in parallel hash
>> joins" as headline?

Maybe less scary if we make it clear we're talking about a temporary file?

     <para>
      Fix edge-case corruption of temporary data within shared tuplestores
      (Dmitry Astapov)
     </para>

     <para>
      If the final chunk of a large tuple being written out to a temporary
      file was exactly 32760 bytes, it would be corrupted due to a
      fencepost bug.  This is a hazard for parallelized plans that require
      a tuplestore, such as parallel hash join.  The query would typically
      fail later with corrupted-data symptoms.
     </para>

            regards, tom lane

Re: First draft of back-branch release notes is done

From

Thomas Munro

Date:

05 February 2023, 20:37:10

On Mon, Feb 6, 2023 at 8:57 AM Tom Lane <tgl@sss.pgh.pa.us> wrote:
> Alvaro Herrera <alvherre@alvh.no-ip.org> writes:
> > On 2023-Feb-03, Tom Lane wrote:
> >>       Fix edge-case data corruption in shared tuplestores (Dmitry Astapov)
>
> > I think this sounds really scary, because people are going to think that
> > their stored data can get corrupted -- they don't necessarily know what
> > a "shared tuplestore" is.  Maybe "Avoid query failures in parallel hash
> > joins" as headline?
>
> Hmmm ... are we sure it *can't* lead to corruption of stored data,
> if this happens during an INSERT or UPDATE plan?  I'll grant that
> such a case seems pretty unlikely though, as the bogus data
> retrieved from the tuplestore would have to not cause a failure
> within the query before it can get written out.

Agreed.  I think you have to be quite unlucky to hit this in the first
place (very large tuples with very particular alignment), and then
you'd be highly likely to fail in some way due to corrupted tuple
size, making permanent corruption extremely unlikely.

> Also, aren't shared tuplestores used in more places than just
> parallel hash join?  I mentioned that as an example, not an
> exhaustive list of trouble spots.

Shared file sets (= a directory of temp files with automatic cleanup)
are used by more things, but shared tuplestores (= a shared file set
with a tuple-oriented interface on top, to support partial scan) are
currently only used by PHJ.

Re: First draft of back-branch release notes is done

From

Tom Lane

Date:

05 February 2023, 20:44:13

Thomas Munro <thomas.munro@gmail.com> writes:
> On Mon, Feb 6, 2023 at 8:57 AM Tom Lane <tgl@sss.pgh.pa.us> wrote:
>> Also, aren't shared tuplestores used in more places than just
>> parallel hash join?  I mentioned that as an example, not an
>> exhaustive list of trouble spots.

> Shared file sets (= a directory of temp files with automatic cleanup)
> are used by more things, but shared tuplestores (= a shared file set
> with a tuple-oriented interface on top, to support partial scan) are
> currently only used by PHJ.

Oh, okay.  I'll change it to say "corruption within parallel hash
join", then, and not use the word "tuplestore" at all.

            regards, tom lane

Re: First draft of back-branch release notes is done

From

"Jonathan S. Katz"

Date:

06 February 2023, 01:46:15

On 2/5/23 3:01 PM, Tom Lane wrote:
> "Jonathan S. Katz" <jkatz@postgresql.org> writes:
>> On Feb 4, 2023, at 10:24 AM, Tom Lane <tgl@sss.pgh.pa.us> wrote:
>>> “Prevent clobbering of cached parsetrees…Bad things could happen if…”
> 
>> While I chuckled over the phrasing, I’m left to wonder what the “bad things” are, in case I
>> need to check an older version to see if I’m susceptible to this bug.
> 
> Fair.  I was trying to avoid committing to specific consequences.
> The assertion failure seen in the original report (#17702) wouldn't
> occur for typical users, but they might see crashes or "unexpected node
> type" failures.  Maybe we can say that instead.

I did a quick readthrough of #17702. Your proposal sounds reasonable.

Based on that explanation and reading #17702, I'm still not sure if this 
will make the cut in the release announcement itself, but +1 for 
modifying it in the release notes.

Thanks,

Jonathan

Attachment

OpenPGP_signature

Re: First draft of back-branch release notes is done

From

Tom Lane

Date:

06 February 2023, 02:39:13

"Jonathan S. Katz" <jkatz@postgresql.org> writes:
> On 2/5/23 3:01 PM, Tom Lane wrote:
>> Fair.  I was trying to avoid committing to specific consequences.
>> The assertion failure seen in the original report (#17702) wouldn't
>> occur for typical users, but they might see crashes or "unexpected node
>> type" failures.  Maybe we can say that instead.

> I did a quick readthrough of #17702. Your proposal sounds reasonable.

> Based on that explanation and reading #17702, I'm still not sure if this 
> will make the cut in the release announcement itself, but +1 for 
> modifying it in the release notes.

The notes now say

     <para>
      Prevent clobbering of cached parsetrees for utility statements in
      SQL functions (Tom Lane, Daniel Gustafsson)
     </para>

     <para>
      If a SQL-language function executes the same utility command more
      than once within a single calling query, it could crash or report
      strange errors such as <quote>unrecognized node type</quote>.
     </para>

            regards, tom lane

Re: First draft of back-branch release notes is done

From

"Jonathan S. Katz"

Date:

06 February 2023, 03:08:44

On 2/5/23 9:39 PM, Tom Lane wrote:

> 
>       <para>
>        Prevent clobbering of cached parsetrees for utility statements in
>        SQL functions (Tom Lane, Daniel Gustafsson)
>       </para>
> 
>       <para>
>        If a SQL-language function executes the same utility command more
>        than once within a single calling query, it could crash or report
>        strange errors such as <quote>unrecognized node type</quote>.
>       </para>
> 
>             regards, tom lane

+1. Thanks!

Jonathan

Attachment

OpenPGP_signature

Re: First draft of back-branch release notes is done

From

Justin Pryzby

Date:

08 February 2023, 22:08:35

On Fri, Feb 03, 2023 at 02:32:39PM -0500, Tom Lane wrote:
> ... at
> 
> https://git.postgresql.org/gitweb/?p=postgresql.git;a=commitdiff;h=f282b026787da69d88a35404cf62f1cc21cfbb7c
> 
> As usual, please send corrections/comments by Sunday.

It's of no concern, but I was curious why this one wasn't included:

commit 72aea955d49712a17c08748aa9abcbcf98c32fc5
Author: Thomas Munro <tmunro@postgresql.org>
Date:   Fri Jan 6 16:38:46 2023 +1300

    Fix pg_truncate() on Windows.
    
    Commit 57faaf376 added pg_truncate(const char *path, off_t length), but
    "length" was ignored under WIN32 and the file was unconditionally
    truncated to 0.
    
    There was no live bug, since the only caller passes 0.
    
    Fix, and back-patch to 14 where the function arrived.

Re: First draft of back-branch release notes is done

From

Tom Lane

Date:

08 February 2023, 22:22:04

Justin Pryzby <pryzby@telsasoft.com> writes:
> It's of no concern, but I was curious why this one wasn't included:

> commit 72aea955d49712a17c08748aa9abcbcf98c32fc5
> Author: Thomas Munro <tmunro@postgresql.org>
> Date:   Fri Jan 6 16:38:46 2023 +1300

>     Fix pg_truncate() on Windows.
    
>     Commit 57faaf376 added pg_truncate(const char *path, off_t length), but
>     "length" was ignored under WIN32 and the file was unconditionally
>     truncated to 0.
    
>     There was no live bug, since the only caller passes 0.

I concluded that due to the lack of live bug, this would not be of
interest to end users.  The back-patch was just for future-proofing.

            regards, tom lane