Re: Bug: Reading from single byte character column type may cause out of bounds memory reads. - Mailing list pgsql-hackers

From	Tom Lane
Subject	Re: Bug: Reading from single byte character column type may cause out of bounds memory reads.
Date	July 13, 2022 18:11:34
Msg-id	2231634.1657725094@sss.pgh.pa.us
In response to	Re: Bug: Reading from single byte character column type may cause out of bounds memory reads. (Aleksander Alekseev <aleksander@timescale.com>)
Responses	Re: Bug: Reading from single byte character column type may cause out of bounds memory reads.
List	pgsql-hackers

Aleksander Alekseev <aleksander@timescale.com> writes:
> Although the bug is easy to fix for this particular case (see the
> patch) I'm not sure if this solution is general enough. E.g. is there
> something that generally prevents pg_mblen() from doing out of bound
> reading in cases similar to this one? Should we prevent such an INSERT
> from happening instead?

This is ultimately down to char_text() generating a string that's alleged
to be a valid "text" type value, but it contains illegally-encoded data.
Where we need to fix it is there: if we try to make every single
text-using function be 100% bulletproof against wrongly-encoded data,
we'll still be fixing bugs at the heat death of the universe.

I complained about this in [1], but that thread died off before reaching a
clear consensus about exactly what to do.

            regards, tom lane

[1] https://www.postgresql.org/message-id/flat/2318797.1638558730%40sss.pgh.pa.us
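
Illustration (not part of the original message, and not PostgreSQL source): the sketch below shows why a lone high-bit byte stored as "text" can lead a length lookup to report more bytes than the buffer holds, and what a producer-side fix could look like. All function names are hypothetical. sketch_utf8_mblen() is a simplified stand-in for an encoding-aware length function such as pg_mblen(), and the octal-escape conversion is only one possible way for a char-to-text conversion to avoid emitting invalid data; it is an assumption, not necessarily what the thread settled on.

```c
#include <assert.h>
#include <stddef.h>

/*
 * Simplified stand-in for a UTF-8 length lookup (hypothetical name).
 * It trusts the lead byte and returns how many bytes the character
 * claims to occupy, without looking at how much data actually follows.
 */
static int
sketch_utf8_mblen(const unsigned char *s)
{
    if ((*s & 0x80) == 0)
        return 1;               /* plain ASCII */
    if ((*s & 0xE0) == 0xC0)
        return 2;
    if ((*s & 0xF0) == 0xE0)
        return 3;
    if ((*s & 0xF8) == 0xF0)
        return 4;
    return 1;                   /* invalid lead byte: treat as one byte */
}

/*
 * Consumer-side check: does the claimed character length fit in the
 * bytes that remain?  Every text-using function would need a check like
 * this to be safe against wrongly-encoded input.
 */
static int
sketch_char_fits(const unsigned char *s, size_t remaining)
{
    return remaining > 0 && (size_t) sketch_utf8_mblen(s) <= remaining;
}

/*
 * Producer-side alternative (assumed approach): convert one byte to a
 * text payload that is always validly encoded.  High-bit bytes are
 * written as a backslash plus three octal digits.  Returns the number
 * of bytes written; 'out' must have room for 4 bytes.
 */
static size_t
sketch_char_to_text(unsigned char c, unsigned char *out)
{
    assert(out != NULL);
    if (c & 0x80)
    {
        out[0] = '\\';
        out[1] = (unsigned char) ('0' + ((c >> 6) & 7));
        out[2] = (unsigned char) ('0' + ((c >> 3) & 7));
        out[3] = (unsigned char) ('0' + (c & 7));
        return 4;
    }
    out[0] = c;
    return 1;
}
```

With a one-byte payload holding 0xE4, sketch_utf8_mblen() reports a three-byte character, so any caller that advances by that length steps past the end of the value. Fixing the producer means consumers never see such a payload, which is the trade-off the message argues for over hardening every consumer.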
