Home > mailing lists

pgsql: Get pg_utf_mblen(), pg_utf2wchar_with_len(), and utf2ucs() all on - Mailing list pgsql-committers

From	tgl@postgresql.org (Tom Lane)
Subject	pgsql: Get pg_utf_mblen(), pg_utf2wchar_with_len(), and utf2ucs() all on
Date	January 24, 2007 13:29:14
Msg-id	20070124171223.DB9E59FB61C@postgresql.org Whole thread Raw
List	pgsql-committers

Tree view

Log Message:
-----------
Get pg_utf_mblen(), pg_utf2wchar_with_len(), and utf2ucs() all on the same
page about the maximum UTF8 sequence length we support (4 bytes since 8.1,
3 before that).  pg_utf2wchar_with_len never got updated to support 4-byte
characters at all, and in any case had a buffer-overrun risk in that it
could produce multiple pg_wchars from what mblen claims to be just one UTF8
character.  The only reason we don't have a major security hole is that most
callers allocate worst-case output buffers; the sole exception in released
versions appears to be pre-8.2 iwchareq() (ie, ILIKE), which can be crashed
due to zeroing out its return address --- but AFAICS that can't be exploited
for anything more than a crash, due to inability to control what gets written
there.  Per report from James Russell and Michael Fuhr.

Pre-8.1 the risk is much less, but I still think pg_utf2wchar_with_len's
behavior given an incomplete final character risks buffer overrun, so
back-patch that logic change anyway.

This patch also makes sure that UTF8 sequences exceeding the supported
length (whichever it is) are consistently treated as error cases, rather
than being treated like a valid shorter sequence in some places.

Tags:
----
REL8_2_STABLE

Modified Files:
--------------
    pgsql/src/backend/utils/mb:
        wchar.c (r1.58 -> r1.58.2.1)
        (http://developer.postgresql.org/cvsweb.cgi/pgsql/src/backend/utils/mb/wchar.c.diff?r1=1.58&r2=1.58.2.1)

pgsql-committers by date:

From: tgl@postgresql.org (Tom Lane)
Date: 23 January 2007, 21:26:11
Subject: pgsql: Relax an Assert() that has been found to be too strict in some

From: tgl@postgresql.org (Tom Lane)
Date: 24 January 2007, 13:29:18
Subject: pgsql: Get pg_utf_mblen(), pg_utf2wchar_with_len(), and utf2ucs() all on

pgsql: Get pg_utf_mblen(), pg_utf2wchar_with_len(), and utf2ucs() all on - Mailing list pgsql-committers

Previous

Next