Re: Enforcing database encoding and locale match - Mailing list pgsql-hackers

From Tom Lane
Subject Re: Enforcing database encoding and locale match
Date
Msg-id 17375.1191007865@sss.pgh.pa.us
Whole thread Raw
In response to Re: Enforcing database encoding and locale match  (Andrew Dunstan <andrew@dunslane.net>)
Responses Re: Enforcing database encoding and locale match
List pgsql-hackers
Andrew Dunstan <andrew@dunslane.net> writes:
> Gregory Stark wrote:
>> "Tom Lane" <tgl@sss.pgh.pa.us> writes:
>>> Another possibility is to treat the case as a WARNING if you're
>>> superuser and an ERROR if you're not.  This would satisfy people
>>> who are uncomfortable with the idea that CREATEDB privilege comes
>>> with a built-in denial-of-service attack, while still leaving a
>>> loophole for anyone for whom the test didn't work properly.
>> 
>> That sounds like a good combination
> +1

After further experimentation I want to change the proposal a bit.
AFAICS, if we recognize the nl_langinfo(CODESET) result, there is
no reason not to trust the answer, so we might as well throw an
error always.  The case that is problematic is where we can get a
CODESET string but we don't recognize it.  In this case it seems
appropriate to do
   ereport(WARNING,           (errmsg("could not determine encoding for locale \"%s\": codeset is \"%s\"",
    ctype, sys),            errdetail("Please report this to <pgsql-bugs@postgresql.org>.")));
 

and then let the user do what he wants.

There need to be two exceptions to the error-on-mismatch policy.

First off, if the locale is C/POSIX then we can allow any encoding.

Second, it appears that we have to allow SQL_ASCII encoding to be
selected regardless of locale; if we don't, the "make installcheck"
regression tests fail, because they try to do exactly that; and I'm
sure that there are other users out there who don't (think they)
care about encoding.  This is not quite as bad as the generic mismatch
case, because the backend will never try to do encoding conversion
and so the recursive-error panic can't happen.  But you could still
have unexpected sorting behavior and probably index corruption.

What I propose is that we allow SQL_ASCII databases to be created
when the locale is not C, but only by superusers.

Comments?
        regards, tom lane


pgsql-hackers by date:

Previous
From: Bruce Momjian
Date:
Subject: Re: Hash index todo list item
Next
From: Zdenek Kotala
Date:
Subject: Re: Enforcing database encoding and locale match