Re: Per-column collation - Mailing list pgsql-hackers

From Pavel Stehule
Subject Re: Per-column collation
Date
Msg-id AANLkTi=4ZaL3L0vL-uCrHQqL5u=BQzNoZ75gqCGN7YQ5@mail.gmail.com
Whole thread Raw
In response to Per-column collation  (Peter Eisentraut <peter_e@gmx.net>)
Responses Re: Per-column collation
List pgsql-hackers
Hello

I am checking a patch. I found a problem with initdb

[postgres@pavel-stehule postgresql]$ /usr/local/pgsql/bin/initdb -D
/usr/local/pgsql/data/
could not change directory to "/home/pavel/src/postgresql"
The files belonging to this database system will be owned by user "postgres".
This user must also own the server process.

The database cluster will be initialized with locale en_US.utf8.
The default database encoding has accordingly been set to UTF8.
The default text search configuration will be set to "english".

fixing permissions on existing directory /usr/local/pgsql/data ... ok
creating subdirectories ... ok
selecting default max_connections ... 100
selecting default shared_buffers ... 24MB
creating configuration files ... ok
creating template1 database in /usr/local/pgsql/data/base/1 ... ok
initializing pg_authid ... ok
initializing dependencies ... ok
creating system views ... ok
loading system objects' descriptions ... ok
creating collations ...initdb: locale name has non-ASCII characters,
skipped: bokm�linitdb: locale name has non-ASCII characters, skipped:
fran�aiscould not determine encoding for locale "hy_AM.armscii8":
codeset is "ARMSCII-8"
could not determine encoding for locale "ka_GE": codeset is "GEORGIAN-PS"
could not determine encoding for locale "ka_GE.georgianps": codeset is
"GEORGIAN-PS"
could not determine encoding for locale "kk_KZ": codeset is "PT154"
could not determine encoding for locale "kk_KZ.pt154": codeset is "PT154"
could not determine encoding for locale "tg_TJ": codeset is "KOI8-T"
could not determine encoding for locale "tg_TJ.koi8t": codeset is "KOI8-T"
could not determine encoding for locale "thai": codeset is "TIS-620"
could not determine encoding for locale "th_TH": codeset is "TIS-620"
could not determine encoding for locale "th_TH.tis620": codeset is "TIS-620"
could not determine encoding for locale "vi_VN.tcvn": codeset is "TCVN5712-1"
FATAL:  invalid byte sequence for encoding "UTF8": 0xe56c27
child process exited with exit code 1
initdb: removing contents of data directory "/usr/local/pgsql/data

tested on fedora 13

[postgres@pavel-stehule local]$ locale -a| wc -l
731

Regards

Pavel Stehule



2010/11/15 Peter Eisentraut <peter_e@gmx.net>:
> Here is the next patch in this epic series. [0]
>
> I have addressed most of the issues pointed out in previous reviews and
> removed all major outstanding problems that were marked in the code.  So
> it might just almost really work.
>
> The documentation now also covers everything that's interesting, so
> newcomers can start with that.
>
>
> For those who have previously reviewed this, two major changes:
>
> * The locales to be loaded are now computed by initdb, no longer during
> the build process.
>
> * The regression test file has been removed from the main test set.  To
> run it, use
>
> make check MULTIBYTE=UTF8 EXTRA_TESTS=collate
>
>
> Stuff that still cannot be expected to work:
>
> * no CREATE COLLATION yet, maybe later
>
> * no support for regular expression searches
>
> * not text search support
>
> These would not be release blockers, I think.
>
>
> [0] http://archives.postgresql.org/message-id/1284583568.4696.20.camel@vanquo.pezone.net
>
>
> --
> Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
> To make changes to your subscription:
> http://www.postgresql.org/mailpref/pgsql-hackers
>
>


pgsql-hackers by date:

Previous
From: Itagaki Takahiro
Date:
Subject: How to rename each field in ROW expression?
Next
From: Greg Stark
Date:
Subject: Amazon now supporting GPU focused EC2 instances