Hello,
I have made patches that allow PostgreSQL to handle multi-byte
characters. The patches make most commands, including "regexp" and
"like", 8-bit and multi-byte aware. You choose one of the following
encoding schemes at compile time:
1. single-byte 7-bit ASCII/8-bit characters (mostly used in Europe)
2. EUC (Extended Unix Code) for Japanese, Chinese and Korean
3. UNICODE (UTF-8)
4. Mule internal code
I modified only include/regexp, backend/regex and
backend/utils/adt/like.c, so I believe the patches do not affect
other parts of PostgreSQL.
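To illustrate what "multi-byte aware" means for matching, here is a
minimal sketch for the UTF-8 case only. It is not taken from the
patches: the function names utf8_char_len and utf8_strlen are
hypothetical, and it assumes the 1 to 4 byte form of UTF-8. The point
is that code such as "like" must step over a whole character, not a
single byte, when a pattern element matches one character.

```c
#include <stddef.h>

/* Hypothetical helper (not from the patches): byte length of the
 * UTF-8 character that starts with the given lead byte. Invalid lead
 * bytes are treated as one byte so that a scan always makes progress. */
static int utf8_char_len(unsigned char lead)
{
    if (lead < 0x80)
        return 1;               /* 0xxxxxxx: ASCII */
    if ((lead & 0xE0) == 0xC0)
        return 2;               /* 110xxxxx */
    if ((lead & 0xF0) == 0xE0)
        return 3;               /* 1110xxxx */
    if ((lead & 0xF8) == 0xF0)
        return 4;               /* 11110xxx */
    return 1;                   /* stray continuation or invalid byte */
}

/* Hypothetical helper: count characters, not bytes, in a
 * NUL-terminated UTF-8 string. */
static size_t utf8_strlen(const char *s)
{
    size_t n = 0;

    while (*s != '\0')
    {
        int len = utf8_char_len((unsigned char) *s);

        /* Skip the whole character, stopping early if the string is
         * truncated in the middle of a character. */
        while (len-- > 0 && *s != '\0')
            s++;
        n++;
    }
    return n;
}
```

With these helpers, a string holding one three-byte character
followed by "b" counts as two characters although it occupies four
bytes, which is the behaviour a single-character wildcard needs.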
Note that I modified the regexp code that comes with PostgreSQL
rather than borrowing GNU's regex, to avoid license issues.
Should I post the patches for 6.3b now, or wait until after v6.3?
--
Tatsuo Ishii
t-ishii@sra.co.jp