Re: Regexps vs. locale - Mailing list pgsql-hackers

From Andrew Gierth
Subject Re: Regexps vs. locale
Msg-id 87vdtuo9bg.fsf@news-spur.riddles.org.uk
In response to Re: Regexps vs. locale  (Tom Lane <tgl@sss.pgh.pa.us>)
Responses Re: Regexps vs. locale
List pgsql-hackers
>>>>> "Tom" == Tom Lane <tgl@sss.pgh.pa.us> writes:

 > Andrew Gierth <andrew@tao11.riddles.org.uk> writes:
 >> Obviously, this happens because the locale support functions in
 >> backend/regex/regc_locale.c are (presumably intentionally)
 >> crippled so as not to support non-ascii chars, despite all the
 >> code there using wide chars for everything otherwise.

 Tom> It's not so much intentional as that no one has gotten around to
 Tom> making it work.  The difficulty is that the wide-char codes we
 Tom> are using might not match what the <wctype.h> functions expect,
 Tom> and it's unclear what we could do to fix that.


Couldn't we follow the example of lower(), and convert the string to
wchar_t using mbstowcs (rather than pg_wchar_t and pg_mb2wchar)?

This obviously requires that we have a matching lc_ctype for the
encoding, but we insist on that now anyway, no?
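
A minimal sketch of that idea, not actual PostgreSQL code: convert the
multibyte string with mbstowcs() and hand the resulting wchar_t values
to the <wctype.h> classifiers. The helper name count_alpha_mb is
hypothetical, and the sketch assumes the caller has already set
LC_CTYPE (for example with setlocale()) to a locale that matches the
string's encoding.

```c
#include <assert.h>
#include <stdlib.h>
#include <string.h>
#include <wchar.h>
#include <wctype.h>

/*
 * count_alpha_mb (hypothetical helper, for illustration only)
 *
 * Count the alphabetic characters in a multibyte string by converting it
 * to wchar_t with mbstowcs() and classifying each character with
 * iswalpha().  Assumes the current LC_CTYPE matches the encoding of "s".
 *
 * Returns the count, or -1 if the string is not valid in the current
 * locale's encoding or memory could not be allocated.
 */
long
count_alpha_mb(const char *s)
{
    size_t      maxwide;
    size_t      nwide;
    size_t      i;
    long        count = 0;
    wchar_t    *wide;

    assert(s != NULL);

    /* Each wide char consumes at least one byte, so this is always enough. */
    maxwide = strlen(s) + 1;
    wide = malloc(maxwide * sizeof(wchar_t));
    if (wide == NULL)
        return -1;

    /* Convert using the locale's own multibyte-to-wide mapping. */
    nwide = mbstowcs(wide, s, maxwide);
    if (nwide == (size_t) -1)
    {
        free(wide);
        return -1;
    }

    /* These are the codes that the <wctype.h> functions expect. */
    for (i = 0; i < nwide; i++)
    {
        if (iswalpha((wint_t) wide[i]))
            count++;
    }

    free(wide);
    return count;
}
```

Because the wide characters come from mbstowcs() rather than from a
separate internal conversion, they are by construction the values that
iswalpha() and friends expect for the current locale. The price is the
requirement above: LC_CTYPE must match the encoding.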

-- 
Andrew.

