Re: Unicode and unaccent() - Mailing list pgsql-general

From Daniel Verite
Subject Re: Unicode and unaccent()
Date
Msg-id 20050506111123.1377220@localhost
Whole thread Raw
In response to Unicode and unaccent()  ("Mark Borins" <mark.borins@rigadev.com>)
List pgsql-general
    Mark Borins wrote:

> The encoding on my DB is Unicode, so far I have found an unaccent() function
> by looking in the mail archives it looks like the following:
>
>
> CREATE FUNCTION unaccent(text) RETURNS text AS $$
>    BEGIN
>        RETURN translate($1, '\342\347\350\351\352\364\373', 'aceeeou')
>  ;  END;  $$ LANGUAGE plpgsql IMMUTABLE STRICT;
>
> My problem is that the values like \342 are for LATIN1 type encoding.  I

Why wouldn't this:
   RETURN translate($1, 'éçàêè...', 'ecaee...') ;
work just fine? It's even portable across encodings.

--
 Daniel
 PostgreSQL-powered mail user agent and storage: http://www.manitou-mail.org


pgsql-general by date:

Previous
From: Peter Wilson
Date:
Subject: Re: Slony v. DBMirror
Next
From: CSN
Date:
Subject: plphp1.1 make fails