Re: character 0xe29986 of encoding "UTF8" has no equivalent in "LATIN2" - Mailing list pgsql-general

From Alvaro Herrera
Subject Re: character 0xe29986 of encoding "UTF8" has no equivalent in "LATIN2"
Date
Msg-id 20090803232305.GH19615@alvh.no-ip.org
Whole thread Raw
In response to Re: character 0xe29986 of encoding "UTF8" has no equivalent in "LATIN2"  (Andreas Kalsch <andreaskalsch@gmx.de>)
List pgsql-general
Andreas Kalsch wrote:

> My question again: Is there a native Postgres solution to simplify
> characters consistently? It means to completely remove all
> diacriticals from Unicode characters.

There's a to_ascii() function but it supports a subset of charsets, and
IIRC UTF8 is not one of them.  Patches welcome.


> I will validate input data on the client side (PHP or Python) and send
> it to the server. Of course the only encoding I will use on any side
> is UTF8. I just wnated to use this Latin thing for simplification of
> characters.

Hmm, seems you're using the wrong tool for that purpose.  Changing to a
different encoding does not remove any diacritical marks, only change
the underlying byte encoding.

--
Alvaro Herrera                                http://www.CommandPrompt.com/
PostgreSQL Replication, Consulting, Custom Development, 24x7 support

pgsql-general by date:

Previous
From: Andreas Kalsch
Date:
Subject: Re: character 0xe29986 of encoding "UTF8" has no equivalent in "LATIN2"
Next
From: Andreas Kalsch
Date:
Subject: Refer to another database