Re: [GENERAL] Czech2ASCII with --mb=Latin2 - Mailing list pgsql-general

From Peter Eisentraut
Subject Re: [GENERAL] Czech2ASCII with --mb=Latin2
Date
Msg-id Pine.LNX.4.21.9912161929560.5199-100000@localhost.localdomain
Whole thread Raw
In response to Czech2ASCII with --mb=Latin2  (Robert <robert@robert.cz>)
List pgsql-general
On 1999-12-15, Robert mentioned:

>   I have a database in Latin2 encoding (Czech stuff) and Latin2/Win1250
> on-the-fly recoding with 'set client_encoding' works smoothly. Now, when
> I set client encoding to SQL_ASCII, accented characters are converted to
> (hexa) codes. Is there any (simple) way to make this recoding convert
> accented characters to just the chars themselves but without accents?

I think this sort of thing has been the dream of many folks using
internationalized software, but it's not that easy. Perhaps one could
write a function that does this sort of conversion, which would have to
keep a gigantic table internally.

However, perhaps in your language it's customary to just leave off the
diacritic marks if they're not available, but in other languages such as
Swedish or German there are rules about converting those to sequences of
other letters. And if you start encoding rules of natural languages into
software, oh boy ...

--
Peter Eisentraut                  Sernanders väg 10:115
peter_e@gmx.net                   75262 Uppsala
http://yi.org/peter-e/            Sweden



************


pgsql-general by date:

Previous
From: admin
Date:
Subject: multiple key indices vs single key
Next
From: Peter Eisentraut
Date:
Subject: Re: [HACKERS] \copy problem