Re: Mac OS: invalid byte sequence for encoding "UTF8" - Mailing list pgsql-hackers

From Stas Kelvich
Subject Re: Mac OS: invalid byte sequence for encoding "UTF8"
Date
Msg-id 7E773E60-39C4-4B8D-B691-2FBC62D4BB6E@gmail.com
Whole thread Raw
In response to Re: Mac OS: invalid byte sequence for encoding "UTF8"  (Artur Zakirov <a.zakirov@postgrespro.ru>)
Responses Re: Mac OS: invalid byte sequence for encoding "UTF8"  (Artur Zakirov <a.zakirov@postgrespro.ru>)
List pgsql-hackers
Hi.

I tried that and confirm strange behaviour. It seems that problem with small cyrillic letter ‘х’. (simplest obscene
languagefilter? =) 

That can be reproduced with simpler test

Stas



> On 27 Jan 2016, at 13:59, Artur Zakirov <a.zakirov@postgrespro.ru> wrote:
>
> On 27.01.2016 13:46, Shulgin, Oleksandr wrote:
>>
>> Not sure why the file uses "SET KOI8-R" directive then?
>>
>
> This directive is used only by Hunspell program. PostgreSQL ignores this directive and assumes that input affix and
dictionaryfiles in the UTF-8 encoding. 
>
>>
>>
>> What error message do you get with this test program?  (I don't get any,
>> but I'm not on Mac OS.)
>> --
>> Alex
>>
>>
>
> With this program you will get wrong output. A error message is not called. You can execute the following commands:
>
> > cc test.c -o test
> > ./test
>
> You will get the output:
>
> SFX/Y/?/аться/шутся
>
> Although the output should be:
>
> SFX/Y/хаться/шутся/хаться
>
> --
> Artur Zakirov
> Postgres Professional: http://www.postgrespro.com
> Russian Postgres Company
>
>
> --
> Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
> To make changes to your subscription:
> http://www.postgresql.org/mailpref/pgsql-hackers


Attachment

pgsql-hackers by date:

Previous
From: Erik Rijkers
Date:
Subject: Fwd: [DOCS] pgbench doc typos
Next
From: Ashutosh Bapat
Date:
Subject: Using user mapping OID as hash key for connection hash