Re: Mac OS: invalid byte sequence for encoding "UTF8" - Mailing list pgsql-hackers

From Chapman Flack
Subject Re: Mac OS: invalid byte sequence for encoding "UTF8"
Date
Msg-id 56BC1A0C.1070104@anastigmatix.net
Whole thread Raw
In response to Re: Mac OS: invalid byte sequence for encoding "UTF8"  (Tom Lane <tgl@sss.pgh.pa.us>)
List pgsql-hackers
On 02/10/16 23:55, Tom Lane wrote:

> Yeah, I got that --- what seems squishier is that none of the other C1
> control characters are considered whitespace?

That seems to be exactly the case:

http://www.unicode.org/Public/5.2.0/ucd/PropList.txt

09..0D, 20, 85, and A0 are the only whitespace chars whose codepoints
fit in a byte.

-Chap



pgsql-hackers by date:

Previous
From: Tom Lane
Date:
Subject: Re: Mac OS: invalid byte sequence for encoding "UTF8"
Next
From: Vitaly Burovoy
Date:
Subject: Re: custom function for converting human readable sizes to bytes