Thread: Czech character broken?
On our page at http://www.postgresql.org/community/international we have the following: Informace o PostgreSQL v četině My validator complains that is an invalid numeric entity in UTF-8. Can someone who knows what it should be let us know what the correct entity is? //Magnus
On Tue, Feb 13, 2007 at 10:06:47AM +0100, Magnus Hagander wrote: > On our page at http://www.postgresql.org/community/international we have the following: > Informace o PostgreSQL v četině > > My validator complains that is an invalid numeric entity in > UTF-8. Can someone who knows what it should be let us know what the > correct entity is? I don't know for sure since I don't speak Czech, but I'd guess the correct entity is š <U+0161 LATIN SMALL LETTER S WITH CARON>. The Windows-1250 mapping of that character is 154 (0x9a). -- Michael Fuhr
Michael Fuhr wrote: > On Tue, Feb 13, 2007 at 10:06:47AM +0100, Magnus Hagander wrote: >> On our page at http://www.postgresql.org/community/international we have the following: >> Informace o PostgreSQL v četině >> >> My validator complains that is an invalid numeric entity in >> UTF-8. Can someone who knows what it should be let us know what the >> correct entity is? > > I don't know for sure since I don't speak Czech, but I'd guess the > correct entity is š <U+0161 LATIN SMALL LETTER S WITH CARON>. > The Windows-1250 mapping of that character is 154 (0x9a). Seems to be correct - at least my firefox on Windows still shows the same character after fixing that. Thanks. //Magnus
On Sun, Feb 18, 2007 at 06:30:14PM +0100, Magnus Hagander wrote: > Michael Fuhr wrote: > > I don't know for sure since I don't speak Czech, but I'd guess the > > correct entity is š <U+0161 LATIN SMALL LETTER S WITH CARON>. > > The Windows-1250 mapping of that character is 154 (0x9a). > > Seems to be correct - at least my firefox on Windows still shows the > same character after fixing that. That character is 154 in Windows-1252 as well. I'd guess that some browsers, upon finding a C1 control character (128-159 / 0x80-0x9f), assume that it's really a graphical character from one of the Windows encodings and convert it appropriately. -- Michael Fuhr