Re: Bug in UTF8-Validation Code? - Mailing list pgsql-hackers

From Bruce Momjian
Subject Re: Bug in UTF8-Validation Code?
Date
Msg-id 200703222054.l2MKsvq08654@momjian.us
Whole thread Raw
In response to Bug in UTF8-Validation Code?  (Mario Weilguni <mweilguni@sime.com>)
Responses Re: Bug in UTF8-Validation Code?
List pgsql-hackers
Added to TODO:
* Fix cases where invalid byte encodings are accepted by the database,  but throw an error on SELECT
http://archives.postgresql.org/pgsql-hackers/2007-03/msg00767.php

Is anyone working on fixing this bug?

---------------------------------------------------------------------------

Mario Weilguni wrote:
> Hi,
> 
> I've a problem with a database, I can dump the database to a file, but 
> restoration fails, happens with 8.1.4.
> 
> Steps to reproduce:
> create database testdb with encoding='UTF8';
> \c testdb
> create table test(x text);
> insert into test values ('\244'); ==> Is akzepted, even if not UTF8.
> 
> pg_dump testdb -f testdb.dump -Fc
> pg_restore -f testdb.dump -d testdb => fails with an error: 
> ERROR:  invalid byte sequence for encoding "UTF8": 0xa4
> 
> The problem itself comes from a CSV file, which is imported with \copy without 
> proper quoting (so I have to fix this anyway), but I still think this is an 
> error, making restoration very complicated in such cases...
> 
> Or am I doing something completly wrong here?
> 
> Best regards,
> Mario Weilguni
> 
> 
> ---------------------------(end of broadcast)---------------------------
> TIP 9: In versions below 8.0, the planner will ignore your desire to
>        choose an index scan if your joining column's datatypes do not
>        match

--  Bruce Momjian  <bruce@momjian.us>          http://momjian.us EnterpriseDB
http://www.enterprisedb.com
 + If your life is a hard drive, Christ can be your backup. +


pgsql-hackers by date:

Previous
From: Bruce Momjian
Date:
Subject: Re: [RFC] CLUSTER VERBOSE
Next
From: Peter Eisentraut
Date:
Subject: Re: [PATCHES] xpath_array with namespaces support