Bug in UTF8-Validation Code? - Mailing list pgsql-hackers

From Mario Weilguni
Subject Bug in UTF8-Validation Code?
Date
Msg-id 200703131200.58918.mweilguni@sime.com
Responses Re: Bug in UTF8-Validation Code?  (Jeff Davis <pgsql@j-davis.com>)
Re: Bug in UTF8-Validation Code?  (Bruce Momjian <bruce@momjian.us>)
List pgsql-hackers
Hi,

I have a problem with a database: I can dump it to a file, but restoring
the dump fails. This happens with 8.1.4.

Steps to reproduce:
create database testdb with encoding='UTF8';
\c testdb
create table test(x text);
insert into test values ('\244'); ==> Is accepted, even though it is not valid UTF8.

pg_dump testdb -f testdb.dump -Fc
pg_restore -d testdb testdb.dump => fails with an error:
ERROR:  invalid byte sequence for encoding "UTF8": 0xa4
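
For reference, the octal escape '\244' is the single byte 0xa4. In UTF-8 that
bit pattern (10xxxxxx) is a continuation byte, which is only valid after a
lead byte, so a strict decoder rejects it on its own. A minimal Python sketch
of that check (the helper name is_valid_utf8 is illustrative only and is not
part of PostgreSQL):

```python
def is_valid_utf8(data):
    """Return True if data decodes as strict UTF-8, False otherwise."""
    try:
        data.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


byte = bytes([0o244])          # the octal escape \244 from the INSERT above
print(hex(byte[0]))            # 0xa4
print(byte[0] & 0xC0 == 0x80)  # True: 10xxxxxx, a continuation byte
print(is_valid_utf8(byte))     # False: no lead byte precedes it
```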

The bad data originally came from a CSV file that was imported with \copy
without proper quoting (so I have to fix that anyway), but I still think this
is a bug: the server accepts a value on input that it later rejects on
restore, which makes restoration very complicated in such cases.
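
As a stopgap on the client side, the CSV data could be checked for invalid
UTF-8 before it is ever passed to \copy. The following is only a sketch under
that assumption; find_invalid_utf8_lines is a hypothetical helper and the
sample data is made up for illustration:

```python
def find_invalid_utf8_lines(data):
    """Return the 1-based numbers of lines that are not valid UTF-8.

    Splitting on b"\n" is safe here because the byte 0x0a never occurs
    inside a multi-byte UTF-8 sequence.
    """
    bad = []
    for lineno, line in enumerate(data.split(b"\n"), start=1):
        try:
            line.decode("utf-8")
        except UnicodeDecodeError:
            bad.append(lineno)
    return bad


# Made-up sample: the third line contains the stray byte 0xa4.
sample = b"id,name\n1,ok\n2,\xa4\n"
print(find_invalid_utf8_lines(sample))  # [3]
```

For a real file, the bytes would be read in binary mode, for example
open(path, "rb").read(), where path is whatever CSV file is being imported.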

Or am I doing something completely wrong here?

Best regards,
Mario Weilguni


