Re: Differences in UTF8 between 8.0 and 8.1 - Mailing list pgsql-hackers

From Christopher Kings-Lynne
Subject Re: Differences in UTF8 between 8.0 and 8.1
Date
Msg-id 4360323C.2020701@familyhealth.com.au
Whole thread Raw
In response to Re: Differences in UTF8 between 8.0 and 8.1  (Paul Lindner <lindner@inuus.com>)
Responses Re: Differences in UTF8 between 8.0 and 8.1
List pgsql-hackers
> However I'm running into another problem now.  The command:
> 
>   iconv -c -f UTF8 -t UTF8 
> 
> does strip out the invalid characters.  However, iconv reads the
> entire file into memory before it writes out any data.  This is not so
> good for multi-gigabyte dump files and doesn't allow for it to be used
> in a pipe between pg_dump and psql.
> 
> Anyone have any other recommendations?  GNU recode might do it, but
> I'm a bit stymied by the syntax.  A quick perl script using
> Text::Iconv didn't work either.  I'm off to look at some other perl
> modules and will try to create a script so I can strip out the invalid
> characters.

recode UTF-8..UTF-8 < dump_in.sql > dump_out.sql

Chris



pgsql-hackers by date:

Previous
From: Andrej Ricnik-Bay
Date:
Subject: Re: Differences in UTF8 between 8.0 and 8.1
Next
From: Bruce Momjian
Date:
Subject: Re: Call for port reports