Re: \COPY to accept non UTF-8 chars in CHAR columns - Mailing list pgsql-general

From Andrew Gierth
Subject Re: \COPY to accept non UTF-8 chars in CHAR columns
Date
Msg-id 87k135a2hd.fsf@news-spur.riddles.org.uk
In response to Re: \COPY to accept non UTF-8 chars in CHAR columns  (Thomas Munro <thomas.munro@gmail.com>)
Responses Re: \COPY to accept non UTF-8 chars in CHAR columns
List pgsql-general
>>>>> "Thomas" == Thomas Munro <thomas.munro@gmail.com> writes:

 Thomas> Something like this approach might be useful for fixing the CSV file:

 Thomas> https://codereview.stackexchange.com/questions/185821/convert-a-mix-of-latin-1-and-utf-8-to-proper-utf-8

Or:

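perl -MEncode -pe '
 use bytes;
 # Decode as UTF-8; each invalid byte goes to the fallback, which reads it as windows-1252.
 sub c { decode("UTF-8",shift,sub { decode("windows-1252", chr(shift)) }); }
 # Re-encode every run of non-ASCII bytes as clean UTF-8.
 s/([\x80-\xFF]+)/encode("UTF-8",c($1))/eg' <infile >outfile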
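
For anyone without Perl to hand, roughly the same idea can be sketched in Python: decode as UTF-8 and let an error handler reinterpret whatever is not valid UTF-8 as Windows-1252. This is only a sketch under a few assumptions. The names fix_mixed and cp1252_fallback are made up for illustration, and bytes that Python's cp1252 codec leaves undefined (0x81, for example) are turned into U+FFFD here, which may not match what the Perl command does with them.

```python
import codecs


def _cp1252_fallback(err):
    # Invoked by the UTF-8 decoder only for byte sequences that are not valid UTF-8.
    if not isinstance(err, UnicodeDecodeError):
        raise err
    bad = err.object[err.start:err.end]
    # Assumption: bytes undefined in Python's cp1252 codec become U+FFFD.
    return bad.decode("cp1252", errors="replace"), err.end


# "cp1252_fallback" is a hypothetical handler name chosen for this sketch.
codecs.register_error("cp1252_fallback", _cp1252_fallback)


def fix_mixed(data: bytes) -> bytes:
    """Return data as UTF-8, reading any invalid bytes as Windows-1252."""
    text = data.decode("utf-8", errors="cp1252_fallback")
    return text.encode("utf-8")


if __name__ == "__main__":
    import sys

    # Filter stdin to stdout, line by line, like the perl -p loop.
    for line in sys.stdin.buffer:
        sys.stdout.buffer.write(fix_mixed(line))
```

Saved as, say, fixenc.py (a hypothetical file name), it would be run as: python3 fixenc.py <infile >outfile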

-- 
Andrew (irc:RhodiumToad)


