Re: New "raw" COPY format - Mailing list pgsql-hackers

From Tatsuo Ishii
Subject Re: New "raw" COPY format
Date
Msg-id 20241013.185204.1992992241505690866.ishii@postgresql.org
Whole thread Raw
Responses Re: New "raw" COPY format
List pgsql-hackers
> Hi hackers,
> 
> This thread is about implementing a new "raw" COPY format.
> 
> This idea came up in a different thread [1], moved here.
> 
> [1] https://postgr.es/m/47b5c6a7-5c0e-40aa-8ea2-c7b95ccf296f%40app.fastmail.com
> 
> The main use-case for the raw format, is when needing to import arbitrary
> unstructured text files, such as log files, into a single text column
> of a table.

After copy imported the "unstructured text file" in "row" COPY format,
what the column type is?  text? or bytea? If it's text, how do you
handle encoding conversion if the "unstructured text file" is encoded
in server side unsafe encoding such as SJIS?

>  All characters are taken literally.
>  There is no special handling for quotes, backslashes, or escape sequences.

If SJIS text is imported "literally" (i.e. no encoding conversion), it
should be rejected.

Best reagards,
--
Tatsuo Ishii
SRA OSS K.K.
English: http://www.sraoss.co.jp/index_en/
Japanese:http://www.sraoss.co.jp



pgsql-hackers by date:

Previous
From: Noah Misch
Date:
Subject: Re: Inval reliability, especially for inplace updates
Next
From: "Joel Jacobson"
Date:
Subject: Re: New "raw" COPY format