Re: New "raw" COPY format

From: Tatsuo Ishii <ishii(at)postgresql(dot)org>
To: joel(at)compiler(dot)org
Cc: pgsql-hackers(at)lists(dot)postgresql(dot)org
Subject: Re: New "raw" COPY format
Date: 2024-10-13 09:52:04
Message-ID: 20241013.185204.1992992241505690866.ishii@postgresql.org
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-hackers

> Hi hackers,
>
> This thread is about implementing a new "raw" COPY format.
>
> This idea came up in a different thread [1], moved here.
>
> [1] https://postgr.es/m/47b5c6a7-5c0e-40aa-8ea2-c7b95ccf296f%40app.fastmail.com
>
> The main use-case for the raw format, is when needing to import arbitrary
> unstructured text files, such as log files, into a single text column
> of a table.

After copy imported the "unstructured text file" in "row" COPY format,
what the column type is? text? or bytea? If it's text, how do you
handle encoding conversion if the "unstructured text file" is encoded
in server side unsafe encoding such as SJIS?

> All characters are taken literally.
> There is no special handling for quotes, backslashes, or escape sequences.

If SJIS text is imported "literally" (i.e. no encoding conversion), it
should be rejected.

Best reagards,
--
Tatsuo Ishii
SRA OSS K.K.
English: http://www.sraoss.co.jp/index_en/
Japanese:http://www.sraoss.co.jp

In response to

Responses

Browse pgsql-hackers by date

  From Date Subject
Next Message Alexander Korotkov 2024-10-13 12:08:38 Re: type cache cleanup improvements
Previous Message Paul A Jungwirth 2024-10-13 05:25:40 Re: SQL:2011 application time