Re: New Copy Formats - avro/orc/parquet

From: Adrian Klaver <adrian(dot)klaver(at)aklaver(dot)com>
To: Nicolas Paris <niparisco(at)gmail(dot)com>, pgsql-general(at)postgresql(dot)org
Subject: Re: New Copy Formats - avro/orc/parquet
Date: 2018-02-10 15:25:18
Message-ID: 78a8f0bd-4b6f-b791-b6ce-80b19e2b6d1b@aklaver.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-general

On 02/10/2018 07:13 AM, Nicolas Paris wrote:
> Hello
>
> I d'found useful to be able to import/export from postgres to those modern data
> formats:
> - avro (c writer=https://avro.apache.org/docs/1.8.2/api/c/index.html)
> - parquet (c++ writer=https://github.com/apache/parquet-cpp)
> - orc (all writers=https://github.com/apache/orc)
>
> Something like :
> COPY table TO STDOUT ORC;
>
> Would be lovely.
>
> This would greatly enhance how postgres integrates in big-data ecosystem.
>
> Any thought ?

https://www.postgresql.org/docs/10/static/sql-copy.html

"PROGRAM

A command to execute. In COPY FROM, the input is read from standard
output of the command, and in COPY TO, the output is written to the
standard input of the command.

Note that the command is invoked by the shell, so if you need to
pass any arguments to shell command that come from an untrusted source,
you must be careful to strip or escape any special characters that might
have a special meaning for the shell. For security reasons, it is best
to use a fixed command string, or at least avoid passing any user input
in it.
"

>
> Thanks
>

--
Adrian Klaver
adrian(dot)klaver(at)aklaver(dot)com

In response to

Responses

Browse pgsql-general by date

  From Date Subject
Next Message Nicolas Paris 2018-02-10 15:30:11 Re: New Copy Formats - avro/orc/parquet
Previous Message Nicolas Paris 2018-02-10 15:13:04 New Copy Formats - avro/orc/parquet