I would like to alter the COPY command

From: Mason <freemason(at)gmail(dot)com>
To: pgsql-sql(at)postgresql(dot)org
Subject: I would like to alter the COPY command
Date: 2006-12-24 03:08:24
Message-ID: 523e84030612231908s32ef5145s4e18582b4a579d07@mail.gmail.com
Lists: pgsql-patches pgsql-sql

What I have is data that uses two different characters for "start
quote" and "end quote". In my case they are '[' and ']', but they
could be anything: "smart quotes", parentheses, brackets, braces, ^/$
in regexps, etc. I don't think this is an unreasonable feature to ask
for; it would make COPY more useful when importing data that is
difficult to transform properly beforehand (in my case it is about
half a terabyte of log files, which takes hours and hours just to cat,
let alone reparse and dump into COPY).

Now, in my case I can just say "cat file | tr '[]' '""' | psql -f
import.sql", but then psql loses the chance to do anything smart with
the file, like using mmap (I'm only assuming it does something like
that, but even if it doesn't now, it could some day).
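
For reference, here is a rough sketch of what that preprocessing step
amounts to if it is done a little more carefully than tr. It assumes
the target is COPY's CSV mode with '"' as the quote character, where an
embedded quote is escaped by doubling it. The names requote and
convert_stream are made up for illustration and are not part of psql or
PostgreSQL; nested brackets and quotes outside a bracketed field are
not handled.

```python
import sys


def requote(line, start="[", end="]", quote='"'):
    """Rewrite start/end-delimited fields as quote-delimited fields.

    A quote character found inside a delimited field is doubled, which
    is the escaping CSV mode expects. Text outside delimited fields is
    copied through unchanged. Nesting is not supported: a start
    character seen while already inside a field is kept as data.
    """
    out = []
    inside = False
    for ch in line:
        if not inside and ch == start:
            inside = True
            out.append(quote)          # opening bracket becomes a quote
        elif inside and ch == end:
            inside = False
            out.append(quote)          # closing bracket becomes a quote
        elif inside and ch == quote:
            out.append(quote + quote)  # escape an embedded quote
        else:
            out.append(ch)
    return "".join(out)


def convert_stream(src, dst):
    """Apply requote to every line of src and write the result to dst."""
    for line in src:
        dst.write(requote(line))


# To use this as a filter in the pipeline above, one would call:
#     convert_stream(sys.stdin, sys.stdout)
```

This still costs a full extra pass over the data, which is exactly the
overhead a native start-quote/end-quote option in COPY would avoid.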

I'm a passable C/C++ programmer when I have to be, so theoretically I
can do the work myself, but I have never touched the PostgreSQL source
before and don't know where to begin. Any ideas on how to add this?
