Re: COPY from STDIN vs file with large CSVs

From:	Wells Oliver <wells(dot)oliver(at)gmail(dot)com>
To:	Ron <ronljohnsonjr(at)gmail(dot)com>
Cc:	pgsql-admin(at)lists(dot)postgresql(dot)org
Subject:	Re: COPY from STDIN vs file with large CSVs
Date:	2020-01-08 17:50:14
Message-ID:	CAOC+FBVp=DA0++VT9VaF0azd1_w4GcT2WP6zFNsZ+X0tMzfRug@mail.gmail.com
Lists:	pgsql-admin

Thanks, I had looked into pg_bulkload a bit but it does not seem to be
available for PG 12? It's not in the extension directory, it's not
available through apt-cache search, and I have all kinds of issues with it
finding pgcommon and pgport when I try to build from source. Using Ubuntu
18 LTS...

On Wed, Jan 8, 2020 at 9:09 AM Ron <ronljohnsonjr(at)gmail(dot)com> wrote:

> On 1/8/20 10:54 AM, Wells Oliver wrote:
> > I have a CSV that's ~30GB. Some 400m rows. Would there be a meaningful
> > performance difference to run COPY from STDIN using: cat f.csv | psql
> > "COPY .. FROM STDIN WITH CSV" versus just doing "COPY ... FROM 'f.csv'
> > WITH CSV"?
> >
> > Thanks. It took about four hours to copy one and I felt that was a little
> > much.
>
> catting the file starts another process, and opens a pipe. That can't be
> faster than "COPY ... FROM ... WITH CSV".
>
> pg_bulkload (which might be in your repository) is probably what you really
> want.
>
> --
> Angular momentum makes the world go 'round.
>
>
>

--
Wells Oliver
wells(dot)oliver(at)gmail(dot)com <wellsoliver(at)gmail(dot)com>
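
For reference, the invocations compared in the quoted question can be written out side by side. The following is a minimal sketch that only builds the shell command lines and executes nothing. The table name big_table, the database name mydb, and the path /tmp/f.csv are hypothetical placeholders, since the thread elides them. The "redirect" variant is not from the thread; it is included because it feeds the same COPY ... FROM STDIN without the extra cat process Ron mentions. No claim is made here about which variant is faster.

```python
import shlex


def copy_variants(table, csv_path, dbname):
    """Build three shell command lines that would load csv_path into table.

    All arguments are hypothetical placeholders. Nothing is executed here.
    The table name is interpolated as-is, so it must already be a valid
    (and, if needed, quoted) SQL identifier.
    """
    q = shlex.quote
    stdin_sql = f"COPY {table} FROM STDIN WITH CSV"
    # In the server-side form the path is an SQL string literal,
    # so embedded single quotes are doubled.
    sql_path = csv_path.replace("'", "''")
    file_sql = f"COPY {table} FROM '{sql_path}' WITH CSV"
    return {
        # Client side: cat starts a second process and a pipe.
        "cat_pipe": f"cat {q(csv_path)} | psql -d {q(dbname)} -c {q(stdin_sql)}",
        # Client side: the shell opens the file as psql's stdin directly.
        "redirect": f"psql -d {q(dbname)} -c {q(stdin_sql)} < {q(csv_path)}",
        # Server side: the PostgreSQL server process reads the file itself.
        "server_file": f"psql -d {q(dbname)} -c {q(file_sql)}",
    }


commands = copy_variants("big_table", "/tmp/f.csv", "mydb")
for name, line in commands.items():
    print(f"{name}: {line}")
```

Note that the server-side form needs the file to be readable by the server process on the database host, and the role running it must be a superuser or a member of pg_read_server_files. The two STDIN forms read the file on the client and send it over the connection.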

In response to

Re: COPY from STDIN vs file with large CSVs at 2020-01-08 17:08:56 from Ron
