| From: | "Worky Workerson" <worky(dot)workerson(at)gmail(dot)com> | 
|---|---|
| To: | "Luke Lonergan" <llonergan(at)greenplum(dot)com> | 
| Cc: | "Merlin Moncure" <mmoncure(at)gmail(dot)com>, pgsql-performance(at)postgresql(dot)org | 
| Subject: | Re: Best COPY Performance | 
| Date: | 2006-10-31 21:13:59 | 
| Message-ID: | ce4072df0610311313w6bee6b7cwf04796d838a0766a@mail.gmail.com | 
| Views: | Whole Thread | Raw Message | Download mbox | Resend email | 
| Thread: | |
| Lists: | pgsql-performance | 
> >>>> 1 0 345732 29304 770272 12946764  0  0 16 16428 1192 3105 12  2 85  1
> >>>> 1 0 345732 30840 770060 12945480  0  0 20 16456 1196 3151 12  2 84  1
> >>>> 1 0 345732 32760 769972 12943528  0  0 12 16460 1185 3103 11  2 86  1
> >>
> >> iirc, he is running quad opteron 885 (8 cores), so if my math is
> >> correct he can split up his process for an easy gain.
> >
> > Are you saying that I should be able to issue multiple COPY commands
> > because my I/O wait is low?  I was under the impression that I am I/O
> > bound, so multiple simeoultaneous loads would have a detrimental
> > effect ...
>
> The reason I asked how many CPUs was to make sense of the 12% usr CPU time
> in the above.  That means you are CPU bound and are fully using one CPU.  So
> you aren't being limited by the I/O in this case, it's the CPU.
... snip ...
> For now, you could simply split the file in two pieces and load two copies
> at once, then watch the same "vmstat 1" for 10 seconds and look at your "bo"
> rate.
Significantly higher on average, and a parallel loads were ~30% faster
that a single with index builds (240s vs 340s) and about ~45% (150s vs
230s) without the PK index.  I'll definitely look into the bizgres
java loader.
Thanks!
| From | Date | Subject | |
|---|---|---|---|
| Next Message | Ivan Voras | 2006-10-31 21:55:40 | MVCC & indexes? | 
| Previous Message | Teemu Torma | 2006-10-31 21:03:57 | Re: Best COPY Performance |