From: | Mark Moellering <markmoellering(at)psyberation(dot)com> |
---|---|
To: | |
Cc: | Postgres General <pgsql-general(at)postgresql(dot)org> |
Subject: | Re: best way to write large data-streams quickly? |
Date: | 2018-04-10 14:17:43 |
Message-ID: | CAA0uU3W4Loyv3Bubo_m9TVp80W0odnq1euthH6X+rjgSKWiSjw@mail.gmail.com |
Views: | Raw Message | Whole Thread | Download mbox | Resend email |
Thread: | |
Lists: | pgsql-general |
On Mon, Apr 9, 2018 at 12:01 PM, Steve Atkins <steve(at)blighty(dot)com> wrote:
>
> > On Apr 9, 2018, at 8:49 AM, Mark Moellering <markmoellering(at)psyberation(dot)
> com> wrote:
> >
> > Everyone,
> >
> > We are trying to architect a new system, which will have to take several
> large datastreams (total of ~200,000 parsed files per second) and place
> them in a database. I am trying to figure out the best way to import that
> sort of data into Postgres.
> >
> > I keep thinking i can't be the first to have this problem and there are
> common solutions but I can't find any. Does anyone know of some sort
> method, third party program, etc, that can accept data from a number of
> different sources, and push it into Postgres as fast as possible?
>
> Take a look at http://ossc-db.github.io/pg_bulkload/index.html. Check the
> benchmarks for different situations compared to COPY.
>
> Depending on what you're doing using custom code to parse your data and
> then do multiple binary COPYs in parallel may be better.
>
> Cheers,
> Steve
>
>
>
(fighting google slightly to keep from top-posting...)
Thanks!
How long can you run COPY? I have been looking at it more closely. In
some ways, it would be simple just to take data from stdin and send it to
postgres but can I do that literally 24/7? I am monitoring data feeds that
will never stop and I don't know if that is how Copy is meant to be used or
if I have to let it finish and start another one at some point?
Thanks for everyones' help and input!
Mark Moellering
From | Date | Subject | |
---|---|---|---|
Next Message | Achilleas Mantzios | 2018-04-10 14:20:43 | Re: Postgresql Split Brain: Which one is latest |
Previous Message | Vikas Sharma | 2018-04-10 13:50:06 | Postgresql Split Brain: Which one is latest |