Quick Links

Re:Using PostgreSQL for Machine Learning Data Pipelines

From:	chris <yuanzefuwater(at)126(dot)com>
To:	"Pankaj Jangid" <pankaj(dot)jangid(at)gmail(dot)com>
Cc:	"Postgres General" <pgsql-general(at)postgresql(dot)org>
Subject:	Re:Using PostgreSQL for Machine Learning Data Pipelines
Date:	2019-10-18 01:20:57
Message-ID:	59caef3f.ab49.16ddc7411b2.Coremail.yuanzefuwater@126.com
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-general

Hi there,

There is a project named Apache MADlib, may help you.

http://madlib.apache.org

On 10/18/2019 02:04，Pankaj Jangid<pankaj(dot)jangid(at)gmail(dot)com> wrote：
Hi,

I am working on a machine-learning project. Because of the available
study material in the ML area, the team is inclined towards Apache
Kafka, Apache Spark for data-pipelines and analytics.

Our requirement is to store huge amounts of continuously increasing data
that cannot fit into a single machine. The algorithms require data in
batches so it is not necessary to keep full data ready for
consumption. Using Kafka, the data can be distributed and fetched in
varying batch sizes as and when required.

I am more comfortable with PostgreSQL. And wanted to know more about
case-studies where PostgreSQL is deployed for ML use. Any pointers
referring to study material will be helpful. Please share in this
thread.

--
Thanks & Regards,
Pankaj Jangid

In response to

Using PostgreSQL for Machine Learning Data Pipelines at 2019-10-17 18:04:43 from Pankaj Jangid

Browse pgsql-general by date

	From	Date	Subject
Next Message	M Tarkeshwar Rao	2019-10-18 03:43:38	Can you please tell us how set this prefetch attribute in following lines.
Previous Message	Ron	2019-10-17 23:55:53	Re: drop database