Quick Links

Hadoop backend?

From:	Paul Sheer <paulsheer(at)gmail(dot)com>
To:	pgsql-hackers(at)postgresql(dot)org
Subject:	Hadoop backend?
Date:	2009-02-21 20:17:30
Message-ID:	c67e3dc60902211217p66906a35pe2cabe2c832e7b2d@mail.gmail.com
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

Hadoop backend for PostGreSQL....

A problem that my client has, and one that I come across often,
is that a database seems to always be associated with a particular
physical machine, a physical machine that has to be upgraded,
replaced, or otherwise maintained.

Even if the database is replicated, it just means there are two or
more machines. Replication is also a difficult thing to properly
manage.

With a distributed data store, the data would become a logical
object - no adding or removal of machines would affect the data.
This is an ideal that would remove a tremendous maintenance
burden from many sites ---- well, at least the one's I have worked
at as far as I can see.

Does anyone know of plans to implement PostGreSQL over Hadoop?

Yahoo seems to be doing this:
http://glinden.blogspot.com/2008/05/yahoo-builds-two-petabyte-postgresql.html

But they store tables column-ways for their performance situation.
If one is doing a lot of inserts I don't think this is most efficient - ?

Has Yahoo put the source code for their work online?

Many thanks for any pointers.

-paul

Responses

Re: Hadoop backend? at 2009-02-22 02:37:29 from pi song
Re: Hadoop backend? at 2009-02-24 19:30:12 from Josh Berkus
Re: Hadoop backend? at 2009-07-22 03:29:22 from Ron Mayer

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	pi song	2009-02-22 02:37:29	Re: Hadoop backend?
Previous Message	Tom Lane	2009-02-21 18:46:07	Okay to change TypeCreate() signature in back branches?