From: | Paul Sheer <paulsheer(at)gmail(dot)com> |
---|---|
To: | pgsql-hackers(at)postgresql(dot)org |
Subject: | Hadoop backend? |
Date: | 2009-02-21 20:17:30 |
Message-ID: | c67e3dc60902211217p66906a35pe2cabe2c832e7b2d@mail.gmail.com |
Views: | Raw Message | Whole Thread | Download mbox | Resend email |
Thread: | |
Lists: | pgsql-hackers |
Hadoop backend for PostGreSQL....
A problem that my client has, and one that I come across often,
is that a database seems to always be associated with a particular
physical machine, a physical machine that has to be upgraded,
replaced, or otherwise maintained.
Even if the database is replicated, it just means there are two or
more machines. Replication is also a difficult thing to properly
manage.
With a distributed data store, the data would become a logical
object - no adding or removal of machines would affect the data.
This is an ideal that would remove a tremendous maintenance
burden from many sites ---- well, at least the one's I have worked
at as far as I can see.
Does anyone know of plans to implement PostGreSQL over Hadoop?
Yahoo seems to be doing this:
http://glinden.blogspot.com/2008/05/yahoo-builds-two-petabyte-postgresql.html
But they store tables column-ways for their performance situation.
If one is doing a lot of inserts I don't think this is most efficient - ?
Has Yahoo put the source code for their work online?
Many thanks for any pointers.
-paul
From | Date | Subject | |
---|---|---|---|
Next Message | pi song | 2009-02-22 02:37:29 | Re: Hadoop backend? |
Previous Message | Tom Lane | 2009-02-21 18:46:07 | Okay to change TypeCreate() signature in back branches? |