Hadoop backend? - Mailing list pgsql-hackers

From Paul Sheer
Subject Hadoop backend?
Date
Msg-id c67e3dc60902211217p66906a35pe2cabe2c832e7b2d@mail.gmail.com
Whole thread Raw
Responses Re: Hadoop backend?  (pi song <pi.songs@gmail.com>)
Re: Hadoop backend?  (Josh Berkus <josh@agliodbs.com>)
Re: Hadoop backend?  (Ron Mayer <rm_pg@cheapcomplexdevices.com>)
List pgsql-hackers
Hadoop backend for PostGreSQL....

A problem that my client has, and one that I come across often,
is that a database seems to always be associated with a particular
physical machine, a physical machine that has to be upgraded,
replaced, or otherwise maintained.

Even if the database is replicated, it just means there are two or
more machines. Replication is also a difficult thing to properly
manage.

With a distributed data store, the data would become a logical
object - no adding or removal of machines would affect the data.
This is an ideal that would remove a tremendous maintenance
burden from many sites ---- well, at least the one's I have worked
at as far as I can see.

Does anyone know of plans to implement PostGreSQL over Hadoop?

Yahoo seems to be doing this:     http://glinden.blogspot.com/2008/05/yahoo-builds-two-petabyte-postgresql.html

But they store tables column-ways for their performance situation.
If one is doing a lot of inserts I don't think this is most efficient - ?

Has Yahoo put the source code for their work online?

Many thanks for any pointers.

-paul


pgsql-hackers by date:

Previous
From: Tom Lane
Date:
Subject: Okay to change TypeCreate() signature in back branches?
Next
From: pi song
Date:
Subject: Re: Hadoop backend?