Re: Hadoop backend? - Mailing list pgsql-hackers

From Josh Berkus
Subject Re: Hadoop backend?
Date
Msg-id 49A44AC4.8040002@agliodbs.com
Whole thread Raw
In response to Hadoop backend?  (Paul Sheer <paulsheer@gmail.com>)
List pgsql-hackers
> With a distributed data store, the data would become a logical
> object - no adding or removal of machines would affect the data.
> This is an ideal that would remove a tremendous maintenance
> burden from many sites ---- well, at least the one's I have worked
> at as far as I can see.

Two things:

1) Hadoop is the wrong technology.  It's not designed to support 
transactional operations.

2) Transactional operations are, in general, your Big Obstacle for doing 
anything in the way of a distributed storage manager.

It's possible you could make both of the above "go away" if you were 
planning for a DW platform in which transactions weren't important. 
However, that would have to become an incompatible fork of PostgreSQL.

AFAIK, the Yahoo platform does not involve Hadoop at all.

--Josh



pgsql-hackers by date:

Previous
From: Jeff Davis
Date:
Subject: Re: GIN fast insert
Next
From: "Joshua D. Drake"
Date:
Subject: Re: Synchronous replication & Hot standby patches