Thread: cross-db queries (was Are we losing momentum?)

cross-db queries (was Are we losing momentum?)

From

Rob Butler

Date:

16 April 2003, 10:10:29

Hello all,

I took a look at the docs for Postgres views & rules.  I also took a look at dblink.  I think there is a very plausible
wayto get "cross-db" queries in postgres
 

1) Support for 2PC added.
2) Modify dblink (if it needs modification) to operate within the current transaction created by the user (if a user
transactionwas created).  If no transaction exists (was created by the user) but local and remote tables / views are
updatedby the SQL statement then postgres creates a transaction to operate within.  That way both local and remote
tables/ views are updated as a single atomic unit.  (Which is what the user would expect to happen.)
 
3) Add a new "keyword" to Postgres.  Choose any of the following:a) create remoteview ... (all necessary information
aboutremote machine and query)b) create view [local | remote] .... (necessary info)c) create view
remote-server.schema.table(necessary info)
 

Postgres could then create a read only view of the remote table using dblink.  A user could add additional rules to
allowfor the view to be updateable just as they do now.
 

Also, as an additional feature for postgres we could simplify creating updateable views for "simple" queries.  I.E. 
create view [updateable] (necessary info / simple select statement).  If a simple select statement is used, and
updateableis specified, postgres creates all the rules to implement an updateable view.  This is not necessary of
course,it would just simply make the common (?) case of creating an updateable view from a simple query.
 

The support for 2PC and modification of dblink is only necessary IF we want to be able to update tables on the remote
serverAND have guarantees that the updates on both ends succeeds / fails as a unit.  If we want read only access to
remotetables, then one could create a view (using a custom on select rule) and use dblink right now!
 

What do you think?

Later
Rob

Re: cross-db queries (was Are we losing momentum?)

From

Shridhar Daithankar

Date:

16 April 2003, 10:25:40

On Wednesday 16 April 2003 19:40, Rob Butler wrote:
>     a) create remoteview ... (all necessary information about remote machine
> and query) b) create view [local | remote] .... (necessary info)
>     c) create view remote-server.schema.table (necessary info)

That is not a good way. Oracle does/recommends a create synonym so that either 
a remote view or table can be treated as if it is a loacl table/view. This 
takes care of select/insert/update/delete etc. I think that is a better way 
of integrating remote objects in current database. 

Of course, if there are some standards regarding remote database object, they 
will take precedence over oracle style of syntax. But personally I  think 
oracle syntax is good enough. (I just check sql 92 syntax and it does have 
provision for remote database access)
Shridhar

Re: cross-db queries (was Are we losing momentum?)

From

Rob Butler

Date:

16 April 2003, 11:31:03

> 
> On Wednesday 16 April 2003 19:40, Rob Butler wrote:
> >     a) create remoteview ... (all necessary information about remote machine
> > and query) b) create view [local | remote] .... (necessary info)
> >     c) create view remote-server.schema.table (necessary info)
> 
> That is not a good way. Oracle does/recommends a create synonym so that either 
> a remote view or table can be treated as if it is a loacl table/view. This 
> takes care of select/insert/update/delete etc. I think that is a better way 
> of integrating remote objects in current database. 
> 
It may not be the "best" solution, but it is one that is possible to use now (if you don't want atomic remote updates)
orcould be done with atomic remote updates relatively soon.
 

It is similar to the way MS-SQL works.  And, once setup the local and remote tables all look the same to the client
application. I think that is important.
 

Later
Rob

Re: cross-db queries (was Are we losing momentum?)

From

Joe Conway

Date:

16 April 2003, 12:22:09

Rob Butler wrote:
> 
> What do you think?
> 

See my last post to Shridhar -- the answer is to follow the existing 
spec covering external data access. Search the archives for SQL-MED. It 
is a non-trivial undertaking which is probably why it has not been 
undertaken yet ;-)

Joe

Re: cross-db queries (was Are we losing momentum?)

From

Darko Prenosil

Date:

16 April 2003, 12:23:30

On Wednesday 16 April 2003 15:31, Rob Butler wrote:
> > On Wednesday 16 April 2003 19:40, Rob Butler wrote:
> > >     a) create remoteview ... (all necessary information about remote
> > > machine and query) b) create view [local | remote] .... (necessary
> > > info) c) create view remote-server.schema.table (necessary info)
> >
> > That is not a good way. Oracle does/recommends a create synonym so that
> > either a remote view or table can be treated as if it is a loacl
> > table/view. This takes care of select/insert/update/delete etc. I think
> > that is a better way of integrating remote objects in current database.
>
> It may not be the "best" solution, but it is one that is possible to use
> now (if you don't want atomic remote updates) or could be done with atomic
> remote updates relatively soon.
>
> It is similar to the way MS-SQL works.  And, once setup the local and
> remote tables all look the same to the client application.  I think that is
> important.
>
...Except if you need only few records from remote database(on ISDN link for
example) and remote view (dblink) first selects all the records from remote,
and after that WHERE clause is executed on prepared result.
I used dblink a lot, and the only way to avoid this was to create
function(with parameters) that executes dblink , or to create view that
sends the original query to the host (Which is why Joe added
dblink_current_query() function to dblink at first place ). First way has limitations because You can't add rewrite
rulefor the  
function(or at last I newer succeed with that).The second way has very bad limitation because you always must:SELECT *
fromremoteView(all the fields), otherwise you broke view  
definition, and you can't for example SELECT count(*) FROM remoteView.

Unfortunately any other way ends up with first selecting *ALL* records from
host ! If there is no such limitation I'll be pretty satisfied with dblink,
and will newer ask for "cross-db-queries" again !!!

P.S.: Sorry for bad English !

Re: cross-db queries (was Are we losing momentum?)

From

Rob Butler

Date:

16 April 2003, 12:39:56

>     ...Except if you need only few records from remote database(on ISDN link for 
> example) and remote view (dblink) first selects all the records from remote, 
> and after that WHERE clause is executed on prepared result. 

You are absolutely right.  Hadn't considered that situation.  

Later
Rob

Re: cross-db queries (was Are we losing momentum?)

From

Joe Conway

Date:

16 April 2003, 12:40:46

Darko Prenosil wrote:
> 
> Unfortunately any other way ends up with first selecting *ALL* records from 
> host ! If there is no such limitation I'll be pretty satisfied with dblink, 
> and will newer ask for "cross-db-queries" again !!!
> 

Yeah, this is why a proper implementation following the spec is needed. 
If the external access was part of the backend, then the planner could 
be taught to push down qualifiers to the external source where 
appropriate (I think -- maybe Tom will comment on this).

Joe

Re: cross-db queries (was Are we losing momentum?)

From

Tom Lane

Date:

16 April 2003, 13:35:58

Joe Conway <mail@joeconway.com> writes:
> Darko Prenosil wrote:
>> Unfortunately any other way ends up with first selecting *ALL* records from 
>> host ! If there is no such limitation I'll be pretty satisfied with dblink, 
>> and will newer ask for "cross-db-queries" again !!!

> Yeah, this is why a proper implementation following the spec is needed. 
> If the external access was part of the backend, then the planner could 
> be taught to push down qualifiers to the external source where 
> appropriate (I think -- maybe Tom will comment on this).

Yes, the newer version of SQL-MED has APIs that allow this sort of thing
to be done.  Of course, it's another huge chunk of work beyond basic
SQL-MED ... but at least the roadmap is there, and when we get to the
end of the road we might even find other DBMSes that can speak the same
language.  If we invent our own spec the chance of handling cross-DBMS
queries intelligently is nil :-(
        regards, tom lane